What is Constitutional AI?
AI FundamentalsLast updated:
An alignment approach where the model critiques and revises its own outputs based on a set of written principles.
Constitutional AI, developed by Anthropic, replaces some human feedback with AI self-evaluation against explicit principles. The model generates responses, evaluates them against its constitution, and revises them, reducing the need for human labeling.