RESEARCH AREA

AI Alignment

Technical approaches to aligning AI goals with human values.

500+ papers1 researchers

Key Papers

Constitutional AI: Harmlessness from AI Feedback

arXiv20222,000 citations

Researchers

Dario Amodei

Anthropic