RESEARCH AREA
AI Alignment
Technical approaches to aligning AI goals with human values.
500+ papers1 researchers
Key Papers
Constitutional AI: Harmlessness from AI Feedback
arXiv20222,000 citations
RESEARCH AREA
Technical approaches to aligning AI goals with human values.
Constitutional AI: Harmlessness from AI Feedback