ASAL · AI & ML
Mission & Focus
Ensuring that powerful AI systems reliably pursue their intended goals. Research spans reinforcement learning from human feedback (RLHF), constitutional methods, scalable oversight, and formal specification of human values.
Active Projects
Research Team
Recent Publications