The latest from
Buck Shlegeris
Announcing ControlConf 2026
User's avatar
Alex Mallen
Will reward-seekers respond to distant incentives?
User's avatar
Ryan Greenblatt
How do we (more) safely defer to AIs?
User's avatar
Julian Stastny
How do we (more) safely defer to AIs?
User's avatar
Josh Clymer
Will AI systems drift into misalignment?
User's avatar
Vivek Hebbar
Recent Redwood Research project proposals
User's avatar
Redwood Research blog
Redwood Research blog
We research catastrophic AI risks and techniques that could be used to mitigate them.
Recommendations
AI Futures Project
Daniel Kokotajlo
The Power Law
Peter Wildeford
ForeWord
Forethought