Redwood Research blog
Being honest with AIs
When and why we should refrain from lying
Aug 21 • Lukas Finnveden
My AGI timeline updates from GPT-5 (and 2025 so far)
AGI before 2029 now seems substantially less likely
Aug 20 • Ryan Greenblatt
Four places where you can put LLM monitoring
To wit: LLM APIs, agent scaffolds, code review, and detection-and-response systems
Aug 9 • Fabien Roger and Buck Shlegeris
July 2025
Should we update against seeing relatively fast AI progress in 2025 and 2026?
Maybe we should (re)assess the case for relatively fast progress after the GPT-5 release.
Jul 28 • Ryan Greenblatt
Why it's hard to make settings for high-stakes control research
It's like making challenging evals, but more constrained
Jul 18 • Buck Shlegeris
Recent Redwood Research project proposals
Empirical AI security/safety projects across a variety of areas
Jul 14 • Ryan Greenblatt, Buck Shlegeris, Julian Stastny, Josh Clymer, Alex Mallen, and Vivek Hebbar
What's worse, spies or schemers?
And what if you have both at once?
Jul 9 • Buck Shlegeris and Julian Stastny
Ryan on the 80,000 Hours podcast
Ryan’s podcast with Rob Wiblin has just come out!
Jul 8 • Buck Shlegeris
How much novel security-critical infrastructure do you need during the singularity?
And what does this mean for AI control?
Jul 5 • Buck Shlegeris
Two proposed projects on abstract analogies for scheming
We should study methods to train away deeply ingrained behaviors in LLMs that are structurally similar to scheming.
Jul 4 • Julian Stastny
There are two fundamentally different constraints on schemers
"They need to act aligned" often isn't precise enough
Jul 2 • Buck Shlegeris
June 2025
Jankily controlling superintelligence
How much time can control buy us during the intelligence explosion?
Jun 27 • Ryan Greenblatt