Discussion about this post

Uncharitable Takes

Fire title

Jack Edwards

Seems like your overall thesis is somewhat contra the idea that we could see "substantial gains from lots of AI automation of RL environment construction," so I'm curious why you don't have that as a bigger part of your estimate. This strikes me as a plausible channel for recursive self-improvement/bootstrapping.

How confident are you that we're many years away from models autonomously building challenging agentic RL environments (e.g. writing most of the infrastructure code for GPT Plays Pokemon)?
