Discussion about this post

Pepamo

Fire title

Super Statistician

Seems like your overall thesis is somewhat contra the idea that we could see "substantial gains from lots of AI automation of RL environment construction," so I'm curious why you don't have that as a bigger part of your estimate. This strikes me as a plausible channel for recursive self-improvement/bootstrapping.

How confident are you that we're many years away from models autonomously building challenging agentic RL environments (e.g. writing most of the infrastructure code for GPT Plays Pokemon)?
