Distinguish between inference scaling and…

Ryan Greenblatt

Feb 11

Most recent progress probably isn't from unsustainable inference scaling

3 Comments

(inference costs on the METR tasks)

Great post .

Shouldn't it be easy to check this?

Just look at whether the doubling time for inference costs is faster than the doubling time for horizon length.

Anders Cairns Woodruff

If you're interested in this question, I have a more detailed empirical analysis here:

https://blog.redwoodresearch.org/p/ais-capability-improvements-havent

#nojs-banner { position: fixed; bottom: 0; left: 0; padding: 16px 16px 16px 32px; width: 100%; box-sizing: border-box; background: red; color: white; font-family: -apple-system, "Segoe UI", Roboto, Helvetica, Arial, sans-serif, "Apple Color Emoji", "Segoe UI Emoji", "Segoe UI Symbol"; font-size: 13px; line-height: 13px; } #nojs-banner a { color: inherit; text-decoration: underline; } This site requires JavaScript to run correctly. Please turn on JavaScript or unblock scripts