Peter Treyson

Could Redwood build a website for submitting alignment ideas, for those of us not in a major lab? I think many great ideas never hit paper. There's friction in writing, collecting sources, and fixing grammar. I'd love a site where anyone can submit ideas of any maturity, everything from a passing thought to a full blog post. They would then be stored in a DB that anyone can filter, download, etc. There would be a lot of noise, but filtering and de-duplication work could be fun projects. A couple of cool feature ideas for the site come to mind. First, an interview agent that probes you about your idea, maybe even with a "harsh critic, please defend yourself" setting. Second, an area for LLM-generated ideas. Some people will likely mislabel their LLM-generated ideas as human, but I think that's okay. The filtering, de-duplication, and prioritization projects can help handle that. Third, some cool front-end data widgets to explore connections, citations, and ideas. In the end, I'd love an area where automated alignment researchers and human researchers can sample ideas. I think the public internet captures some ideas, but it's hard to collate effectively, hard to sample from uniformly, and many, many ideas just never get posted due to friction.
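
To make the storage, filtering, and de-duplication part a bit more concrete, here is a minimal sketch in Python. Everything in it is an assumption for illustration, not something the comment specifies: the `Idea` record, its `maturity` labels, the `filter_ideas` and `deduplicate` helpers, and the choice of word-overlap (Jaccard) similarity with a 0.8 threshold as a stand-in for whatever de-duplication method a real project would use.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Idea:
    """Hypothetical record for one submission."""
    text: str
    maturity: str  # assumed labels, e.g. "thought", "draft", "post"
    llm_generated: bool = False  # self-reported, so it may be wrong


def _tokens(text):
    # Lowercased alphanumeric words, ignoring punctuation.
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def jaccard(a, b):
    """Word-set overlap between two texts, from 0.0 to 1.0."""
    ta, tb = _tokens(a), _tokens(b)
    if not ta and not tb:
        return 1.0
    return len(ta & tb) / len(ta | tb)


def deduplicate(ideas, threshold=0.8):
    """Keep the first of any group of near-identical ideas (naive O(n^2) pass)."""
    kept = []
    for idea in ideas:
        if all(jaccard(idea.text, k.text) < threshold for k in kept):
            kept.append(idea)
    return kept


def filter_ideas(ideas, maturity=None, llm_generated=None):
    """Filter by maturity and/or the self-reported LLM flag; None means 'any'."""
    return [
        i for i in ideas
        if (maturity is None or i.maturity == maturity)
        and (llm_generated is None or i.llm_generated == llm_generated)
    ]


ideas = [
    Idea("Use debate to elicit latent knowledge", "thought"),
    Idea("use debate to elicit latent knowledge!", "draft"),
    Idea("Train probes on activations", "post", llm_generated=True),
]
unique = deduplicate(ideas)
llm_only = filter_ideas(unique, llm_generated=True)
```

Word overlap only catches near-verbatim duplicates. Two submissions that express the same idea in different words would both survive, which is part of why the de-duplication work could be a project of its own.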
