Discussion about this post

User's avatar
Andy X Andersen's avatar

A neural net modifying its own weights is likely not going to work well for very large and very finely-tuned networks. It will also be immensely expensive.

Identifying a subset of weights that can be made tunable is likely going to be very task-dependent and won't generalize.

I think it makes more sense for an AI agent to work hard at inference time, when the solution is not known, and save the final streamlined path to the solution, the way people record their own eventual polished strategies.

That way the AI agent builds a vast trove of recipes it discovers that can later be either fetched dynamically when dealing with similar problems, or baked in a future large training run.

Expand full comment
3 more comments...

No posts

Ready for more?