Check out our poster today at the M3L workshop at NeurIPS sites.google.com/view/m3l-202...
14.12.2024 17:48 β π 0 π 1 π¬ 0 π 0@maxsimchowitz.bsky.social
Check out our poster today at the M3L workshop at NeurIPS sites.google.com/view/m3l-202...
14.12.2024 17:48 β π 0 π 1 π¬ 0 π 0Can a language model improve itself without external verifier? We pose self-improvement as a computational challenge, and show how self-training might surmount it. Joint work with @djfoster.bsky.social and MSR.
Self-Improvement in Language Models: The Sharpening Mechanism
arxiv.org/abs/2412.01951