New paper! Self-consistency is the unifying thread behind many exciting emerging training objectives: introspection, calibration, self-improvement, self-distillation, and more. We think it should be *the* organizing principle for the next generation of LM training.
08.03.2026 22:52 β
π 6
π 0
π¬ 0
π 0
New paper: It's time to optimize for π self-consistency π
Weβve pushed LLMs to the limits of available data, yet failures like sycophancy and factual inconsistency persist.
We argue these stem from the same assumption: that behavior can be specified one I/O pair at a time. π§΅
08.03.2026 22:37 β
π 6
π 1
π¬ 1
π 1
Thank you @belindazli.bsky.social for the great talk "Solving the Specification Problem through Interactionβ at our weekly seminar!
#NLProc
23.01.2026 16:26 β
π 9
π 3
π¬ 0
π 0