Link: chiragnagpal.github.io/papers/llm_l...
16.10.2025 23:51 β π 0 π 0 π¬ 0 π 0@nagpalchirag.bsky.social
Link: chiragnagpal.github.io/papers/llm_l...
16.10.2025 23:51 β π 0 π 0 π¬ 0 π 04/ There's still a lot of opportunity for statistical thinking in the Large Language Model era!
There's ample estimation problems in Language Modeling and AI Alignment waiting to be solved using classic statistical tools!
3/ Through illustrative examples on a toy data and simulation on a real-word chatbot style dataset, I show how censoring adjusted estimators result in better estimates of trajectory length distributions.
16.10.2025 23:50 β π 0 π 0 π¬ 1 π 02/ In a new monograph, I show that is a classical statistical estimation problem called πΎππ£π¨π€π§ππ£π.
I show that estimators like the πππ₯π‘ππ£-πππππ§, popular in epidemiology and bio-statistics for analyzing patient mortality and survival can be adapted to this problem of trajectory length estimation.
ππππππ of generations from an πππ is an important heuristic used in post-training to understand model behavior.
π½ππ due to a πππππΏ ππππ πΎππππππ ππππΏππ, a large number of trajectories get truncated before ever reaching [ππ’π¦] token.
πππ ππππ does one accurately estimate model generation length ?
Direct preference optimization is cox regression
16.09.2025 04:25 β π 1 π 0 π¬ 0 π 0Kernel regression is the only way to achieve AGI
11.09.2025 04:26 β π 2 π 0 π¬ 0 π 1