@nagpalchirag.bsky.social

3 Followers  |  4 Following  |  7 Posts  |  Joined: 11.09.2025  |  1.5893

Latest posts by nagpalchirag.bsky.social on Bluesky

Link: chiragnagpal.github.io/papers/llm_l...

16.10.2025 23:51 | 👍 0 | 🔁 0 | 💬 0 | 📌 0

4/ There's still a lot of opportunity for statistical thinking in the Large Language Model era!

There are ample estimation problems in Language Modeling and AI Alignment waiting to be solved using classic statistical tools!

16.10.2025 23:50 | 👍 2 | 🔁 0 | 💬 0 | 📌 0

3/ Through illustrative examples on toy data and simulations on a real-world chatbot-style dataset, I show how censoring-adjusted estimators yield better estimates of trajectory length distributions.

16.10.2025 23:50 | 👍 0 | 🔁 0 | 💬 1 | 📌 0

2/ In a new monograph, I show that this is a classical statistical estimation problem called Censoring.

I show that estimators like Kaplan-Meier, popular in epidemiology and biostatistics for analyzing patient mortality and survival, can be adapted to this problem of trajectory length estimation.
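
To make the idea concrete, here is a minimal sketch of a Kaplan-Meier estimate of generation length, written as an illustration rather than the monograph's actual implementation. It assumes each generation is recorded as an observed token count plus a flag saying whether [EOS] was reached; generations cut off by the context window count as censored. The function name `kaplan_meier` and the toy data (a hypothetical 8-token window) are assumptions made for this example.

```python
from collections import Counter

def kaplan_meier(lengths, reached_eos):
    """Return [(t, S(t))], where S(t) estimates P(generation length > t).

    lengths[i]     : observed token count of generation i
    reached_eos[i] : True if [EOS] was emitted, False if the generation
                     was truncated (censored) by the context window
    """
    events = Counter(t for t, e in zip(lengths, reached_eos) if e)
    exits = Counter(lengths)  # events and censorings both leave the risk set
    at_risk = len(lengths)
    survival, curve = 1.0, []
    for t in sorted(exits):
        d = events.get(t, 0)
        if d:
            # Multiply by the fraction of at-risk generations that did not end at t.
            survival *= 1.0 - d / at_risk
            curve.append((t, survival))
        at_risk -= exits[t]
    return curve

# Hypothetical toy data: three generations hit the 8-token window without [EOS].
lengths = [3, 5, 5, 8, 8, 8]
reached_eos = [True, True, True, False, False, False]
curve = kaplan_meier(lengths, reached_eos)
for t, s in curve:
    print(f"P(length > {t}) ~ {s:.3f}")
```

On this toy data the estimate keeps half of the probability mass beyond 5 tokens instead of treating the three truncated generations as if they had ended at exactly 8 tokens, which is what a naive average of observed lengths would do.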

16.10.2025 23:50 | 👍 0 | 🔁 0 | 💬 1 | 📌 0

LENGTH of generations from an LLM is an important heuristic used in post-training to understand model behavior.

BUT due to a FIXED SIZE CONTEXT WINDOW, a large number of trajectories get truncated before ever reaching the [EOS] token.

HOW THEN does one accurately estimate model generation length?

16.10.2025 23:50 | 👍 1 | 🔁 0 | 💬 1 | 📌 1

Direct preference optimization is Cox regression

16.09.2025 04:25 | 👍 1 | 🔁 0 | 💬 0 | 📌 0

Kernel regression is the only way to achieve AGI

11.09.2025 04:26 | 👍 2 | 🔁 0 | 💬 0 | 📌 1
