Abhinav Moudgil's Avatar

Abhinav Moudgil

@amoudgl.bsky.social

PhD student, Mila abhinavmoudgil.com

180 Followers  |  84 Following  |  3 Posts  |  Joined: 20.11.2024  |  1.2912

Latest posts by amoudgl.bsky.social on Bluesky

This tool is especially useful in cases when evaluations are expensive (e.g. LM harness eval) and you want to track model performance during training.

15.06.2025 22:27 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
GitHub - amoudgl/assayer: Python RQ watchdog to automatically evaluate ML model checkpoints offline during training Python RQ watchdog to automatically evaluate ML model checkpoints offline during training - amoudgl/assayer

github: github.com/amoudgl/assa...

15.06.2025 22:27 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

New side project!

assayer: A simple Python-RQ based tool to automatically monitor and evaluate ML model checkpoints offline during training.

15.06.2025 22:27 โ€” ๐Ÿ‘ 4    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
AuPair: Golden Example Pairs for Code Repair Scaling up inference-time compute has proven to be a valuable strategy in improving the performance of Large Language Models (LLMs) without fine-tuning. An important task that can benefit from additio...

Excited to share our recent work, AuPair, an inference-time technique that builds on the premise of in-context learning to improve LLM coding performance!
arxiv.org/abs/2502.18487

๐Ÿงต

17.03.2025 11:16 โ€” ๐Ÿ‘ 12    ๐Ÿ” 4    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 3

@amoudgl is following 18 prominent accounts