Saurav Jha's Avatar

Saurav Jha

@saurav-jha.bsky.social

www.sauravjha.com.np πŸ‡³πŸ‡΅ in πŸ‡¨πŸ‡¦ IVADO postdoc @mila-quebec.bsky.social Ex applied scientist #Openstream.ai; Ex Intern at #Tencent, #Sony, #Inria; PhD @unsw.edu.au πŸ‡¦πŸ‡Ί ; Ex MLE #Factset

109 Followers  |  101 Following  |  4 Posts  |  Joined: 17.11.2024
Posts Following

Posts by Saurav Jha (@saurav-jha.bsky.social)

Post image

Streaming Reinforcement Learning (RL) is a huge challenge: transitions are used once and discarded immediately. This makes agents extremely sample-inefficient. But what if we could "squeeze" more information out of every single frame?

Check out our latest paper!

24.02.2026 15:22 β€” πŸ‘ 2    πŸ” 3    πŸ’¬ 1    πŸ“Œ 1
Video thumbnail

New work, just accepted @ICLR: "The Expressive Limits of Diagonal SSMs for State-Tracking"
We give a complete characterization of what diagonal SSMs can and cannot compute on state-tracking tasks and the answer is deeply connected to group theory.
πŸ§΅πŸ‘‡

10.02.2026 16:54 β€” πŸ‘ 2    πŸ” 2    πŸ’¬ 1    πŸ“Œ 0
Post image

Can LLMs play Hangman? Spoiler alert: Not yet.
Check out β€œLLMs Can’t Play Hangman: On the Necessity of a Private Working Memory for Language Agents”, led by Davide Baldelli, Ali Parviz, AmalZouaq and Sarath Chandar.

27.01.2026 16:20 β€” πŸ‘ 1    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0

Can LLMs become CAD designers?
Check out β€œCADmium: Fine-Tuning Code Language Models for Text-Driven Sequential CAD Design”, which is now published in Transactions on Machine Learning Research (TMLR)!

20.01.2026 15:55 β€” πŸ‘ 3    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0
Post image Post image Post image Post image

Life update - last month I moved to #montreal πŸ‡¨πŸ‡¦ from #Sydney πŸ‡¦πŸ‡Ί to kick off my @ivado.bsky.social postdoc fellowship at @mila-quebec.bsky.social. Must say I am constantly amused by: 1. How walkable the city is.
2. How easy is it to reach out to diverse research communities within #mila ! πŸ˜€

03.10.2025 15:08 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image Post image Post image Post image

πŸŽ‰ Happy to share that our paper β€œMining your own secrets: Diffusion Classifier scores for Continual Personalization of Text-to-Image Diffusion Models” has been accepted to #ICLR2025!

πŸ‘‰ The work results from my #Sony summer internship in the stunning #TokyoπŸ—Ό city

Preprint: arxiv.org/pdf/2410.00700

22.01.2025 23:08 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

I ran across a busy Sander at a #neurips party with a similar question - he was still patient enough to explain stuff. This talk further clarifies a good amount of my doubts. Recommend watching if you're working on diffusion / LLMs for generation!

25.12.2024 23:43 β€” πŸ‘ 7    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Post image

I validate this

17.12.2024 07:28 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0