Sourav's Avatar

Sourav

@souravmishra.bsky.social

I like to talk about ML & 3D. University of Tokyo alumni. Previously: Virginia Tech, Stanford Biodesign. Visiting Fellow @ MSR Big Data

118 Followers  |  546 Following  |  64 Posts  |  Joined: 20.10.2023  |  1.8956

Latest posts by souravmishra.bsky.social on Bluesky

Post image

L5 Manager stork making sure the L3 and L4 employee storks finishing their task properly πŸ˜‚

cc @pierrealquier.bsky.social

18.05.2025 05:52 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
now publishers - A Tutorial on Meta-Reinforcement Learning Publishers of Foundations and Trends, making research accessible

Our survey on meta reinforcement learning has now been published by Foundations and Trends in Machine Learning: nowpublishers.com/article/Deta...

18.04.2025 15:19 β€” πŸ‘ 12    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0

Big win for Stanford NLP to have @yejinchoinka.bsky.social Looking forward to new amazing directions

08.04.2025 03:57 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0

Excited to see Waymo come to Tokyo

* Tokyo has a passive, secondary transportation mode which will stay strong despite the subway.

* It is generally considered to be one of the safest cities. Low chances of vandalism.

* People are curious to try out new technology across all age strata.

08.04.2025 03:21 β€” πŸ‘ 6    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Preview
Tracing the thoughts of a large language model Anthropic's latest interpretability research: a new microscope to understand Claude's internal mechanisms

Picking the brains of the Claude model. Anthropic has made a new foray into "AI biology" and some interesting case studies on what the assumptions are.

Turns out LLMs don't entirely piece together their response the way we do. Great investigate work
www.anthropic.com/research/tra...

27.03.2025 21:49 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
On the Biology of a Large Language Model

Can we understand the mechanisms of a frontier AI model?

πŸ“ Blog post: www.anthropic.com/research/tra...
πŸ§ͺ "Biology" paper: transformer-circuits.pub/2025/attribu...
βš™οΈ Methods paper: transformer-circuits.pub/2025/attribu...

Featuring basic multi-step reasoning, planning, introspection and more!

27.03.2025 18:18 β€” πŸ‘ 126    πŸ” 29    πŸ’¬ 4    πŸ“Œ 3
Post image

In the Ghibli world I look like this. Not bad. Not bad at all! :-)

27.03.2025 21:34 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Is "vibe coding" same as "prompt engineering" from 6 mo ago?

I am having trouble keeping up with names and acronyms now, forget research papers

25.03.2025 12:06 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

don’t learn to write code, learn to read code

23.03.2025 21:13 β€” πŸ‘ 3    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

Too bad it's a little late for Team Australia for the Olympics

25.03.2025 11:50 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Such a beautiful representation of the vernal equinox 😍

20.03.2025 11:09 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

"There was a girl from India who had overstayed her student visa for 3 days before heading back home. She then came back to the US on a new, valid visa to finish her master’s degree and was handed over to ICE due to the 3 days she had overstayed"

WOW. Just wow. Cruelty is in vogue for US govt. now

20.03.2025 10:30 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Best LLMs are decided by evals on leaderboards
(I am looking at you, Gemini)

20.03.2025 09:34 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Docs Docs: Your new companion to collaborate on documents efficiently, intuitively, and securely.

Like Notion? Well this is a open source clone sponsored by FR-DE governments. And frankly, it looks great

docs.numerique.gouv.fr/login/

18.03.2025 12:00 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

The arrival of early spring *chef's kiss* Nice shots Sid. Get yourself a proper camera

17.03.2025 03:47 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

OLMo2 goodies ↓

14.03.2025 03:49 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Announcing OLMo 2 32B: the first fully open model to beat GPT 3.5 & GPT-4o mini on a suite of popular, multi-skill benchmarks.

Comparable to best open-weight models, but a fraction of training compute. When you have a good recipe, ✨ magical things happen when you scale it up!

13.03.2025 18:36 β€” πŸ‘ 58    πŸ” 15    πŸ’¬ 3    πŸ“Œ 3

True open source looks like OLMO2 πŸ˜… codes weights, recipes all available to inspect

14.03.2025 03:46 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

The 3rd edition of Interpretable Machine Learning is out! πŸŽ‰ Major cleanup, better examples, and new chapters on Data & Models, Interpretability Goals, Ceteris Paribus, and LOFO Importance.

The book remains free to read for everyone. But you can also buy ebook or paperback.

13.03.2025 12:09 β€” πŸ‘ 16    πŸ” 4    πŸ’¬ 0    πŸ“Œ 0
Preview
Introducing Gemma 3: The most capable model you can run on a single GPU or TPU Today, we're introducing Gemma 3, our most capable, portable and responsible open model yet.

Introducing our Gemma 3 open models, the most capable models that you can run on a single GPU or TPU. Multimodal, multilingual, 128k context length, and exceeds quality of other open models that are an order of magnitude larger in terms of hardware footprint. πŸŽ‰

blog.google/technology/d...

13.03.2025 14:55 β€” πŸ‘ 140    πŸ” 21    πŸ’¬ 2    πŸ“Œ 3

Nice. Congratulations Pierre.

(I am pretty sure I won't understand it. It looks pretty dense πŸ˜‚πŸ˜… so I am honest with you. But congratulations for the acceptance πŸ₯³)

12.03.2025 11:30 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

ML project naming is getting out of hands πŸ˜€

11.03.2025 00:08 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Seems like it. I saw the chapters themselves have some typos.

However that 'gradient tape' example is a bit... bizarre ? Things could have been left at static vs dynamic computation graph (define-and-run vs. define-while-run for easier speaking)

I hope the book doesn't turn out a mish-mash

09.03.2025 18:53 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Ha! You are waiting on it too with preorder ! πŸ˜€ It was originally slated January & it has slipped behind schedule

09.03.2025 13:29 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Any news about the book release @fchollet.bsky.social ? The new chapters look promising especially

09.03.2025 13:23 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

I have a solid feeling that Bsky has better academic discussion than Twitter as of now

Musk's app will be relegated to all kinds of shitposting in general by me. Serious posts belong here πŸ˜…

05.03.2025 12:28 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
David Picard

I updated my ML lecture material: davidpicard.github.io/teaching/
I show many (boomer) ML algorithms with working implementation to prevent the black box effect.
Everything is done in notebooks so that students can play with the algorithms.
Book-ish pdf export: davidpicard.github.io/pdf/poly.pdf

27.02.2025 19:09 β€” πŸ‘ 37    πŸ” 6    πŸ’¬ 0    πŸ“Œ 0

I love this characterization πŸ˜… So accurate

05.03.2025 11:44 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Sutton and Barto won the 2024 ACM Turing award. Reinforcement learning is getting its due finally

05.03.2025 11:28 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Nice!

05.03.2025 11:27 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

@souravmishra is following 20 prominent accounts