Alex Irpan

@alexirpan.bsky.social

Research Scientist @ Google DeepMind. Formerly Robotics, now AI Safety. Has a blog. Views are my own.

1,296 Followers  |  2 Following  |  12 Posts  |  Joined: 14.11.2024  |  1.5183

Latest posts by alexirpan.bsky.social on Bluesky

Authentic Imperfection Auto-Tune is great.

I didn't know where this post was going when I started and I'm not sure where it went now that it ended, but that felt correct in some way.

www.alexirpan.com/2025/11/16/a...

16.11.2025 17:31 | 👍 3    🔁 0    💬 0    📌 0

First paper since switching into AI safety team 🎉

We look at problems that could be solved if the model behaved consistently over a set of prompts, and tried training that in output space and internal activations. Both were effective. See thread or paper for details.

05.11.2025 18:26 | 👍 17    🔁 2    💬 1    📌 0
Ten Years Later My blog turns ten years old today. The big 1-0. Thanks for reading!

Today is my 10 year blogging anniversary.
www.alexirpan.com/2025/08/18/t...

18.08.2025 16:24 | 👍 6    🔁 0    💬 0    📌 0
Brony Musicians Seize The Means of Production: My Eyewitness Account to BABSCon 2025 Bronies are older fans of My Little Pony: Friendship is Magic. They are mostly male, typically in 20s-30s age wise, and have been trending older and more female over time. (A lot of girls in the origi...

For the past month I have been working on a blog post about niche MLP fandom drama. Well here it is.

www.alexirpan.com/2025/07/21/b...

21.07.2025 17:11 | 👍 2    🔁 0    💬 0    📌 0

"I don't play gacha games because they're a scam"
vs
"Let me do one more hyperparam sweep before giving up. One more prompt tuning run. I swear we'll beat baseline. I know it's gonna beat the baseline this time. It's gonna win. This time for sure."

05.06.2025 01:16 | 👍 3    🔁 0    💬 0    📌 0
Who is AI For? Who is AI for right now? There are obvious use cases. Image generation for people who want filler art for work presentations, or just to mess around. Coding assistance for people who code, vibe coding...

www.alexirpan.com/2025/04/01/w...

01.04.2025 16:28 | 👍 6    🔁 2    💬 0    📌 0
MIT Mystery Hunt 2025 This has spoilers for MIT Mystery Hunt 2025. Spoilers are not labeled or hidden.

My MIT Mystery Hunt post for the year

www.alexirpan.com/2025/01/28/m...

28.01.2025 16:42 | 👍 10    🔁 2    💬 0    📌 0

I am now back from #MITMysteryHunt with no memory of anything besides Hunt from MLK weekend. Really this is probably for the best.

21.01.2025 16:38 | 👍 7    🔁 0    💬 0    📌 0
Using AI to Get the Neopets Destruct-o-Match Avatar I have blogged about Neopets before, but a quick refresher. Neopets is a web game from the early 2000s, which recently celebrated its 25th anniversary and maintains a small audience of adult millennia...

It is time for more posts about Neopets

www.alexirpan.com/2025/01/09/d...

10.01.2025 07:43 | 👍 5    🔁 0    💬 0    📌 0

The ship has sailed, but I wish the ML reporting default was % incorrect rather than % correct. It better matches loss curves and magnifies the capture of edge cases.

95% accuracy -> 97.5% accuracy = meh
5% error -> 2.5% error = omg we've halved the error rate

19.12.2024 20:36 | 👍 9    🔁 0    💬 0    📌 0

The question of "how's o1 using its test compute" is better asked to someone who worked on it, since AFAIK that hasn't been disclosed. But yes, language models having really dynamic / freeform actions makes them hard to think about.

05.12.2024 04:56 | 👍 3    🔁 0    💬 1    📌 0
Late Takes on OpenAI o1 I realize how late this is, but I didn't get a post out while o1 was fresh, and still feel like writing one despite it being cold. (Also, OpenAI just announced they're going to ship new stuff starting...

I wrote some stuff on OpenAI o1

www.alexirpan.com/2024/12/04/l...

04.12.2024 18:11 | 👍 15    🔁 2    💬 1    📌 2
