Sonia's Avatar

Sonia

@soniajoseph.bsky.social

AI researcher at Mila, visiting researcher at Meta Also on X: @soniajoseph_

1,304 Followers  |  343 Following  |  49 Posts  |  Joined: 08.10.2024
Posts Following

Posts by Sonia (@soniajoseph.bsky.social)

Spotify – Web Player

Link to podcast:

open.spotify.com/episode/1tdJ...

07.11.2025 00:40 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image Post image

We covered a lot of ground, from how AI learns physics from video and forms internal models of the world, to the risks of deception, sycophancy, and misaligned goals in robotics.

07.11.2025 00:40 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

It was a pleasure to be interviewed about world model interpretability, physical intelligence, and robot security by Paige Harriman @climatepaige.bsky.social.

It takes skill to lead an interview that everyone from technical researchers to laymen can enjoy and understand! πŸ€–

tinyurl.com/ycypkmjf

07.11.2025 00:33 β€” πŸ‘ 9    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0

tell people here

11.07.2025 16:14 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Hi welcome

10.07.2025 22:06 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

πŸ™πŸ™

26.05.2025 03:10 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

It has been extremely disorienting traveling between the tech right in SF and the academic left in Montreal and has induced a somewhat deep sense of moral relativism.

23.05.2025 02:49 β€” πŸ‘ 5    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

Half the people who say you don’t need to go to grad school to do AI research have been to grad school

04.05.2025 02:52 β€” πŸ‘ 7    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Video thumbnail

We visualized thousands of CLIP SAE features in collaboration between @fraunhoferhhi.bsky.social and @mila-quebec.bsky.social!

Search thousands of interpretable CLIP features in our vision atlas, with autointerp labels and scores like clarity and polysemanticity.

Link: tinyurl.com/3ffy8xk6

28.04.2025 14:45 β€” πŸ‘ 11    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0

How is this different from capitalism

26.04.2025 15:44 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Ok nice, I’m trying to figure out if the current state of AI culture in the US is some path dependent Yudkowsky thing or more universal

26.04.2025 14:42 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

I’m really excited about Diffusion Steering Lens, an intuitive and elegant new β€œlogit lens” technique for decoding the attention and MLP blocks of vision transformers!

Vision is much more expressive than language, so some new mech interp rules apply:

25.04.2025 13:36 β€” πŸ‘ 11    πŸ” 3    πŸ’¬ 0    πŸ“Œ 0

Why are there no doomers in China?

23.04.2025 20:30 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

A genuine question for the death by 2027 people:

China authors 30% of top papers, is winning in video zero shot retrieval with InternVideo2, & has crazy centralization of data and infra.

The CCP is not stupid. What does it mean, if they have no plans for doom within 2 yrs?

23.04.2025 20:30 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

To contain the β€œcrazy” in clean, Euclidean lines is a tragedy.

Many researchers have left the field because of attempts by others to shut it down or contain it. And the consequences on the industry have been profoundly negative.

23.04.2025 17:50 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

I would encourage more canonical EA alignment researcher types to learn to recognize this cognitive style. And to see that you need it for your field to succeed.

23.04.2025 17:50 β€” πŸ‘ 0    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0

I would encourage those of this archetype to lean into it, to be even more unhinged, and to go all the way. Find others of their likeness and coordinate.

23.04.2025 17:50 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

The celebrity fixation is often due to parasociality (a safe person to β€œsocialize” with) and a template for social masking (you can mimic their behaviors).

The beauty/makeup fixation is just like any other autistic fixation, but sublimated into acceptable, female coded pursuits.

23.04.2025 17:49 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Girls of this neurotype are often wildly fractal, independent, and creative. I’ve noticed they can also freak out a lot of EA / alignment researcher types, who try to contain the β€œcrazy” in clean, Euclidean lines.

23.04.2025 17:49 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

AI research is famously autisticβ€” but autism in women can look different: intense daydreaming, an obsession with celebrities, or a fixation with female-coded activities like makeup/beauty. It’s socially masked and not visible to the untrained eye.

23.04.2025 17:49 β€” πŸ‘ 4    πŸ” 1    πŸ’¬ 1    πŸ“Œ 1
Post image

Looking forward to being a speaker at the first mechanistic interpretability workshop for vision at CVPR! πŸ”₯

sites.google.com/view/miv-cvp...

Paper submission deadline March 1st

06.02.2025 18:51 β€” πŸ‘ 11    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0

Leaving a digital trail, just in case, given there is speculation of foul play currently in the Valley.

I am happy and healthy and intend to live a long and beautiful life. If anything were to happen to me, that would be highly suspicious and should be investigated.

15.12.2024 21:29 β€” πŸ‘ 20    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
x.com

I’ve been giving some context on AI development environments here, particularly how co-living and tribal bonds among researchers in Silicon Valley influence ethical decisions, whistleblowing, and the trajectory of AGI development.

x.com/soniajoseph_...

06.12.2024 16:36 β€” πŸ‘ 8    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
x.com

I’ve been giving some context on AI development environments here, particularly how co-living and tribal bonds among researchers in Silicon Valley influence ethical decisions, whistleblowing, and the trajectory of AGI development.

x.com/soniajoseph_...

06.12.2024 16:36 β€” πŸ‘ 8    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

The purpose of the original tweet was to alert my allies; I’m not really sure the point of your response. If you don’t find it useful, feel free to block.

06.12.2024 16:33 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

I’ll be tweeting out something important this weekend that has been happening in the shadows of the AI industry.

06.12.2024 16:07 β€” πŸ‘ 7    πŸ” 1    πŸ’¬ 2    πŸ“Œ 0
Different paths toward safe AI at different Marr's levels

Different paths toward safe AI at different Marr's levels

Excited to release what we’ve been working on at Amaranth Foundation, our latest whitepaper, NeuroAI for AI safety! A detailed, ambitious roadmap for how neuroscience research can help build safer AI systems while accelerating both virtual neuroscience and neurotech. 1/N

02.12.2024 16:17 β€” πŸ‘ 148    πŸ” 51    πŸ’¬ 5    πŸ“Œ 17

How to drive your research forward?

β€œI tested the idea we discussed last time. Here are some results. It does not work. (… awkward silence)”

Such conversations happen so many times when meetings with students. How do we move forward?

You need …

01.12.2024 22:09 β€” πŸ‘ 90    πŸ” 18    πŸ’¬ 1    πŸ“Œ 1

My great great grandmother learned English to host and entertain the wives of British officers over tea in Madras. My grandmother would end up getting a PhD in Indo Anglian literature. It’s funny how history moves.

01.12.2024 18:50 β€” πŸ‘ 5    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

thank you for your words!

01.12.2024 18:31 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0