Nikhil's paper has many other remarkable experiments, including interventions that reveal discrete reasoning steps where the two people in a story are aware or unaware of each other's actions.
The picture is not complete, but it's worth reading and contemplating.
25.06.2025 15:00
Benjamin Riley (@benjaminjriley.bsky.social)
This isn't what cognitive scientists mean by theory of mind, and LLMs are not "applying" theory of mind when they produce text that includes the words "birds" and "cat" and "mind."
Looking inside these models is a way to break into the Chinese room; it is a way to approach the puzzle of whether apparent skills like Theory-of-Mind are just an LLM Clever Hans trick, or whether the model contains reasonable representations.
bsky.app/profile/ben...
25.06.2025 15:00
A double dereference can be seen at L35, where patching states on identical words between identical permuted stories changes the answer.
That's because "dereference thing 2" looks for the floating definition of "where is thing 2."
The patch redirects this second indirection!
25.06.2025 15:00
It is strong evidence that what's stored deeper than 50 is not a value but a POINTER dereferenced at 55.
Dai calls these OIs (arxiv.org/abs/2409.05448) and
@fjiahai.bsky.social calls them binding IDs (arxiv.org/abs/2310.17191).
It is a general "lookback" pattern.
Next surprise is nested...
25.06.2025 15:00
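A toy sketch of the pointer-versus-value distinction in this thread. It is not the paper's code and touches no real model: the `answer` function, the slot numbers, and the extra drink "water" are hypothetical, chosen only to show why a patched pointer yields an item from the original story (beer) while a patched value yields the item from the other story (tea).

```python
# Toy illustration only: hypothetical names, not the paper's code or a real model.

def answer(state, story):
    """Read an answer out of a simplified 'hidden state' for a given story."""
    if "value" in state:
        # The payload has already been fetched, so it is copied out as-is.
        return state["value"]
    # Otherwise the state holds a pointer, dereferenced in *this* story.
    return story[state["pointer"]]

original = {0: "coffee", 1: "beer"}  # drinks in the story being answered
source = {0: "water", 1: "tea"}      # drinks in the counterfactual story

clean = {"pointer": 0}                # unpatched run on the original story
patched_pointer = {"pointer": 1}      # source state captured before dereference
patched_value = {"value": source[1]}  # source state captured after dereference

print(answer(clean, original))            # unpatched answer
print(answer(patched_pointer, original))  # pointer resolves in the original story
print(answer(patched_value, original))    # value carried over from the source story
```

The same patch location can therefore give two different wrong answers, and which one appears tells you whether the state held an address or the fetched contents.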
There is a lot to unpack in Nikhil's paper and it merits a close reading.
The first thing to understand is his remarkable Fig 2 experiment. Why does the patching of one state, which alters coffee->tea, switch to coffee->beer when you move states deeper than layer 55?
25.06.2025 15:00
This is the book to read before protesting in LA
11.06.2025 06:46
If you do NOT live in a red state, then please have a FRIENDLY chat with somebody who does, to make sure they are aware of what is happening and the stakes.
It does NO good to shout at MAGA. We need to talk to people. Here are some thoughts about engaging on X.
x.com/davidbau/st...
03.06.2025 16:15
If you live in ME, KS, WV, IN, LA, MS, TX, ID, NC, AK, TN, MT, ND, AL, NE, SC, or any red state then your senator has outsized influence.
Read here about local impact and how to contact them. They DO listen to voters. They WILL listen to you!
thevisible.net/posts/005-a...
03.06.2025 16:15
FRIENDS: American science is being decimated by Congress NOW.
Your help is needed to fix this. The current DC plan PERMANENTLY slashes NSF, NIH, all science training. Money isn't redirected; it's gone.
Please read+share what's happening
thevisible.net/posts/004-s...
03.06.2025 16:15
Recognize that when you engage with the broad public, many will disbelieve you.
You will find many who will "mansplain" science back to you.
Push back with clarity and evidence. You will help reveal the TACO nature of authoritarian views.
Do not fear that. It is our job.
29.05.2025 19:53
My concrete advice to PhD students:
(1) Do not be cowed by the fascist horde. Do engage with the public, especially skeptics.
(2) Speak on the things where you are expert, not where you are a dabbler. But recognize you are expert in many things.
(3) Be friendly, clear and firm.
29.05.2025 19:49
We must not hide in our Bluesky corner where voters are not. We need to flood the unfriendly airwaves on X, YouTube, TikTok. And we must show up with our faces.
We need to be vulnerable, because no AI misinformation bot can match "being a real person."
Show your face. Defend your work.
29.05.2025 11:09
We need to get our heads out of our *sses.
This is not the moment to focus on your personal ambition, to show why your latest sophisticated widget is better than doctor competitor's intricate theorem.
The whole scientific franchise is under attack. It is time to defend it to the public.
29.05.2025 11:04
We cannot let our fear of political retribution lead us to cede the internet to stone-age propaganda. Academics: please stand up on social (and all) media. You are expert teachers.
Share your personal stories. Defend your work to the public. Defend your international students.
29.05.2025 10:54
Even my lefty Boston neighbors do not know. They think Rubio is expelling troublemakers, radicals, communists. Or that just Harvard is in the crosshairs.
They are totally unaware that he has stopped all student visas, or why that kills US science.
If we are academics: we need to teach.
29.05.2025 10:48
Because of propaganda, Americans do not understand what Rubio is doing with visas. "I gave you a visa to come and study," they think.
x.com/CitizenFree...
NO, he has not!! Please help explain to X how Rubio has stopped *ALL* student visas, and how it is killing US science.
29.05.2025 10:27
Here is some evidence. But it doesn't seem to support @ukraineman101.bsky.social.
www.economist.com/science-and-...
29.05.2025 10:13
bsky.app/profile/chri...
29.05.2025 08:02
First They Came - Wikipedia
To engage in the logic is to lose the game.
The fascist playbook: normalize hate, starting at the most vulnerable populations, until everyone is subjugated.
en.wikipedia.org/wiki/First_T...
28.05.2025 23:57
Trump administration orders US embassies to stop student visa interviews
Directive could severely delay visa processing and hurt universities that rely on foreign students for revenue
I am in Boston because I believe in American democracy. I love our freedoms and our culture. I want to be teaching US students.
And I want to live and teach here in my home, where I grew up.
Why are we setting our home on fire?
www.theguardian.com/us-news/202...
28.05.2025 13:29
The USA is a magnet for AI talent! But with today's clampdown on international students our ecosystem is suddenly trashed.
Several of my projects have incoming PhD talent signed but frozen out in Germany, Denmark...
We are now discussing setting up shop in Toronto or London.
28.05.2025 13:29
Here is some of what my lab does: baulab.info/
Just yesterday I had a conversation with other Boston natives about setting up a new AI incubator in town.
I am so excited by this. It is so important to figure out how to attract, nurture, and retain talent locally.
28.05.2025 13:29
When setting up my AI lab I faced a choice between Toronto and Boston. I chose Boston, my home and the world's best incubator for research talent.
Here you can take a short stroll to meet with top minds in hundreds of fields from AI to astronomy, batteries to biotech.
28.05.2025 13:29
The techno utopian philosophy!
But Opus didn't like that essay and recommended one about the venture capital system.
25.05.2025 14:48
I found the essay gripping. Although I asked for it and critiqued it, it is not my own perspective.
I think Opus's advocacy about AI safety, with its specific diagnosis of the problem, is worth reading.
The essay can be found here.
davidbau.com/archives/20...
25.05.2025 13:12
Black Box, Blood Money
Friday evening, an Italian tourist escaped a torturer in Manhattan who was after his crypto password. I asked Anthropic's Opus 4 to analyze and explain what the episode might teach us about AI.
It critiqued my guidance, instead proposing a focus on VCs:
25.05.2025 13:12
Assistant professor of computer science at Technion
https://belinkov.com/
Assistant Professor at UC Berkeley
Google Chief Scientist, Gemini Lead. Opinions stated here are my own, not those of Google. Gemini, TensorFlow, MapReduce, Bigtable, Spanner, ML things, ...
Professor at NYU; Chief AI Scientist at Meta.
Researcher in AI, Machine Learning, Robotics, etc.
ACM Turing Award Laureate.
http://yann.lecun.com
Senior Director, Research Scientist @ Meta FAIR + Visiting Prof @ NYU.
Pretrain+SFT: NLP from Scratch (2011). Multilayer attention+position encode+LLM: MemNet (2015). Recent (2024): Self-Rewarding LLMs & more!
Computer Vision research group @ox.ac.uk
Cofounded and lead PyTorch at Meta. Also dabble in robotics at NYU.
AI is delicious when it is accessible and open-source.
http://soumith.ch
Research scientist at Anthropic. Prev. Google Brain/DeepMind, founding team OpenAI. Computer scientist; inventor of the VAE, Adam optimizer, and other methods. ML PhD. Website: dpkingma.com
Group Leader, the Francis Crick Institute.
Social Behaviour and Connectomics
https://www.crick.ac.uk/research/labs/michael-winding
Computer vision research scientist • ex big tech • cinematographer • UIUC (PhD), Caltech
PhD student doing LLM interpretability with @davidbau.bsky.social and @byron.bsky.social. (they/them) https://sfeucht.github.io