Anna (Anya) Ivanova's Avatar

Anna (Anya) Ivanova

@neuranna.bsky.social

Language and thought in brains and in machines. Assistant Prof @ Georgia Tech Psychology. Previously a postdoc @ MIT Quest for Intelligence, PhD @ MIT Brain and Cognitive Sciences. She/her https://www.language-intelligence-thought.net

2,552 Followers  |  632 Following  |  65 Posts  |  Joined: 06.11.2023  |  1.6682

Latest posts by neuranna.bsky.social on Bluesky

Find Taha’s poster today at the @dataonbrainmind.bsky.social workshop #NeurIPS2025

07.12.2025 14:26 β€” πŸ‘ 5    πŸ” 0    πŸ’¬ 0    πŸ“Œ 2
The Quantization Model of Neural Scaling

I like this one proceedings.neurips.cc/paper_files/...

29.11.2025 17:38 β€” πŸ‘ 5    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

We leave the β€œhow” to future work (by you and others and maybe us eventually)
I’ll keep ROSE in mind as a testable prediction of the ”how” claims

28.11.2025 15:57 β€” πŸ‘ 7    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

The main Q you deal with in your post - the how - is definitely an important one, but we didn’t engage with it much in this short piece; our focus is on the higher-level organizational principles. We did discuss it internally though, including the traveling waves idea.

28.11.2025 15:55 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

…the claim in your post (lang areas as an executive processor) targets the same level of explanation as our proposal. And, as you say, it’s licensed by minimalism. So then it would make sense, to me, to compare this minimalist prediction with our proposal and find key differences (eg causal impact)

28.11.2025 15:51 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Haha a great use of Thanksgiving break πŸ˜‚ thank you for engaging, but I still think my original question is answerable!

I agree, some areas of lang science make no predictions about the mechanisms of lang processing. And even those that do might not commit to a specific neural implementation. But…

28.11.2025 15:46 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Skyline of Madison, WI

Skyline of Madison, WI

🚨I am looking for a POSTDOC, LAB MANAGER/TECH and GRAD STUDENTS to join my new lab in beautiful Madison, WI.
We study how our brains perceive and represent the physical world around us using behavioral, computational, and neuroimaging methods.
paulunlab.psych.wisc.edu
#VisionScience #NeuroSkyence

17.11.2025 21:43 β€” πŸ‘ 48    πŸ” 41    πŸ’¬ 1    πŸ“Œ 4

Cool! What’s the prediction of the program about what would happen if the core language system was disabled? (I suspect this is where the predictions of the two theories would diverge)

27.11.2025 15:53 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

What a privilege and a delight to work with @coltoncasto.bsky.social @ev_fedorenko and @neuranna
on this new speculative piece on What it means to understand language, nicely summarized in this
Tweeprint from @coltoncasto.bsky.social arxiv.org/abs/2511.19757

26.11.2025 16:34 β€” πŸ‘ 35    πŸ” 6    πŸ’¬ 2    πŸ“Œ 0

Neuroscience of language has a dilemma: how do we reconcile extensive patient and imaging evidence for **language-specific** processing with the fact that naturalistic language evokes extensive activity all over the brain? We propose a framework that accounts for both.

26.11.2025 16:43 β€” πŸ‘ 17    πŸ” 4    πŸ’¬ 0    πŸ“Œ 0
Preview
What enables human language? A biocultural framework Explaining the origins of language is a key challenge in understanding ourselves as a species. We present an empirical framework that draws on synergies across fields to facilitate robust studies of l...

Origins of language, one of humanity’s most distinctive traits, may be best explained as a unique convergence of multiple capacities each with its own evolutionary history, involving intertwined roles of biology & culture. This framing can expand research horizons. A 🧡 on our @science.org paper.πŸ§ͺ1/n

23.11.2025 11:52 β€” πŸ‘ 201    πŸ” 86    πŸ’¬ 6    πŸ“Œ 9
Preview
Genomic Investigations of Spoken and Written Language Abilities: A Guide to Advances in Approaches, Technologies, and Discovery Purpose: The aim of this tutorial is to show how the rise of molecular technologies and analytical methods in human genetics yields exciting new ...

Advances in genomics are giving exciting new perspectives on biology of speech, language & reading. My latest peer-reviewed paper is a tutorial, guiding readers from different backgrounds through the history of the field, current state-of-the-art, & where we’re heading. A taster in this thread.πŸ§ͺ
1/n

17.11.2025 17:52 β€” πŸ‘ 55    πŸ” 30    πŸ’¬ 1    πŸ“Œ 2

You get a few free runs if you register with an edu email. I tried once, it was ok - I think a truly successful approach would require some prompt iteration, so multiple runs. Also of course the result will very much depend on the problem at hand.

17.11.2025 16:27 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Top: A syntax tree for the sentence "the doctor by the lawyer saw the artist".

Bottom: A continuous vector.

Top: A syntax tree for the sentence "the doctor by the lawyer saw the artist". Bottom: A continuous vector.

πŸ€–πŸ§ I'll be considering applications for PhD students & postdocs to start at Yale in Fall 2026!

If you are interested in the intersection of linguistics, cognitive science, & AI, I encourage you to apply!

PhD link: rtmccoy.com/prospective_...
Postdoc link: rtmccoy.com/prospective_...

14.11.2025 16:40 β€” πŸ‘ 36    πŸ” 13    πŸ’¬ 2    πŸ“Œ 2
Screenshot of a figure with two panels, labeled (a) and (b). The caption reads: "Figure 1: (a) Illustration of messages (left) and strings (right) in toy domain. Blue = grammatical strings. Red = ungrammatical strings. (b) Surprisal (negative log probability) assigned to toy strings by GPT-2."

Screenshot of a figure with two panels, labeled (a) and (b). The caption reads: "Figure 1: (a) Illustration of messages (left) and strings (right) in toy domain. Blue = grammatical strings. Red = ungrammatical strings. (b) Surprisal (negative log probability) assigned to toy strings by GPT-2."

New work to appear @ TACL!

Language models (LMs) are remarkably good at generating novel well-formed sentences, leading to claims that they have mastered grammar.

Yet they often assign higher probability to ungrammatical strings than to grammatical strings.

How can both things be true? πŸ§΅πŸ‘‡

10.11.2025 22:11 β€” πŸ‘ 84    πŸ” 19    πŸ’¬ 2    πŸ“Œ 3

Delighted Sasha's (first year PhD!) work using mech interp to study complex syntax constructions won an Outstanding Paper Award at EMNLP!

Also delighted the ACL community continues to recognize unabashedly linguistic topics like filler-gaps... and the huge potential for LMs to inform such topics!

07.11.2025 18:22 β€” πŸ‘ 33    πŸ” 8    πŸ’¬ 1    πŸ“Œ 0

Excited to share our work on mechanisms of naturalistic audiovisual processing in the human brain 🧠🎬!!
www.biorxiv.org/content/10.1...

07.11.2025 16:01 β€” πŸ‘ 6    πŸ” 5    πŸ’¬ 9    πŸ“Œ 2
This document is scheduled to be published in the
Federal Register on 10/30/2025 and available online at
https://federalregister.gov/d/2025-19702, and on https://govinfo.gov

DEPARTMENT OF HOMELAND SECURITY: Removal of the Automatic Extension of Employment Authorization Documents AGENCY: U.S. Citizenship and Immigration Services (USCIS), Department of Homeland Security (DHS). ACTION: Interim final rule (β€œIFR”) with request for comments. ______________________________________________________________________ SUMMARY: This IFR amends DHS regulations to end the practice of automatically extending the validity of employment authorization documents (Forms I-766 or EADs) for aliens who have timely filed an application to renew their EAD in certain employment authorization categories. The purpose of this change is to prioritize the proper vetting and screening of aliens before granting a new period of employment authorization and/or a new EAD. This IFR does not impact the validity of EADs that were automatically extended prior to [INSERT DATE OF PUBLICATION IN THE FEDERAL REGISTER] or which are otherwise automatically extended by law or Federal Register notice.

This document is scheduled to be published in the Federal Register on 10/30/2025 and available online at https://federalregister.gov/d/2025-19702, and on https://govinfo.gov DEPARTMENT OF HOMELAND SECURITY: Removal of the Automatic Extension of Employment Authorization Documents AGENCY: U.S. Citizenship and Immigration Services (USCIS), Department of Homeland Security (DHS). ACTION: Interim final rule (β€œIFR”) with request for comments. ______________________________________________________________________ SUMMARY: This IFR amends DHS regulations to end the practice of automatically extending the validity of employment authorization documents (Forms I-766 or EADs) for aliens who have timely filed an application to renew their EAD in certain employment authorization categories. The purpose of this change is to prioritize the proper vetting and screening of aliens before granting a new period of employment authorization and/or a new EAD. This IFR does not impact the validity of EADs that were automatically extended prior to [INSERT DATE OF PUBLICATION IN THE FEDERAL REGISTER] or which are otherwise automatically extended by law or Federal Register notice.

NEW: The Trump admin just ended the practice of automatically extending work permits when people file to renew them β€” meaning that if USCIS takes too long to process a renewal the applicant loses their authorization to work legally.

The interim final rule applies to renewals filed after tomorrow.

29.10.2025 13:43 β€” πŸ‘ 1164    πŸ” 705    πŸ’¬ 55    πŸ“Œ 75

Much remains to be done on this front, ideas are welcome!

6/end

23.10.2025 16:21 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Finally, we wanted to ensure that what we're showing is LLM patterns not TunedLens patterns - see Appendix for the control analyses we do! (+a comparison with LogitLens) 5/

23.10.2025 16:20 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

We then apply this approach to 3 case studies - prediction by part-of-speech, multi-token fact recall, and fixed-response question answering. Check the paper & Akshat's thread for details!

(look at this model predicting positive/negative sentiment - such a clear pattern!)

4/

23.10.2025 16:20 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Most early layer predictions get overturned! It appears that these are statistical guesses, made when the model has not yet processed enough contextual information.

(flip rate = how often the model's final prediction differs from the current layer's prediction)

3/

23.10.2025 16:19 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

In early layers, most frequent tokens (e.g., "the") dominate predictions, whereas infrequent tokens gradually become more predicted later on.

2/

23.10.2025 16:18 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

πŸ€–πŸ“ˆ How do LLMs use their depth?

Akshat Gupta led a fun project to find out! We leverage TunedLens (~linear decoding of tokens) to explore how LLMs' internal representations change from layer to layer.

Preprint: arxiv.org/abs/2510.18871

1/

23.10.2025 16:17 β€” πŸ‘ 69    πŸ” 16    πŸ’¬ 3    πŸ“Œ 0

Perhaps! But language contains a lot of information about the physical world, so in principle these are learnable from language alone

21.10.2025 12:01 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Omg adorable! Thanks! :)

21.10.2025 03:28 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Finally out in TACL:
🌎EWoK (Elements of World Knowledge)🌎: A cognition-inspired framework for evaluating basic world knowledge in language models

tl;dr: LLMs learn basic social concepts way easier than physical&spatial concepts

Paper: direct.mit.edu/tacl/article...
Website: ewok-core.github.io

20.10.2025 17:36 β€” πŸ‘ 70    πŸ” 10    πŸ’¬ 1    πŸ“Œ 2
Preview
Dense Phenotyping of Human Brain Network Organization Using Precision fMRI The advent of noninvasive imaging methods like functional magnetic resonance imaging (fMRI) transformed cognitive neuroscience, providing insights into large-scale brain networks and their link to cog...

Why do brain networks vary? Do these differences shape behavior? If every 🧠 is unique, how can we detect common features of brain organization?
@rodbraga.bsky.social and I dig in, in @annualreviews.bsky.social (ahead of print):
go.illinois.edu/Gratton2025-...

#neuroskyence #psychscisky #MedSky
πŸ§΅πŸ‘‡

16.10.2025 15:00 β€” πŸ‘ 83    πŸ” 46    πŸ’¬ 1    πŸ“Œ 3
Post image

New paper in Imaging Neuroscience by Ammar I. Marvi, Nancy G. Kanwisher, et al:

An efficient multifunction fMRI localizer for high-level visual, auditory, and cognitive regions in humans

doi.org/10.1162/IMAG...

15.10.2025 05:10 β€” πŸ‘ 51    πŸ” 14    πŸ’¬ 0    πŸ“Œ 1

Our young lab greatly appreciates open datasets and code provided by pioneers in the field! @alexanderhuth.bsky.social
@amandalebel.bsky.social
@rjantonello.bsky.social
@mtoneva.bsky.social
@samnastase.bsky.social
@jixingli.bsky.social
@mschrimpf.bsky.social
@evfedorenko.bsky.social
etc
3/end

29.09.2025 17:39 β€” πŸ‘ 5    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

@neuranna is following 20 prominent accounts