๐๐This paper will appear at ACL 2025 (@aclmeeting.bsky.social)! New updated version is on arXiv: arxiv.org/pdf/2505.07659 ๐๐
03.06.2025 13:45 โ ๐ 7 ๐ 0 ๐ฌ 0 ๐ 0@wegotlieb.bsky.social
Assistant Professor of Computational Linguistics @ Georgetown; formerly postdoc @ ETH Zurich; PhD @ Harvard Linguistics, affiliated with MIT Brain & Cog Sci. Language, Computers, Cognition.
๐๐This paper will appear at ACL 2025 (@aclmeeting.bsky.social)! New updated version is on arXiv: arxiv.org/pdf/2505.07659 ๐๐
03.06.2025 13:45 โ ๐ 7 ๐ 0 ๐ฌ 0 ๐ 0A key hypothesis in the history of linguistics is that different constructions share underlying structure. We take advantage of recent advances in mechanistic interpretability to test this hypothesis in Language Models.
New work with @kmahowald.bsky.social and @cgpotts.bsky.social!
๐งต๐!
We see this project as in line with some other recent papers seeking to cast typological variation in information-theoretic terms, with shout-outs to Michaela Socolof, @postylem.bsky.social @futrell.bsky.social (aclanthology.org/2022.coling-...) and Julius Steuer (aclanthology.org/2023.sigtyp-...)
13.05.2025 13:21 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0โญ โญThis paper also makes several technical contributions to the mixed-pair mutual information estimation pipeline of Wolf et al., (aclanthology.org/2023.emnlp-m...). Shout out to @cuiding.bsky.social for all her hard work on this aspect of the paper! โญโญ
13.05.2025 13:21 โ ๐ 2 ๐ 0 ๐ฌ 1 ๐ 0โ In line with our prediction, we find that mutual information is higher in tonal languages than in non-tonal languages. BUT, the way one represents context is important. When full sentential context is taken into account (mBERT and mGPT), the distinction collapses.
13.05.2025 13:21 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0๐๐We test this prediction by estimating mutual information in an audio dataset of 10 different languages across 6 language families. ๐๐
13.05.2025 13:21 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0We propose a way to do so using โฆ๐กinformation theory.๐ก In tonal languages, pitch reduces uncertainty about lexical identity, therefore, the mutual information between pitch and words should be higher.
13.05.2025 13:21 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0๐But there are intermediate languages, which have lexically contrastive tone, but only sporadically, making some linguists doubt the tonal/non-tonal dichotomy. So, how can we measure how โtonalโ a language is? ๐ง๐ง
13.05.2025 13:21 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0๐ Different languages use pitch in different ways. ๐ โTonalโ languages, like Cantonese, use it to make lexical distinctions. ๐ While others, like English, use it for other functions, like marking whether or not a sentence is a question. โ
13.05.2025 13:21 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0โญ๐ฃ๏ธNew preprint out: ๐ฃ๏ธโญ โUsing Information Theory to Characterize Prosodic Typology: The Case of Tone, Pitch-Accent and Stress-Accentโ with @cuiding.bsky.social , Giovanni Acampa, @tpimentel.bsky.social , @alexwarstadt.bsky.social ,Tamar Regev: arxiv.org/abs/2505.07659
13.05.2025 13:21 โ ๐ 11 ๐ 5 ๐ฌ 1 ๐ 2Iโll also use this as a way to plug human-scale language modeling in the wild: This yearโs BabyLM eval pipeline was just released last week at github.com/babylm/evalu.... For more info on BabyLM head to babylm.github.io
12.05.2025 15:48 โ ๐ 3 ๐ 0 ๐ฌ 0 ๐ 0Couldnโt be happier to have co-authored this will a stellar team, including: Michael Hu, @amuuueller.bsky.social, @alexwarstadt.bsky.social, @lchoshen.bsky.social, Chengxu Zhuang, @adinawilliams.bsky.social, Ryan Cotterell, @tallinzen.bsky.social
12.05.2025 15:48 โ ๐ 3 ๐ 1 ๐ฌ 1 ๐ 0This version includes ๐ฑNew analyses ๐ฑnew arguments ๐ฑ and a whole new โLooking Forwardโ section! If youโre interested in what a team of (psycho) computational linguists thinks the future will hold, check out our brand new Section 8!
12.05.2025 15:48 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0๐ฃPaper Update ๐ฃItโs bigger! Itโs better! Even if the language models arenโt. ๐คNew version of โBigger is not always Better: The importance of human-scale language modeling for psycholinguisticsโ osf.io/preprints/ps...
12.05.2025 15:48 โ ๐ 18 ๐ 3 ๐ฌ 1 ๐ 2Excited to share our preprint "Using MoTR to probe agreement errors in Russian"! w/ Metehan Oฤuz, @wegotlieb.bsky.social, Zuzanna Fuchs Link: osf.io/preprints/ps...
1- We provide moderate evidence that processing of agreement errors is modulated by agreement type (internal vs external agr.)
Me and @wegotlieb.bsky.social were recently invited to write a wide-ranging reflection on the current state of linguistic theory and methodology.
A draft is up here. For anyone interested in thinking big about linguistics, we'd be happy to hear your thoughts!
arxiv.org/abs/2502.18313
#linguistics
โ๏ธ๐ฃThis paper was a big departure from my typical cognitive science fare, and so much fun to write! ๐ฃโ๏ธ Thank you to @bwal.bsky.social and especially to @kevintobia.bsky.social for their legal expertise on this project!
19.02.2025 14:25 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0On the positive side, we suggest that LLMs can serve a role as โdialecticโ partners ๐ฃ๏ธโ๐ฃ๏ธ helping judges and clerks strengthen their arguments, as long as judicial sovereignty is maintained ๐ฉโโ๏ธ๐๐ฉโโ๏ธ
19.02.2025 14:25 โ ๐ 2 ๐ 0 ๐ฌ 1 ๐ 0โ๏ธ We also show, through demonstration, that itโs very easy to engineer prompts that steer models toward oneโs desired interpretation of a word or phrase. ๐Prompting is the new โdictionary shoppingโ ๐ฌ ๐ ๐ฌ
19.02.2025 14:25 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0๐๏ธWe identify five โmythsโ about LLMs which, when dispelled, reveal their limitations as legal tools for textual interpretation. To take one example, during instruction tuning, LLMs are trained on highly structured, non-natural inputs.
19.02.2025 14:25 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0We argue no! ๐ โโ๏ธ While LLMs appear to possess excellent language capabilities, they should not be used as references for โordinary language use,โ at least in the legal setting. โ๏ธ The reasons are manifold.
19.02.2025 14:25 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0๐๏ธLast year a U.S. judge queried Chat GPT to help with their interpretation of โordinary meaning,โ in the same way one might use a dictionary to look up the ordinary definition of a word ๐ โฆ But is it the same?
19.02.2025 14:25 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0๐ฃ New Paper โ๏ธ๐งโโ๏ธ๐๏ธ Large Language Models for Legal Interpretation? Don't Take Their Word for It ๐ฉโโ๏ธ๐๏ธโ๏ธ with @bwal.bsky.social , @complingy.bsky.social Amir Zeldes, and @kevintobia.bsky.social papers.ssrn.com/sol3/papers....
19.02.2025 14:25 โ ๐ 13 ๐ 3 ๐ฌ 1 ๐ 0