This is super cool! I had been contemplating starting a project like this a few years ago to actualize a vague interest I have in the math of knots... but this benchmark is probably much better than I could've managed. Can't wait to take a closer look πͺ’!
06.12.2025 17:28 β π 8 π 1 π¬ 1 π 0
We've create a simple and easy framework for documenting AI evaluation methods! Check it out and submit some for your own datasets with a PR!
You can learn more here: github.com/facebookrese...
And here: arxiv.org/abs/2512.04062
05.12.2025 19:32 β π 2 π 0 π¬ 0 π 0
My spouse is doing a week of science communication (in German) this week.
Topics may include cognitive science, linguistics, AI, political economy, science and technology studies, maybe even some philosophy too.
Check it outπ¨βπ¬!
01.12.2025 18:11 β π 4 π 0 π¬ 0 π 0
Yes! Social accountability seems to have been a major part of the process...if you get a ping telling you you're late from an (S)AC you know, you're much more likely to feel embarassed, drop everything, and do what you agreed to do...the fields growing pangs are a challenge for sure!
01.12.2025 18:04 β π 2 π 0 π¬ 0 π 0
I did once (arxiv.org/pdf/2311.18567) if you count this as compling paper (and not NLP). I know my coauthor Ryan has continued thinking about Judea Pearl style causality more afterwards too probably.
27.11.2025 03:27 β π 2 π 0 π¬ 0 π 0
Like gen1 may just be comitative. Which also had me doubting whether he1 really corresponds to 'and' either... but i suspect Haspelmath has tests or at least refs to some in that link.
18.11.2025 04:10 β π 1 π 0 π¬ 0 π 0
WALS Online -
Chapter Nominal and Verbal Conjunction
I stared trying to construct an argument that maybe mandarin differentiates NP (he2) and DP (gen1) conjunction. Then I tried to think about basic coordination of mandarin VPs (may only be periphrastic)... so perhaps this is a next reasonable place to point you to instead:
wals.info/chapter/64
18.11.2025 04:07 β π 2 π 0 π¬ 2 π 0
Top: A syntax tree for the sentence "the doctor by the lawyer saw the artist".
Bottom: A continuous vector.
π€π§ I'll be considering applications for PhD students & postdocs to start at Yale in Fall 2026!
If you are interested in the intersection of linguistics, cognitive science, & AI, I encourage you to apply!
PhD link: rtmccoy.com/prospective_...
Postdoc link: rtmccoy.com/prospective_...
14.11.2025 16:40 β π 36 π 13 π¬ 2 π 2
Screenshot of a figure with two panels, labeled (a) and (b). The caption reads: "Figure 1: (a) Illustration of messages (left) and strings (right) in toy domain. Blue = grammatical strings. Red = ungrammatical strings. (b) Surprisal (negative log probability) assigned to toy strings by GPT-2."
New work to appear @ TACL!
Language models (LMs) are remarkably good at generating novel well-formed sentences, leading to claims that they have mastered grammar.
Yet they often assign higher probability to ungrammatical strings than to grammatical strings.
How can both things be true? π§΅π
10.11.2025 22:11 β π 85 π 19 π¬ 2 π 3
Nice! Might this shed some light on unnatural language processing effects? Thinking of older work I was doing with @koustuvsinha.com. We found examples where word reordering causing ungrammaticality could improve classifier results, and much discussion followed. Could be fun to think these together!
11.11.2025 14:48 β π 5 π 0 π¬ 0 π 0
Please respond to this survey if you have changed or have thought about changing your name in academic publishing! For any reason, whether it be transition, recognizability, marriage, privacy, immigration, cultural reasons, etc.
Please RT for reach :)
10.11.2025 15:11 β π 6 π 4 π¬ 0 π 0
UT Austin Computational Linguistics Research Group β Humans processing computers processing humans processing language
UT Austin Linguistics is hiring in computational linguistics!
Asst or Assoc.
We have a thriving group sites.utexas.edu/compling/ and a long proud history in the space. (For instance, fun fact, Jeff Elman was a UT Austin Linguistics Ph.D.)
faculty.utexas.edu/career/170793
π€
07.10.2025 20:53 β π 41 π 27 π¬ 1 π 4
I agree this thread's headline claim seems premature. Let me add our recent ACL Findings paper, with Dexter Ju and @hagenblix.bsky.social, which found syntactic simplification in at least some LMs, in a novel domain regeneration setting: aclanthology.org/2025.finding...
15.08.2025 04:35 β π 6 π 1 π¬ 1 π 0
Our team is hiring a postdoc in (mechanistic) interpretability! The ideal candidate will have research experience in interpretability for text and/or image generation models and be excited about open science!
Please consider applying or sharing with colleagues: metacareers.com/jobs/2223953961352324
15.07.2025 20:11 β π 11 π 5 π¬ 0 π 0
Deflating βHypeβ Wonβt Save Us
By Hagen Blix & Ingeborg Glimmer
Ingeborg and I wrote a thing about "hype", and why we think that framing AI through that lens is increasingly inadequate - check it out!
03.07.2025 20:12 β π 26 π 11 π¬ 2 π 2
My spouse's new book has a chapter getting into the weeds around AI and high paid jobs...AI powered deskilling, drawing historical connections back as far as the steam engine, economic reasoning pushing technical innovation etc. Check it π I think you might like it.
bsky.app/profile/hage...
27.06.2025 21:56 β π 3 π 0 π¬ 0 π 0
Have you heard about this year's shared task? π’
Mechanistic Interpretability (MI) is quickly advancing, but comparing methods remains a challenge. This year at #BlackboxNLP, we're introducing a shared task to rigorously evaluate MI methods in language models π§΅
23.06.2025 14:45 β π 16 π 4 π¬ 1 π 1
Come by to our panel at APS to share your thoughts, and ask us all the hard stuff!
24.05.2025 13:10 β π 5 π 1 π¬ 0 π 0
Check it out! Guy et al. explores the impact of format on function vectors, and invites further conversation about what it would mean to have universal goal representations in LLMs.
(I've hung around interp communities for a while but this is my 1st mech-interp project. feedback much appreciated!)
23.05.2025 17:53 β π 12 π 0 π¬ 0 π 0
COLM 2025: Workshops
COLM 2025 workshops are announced now, check it out: colmweb.org/Workshops.html
20.05.2025 16:58 β π 5 π 1 π¬ 0 π 0
Awesome, can't wait to read it; congrats Maya!
09.05.2025 16:20 β π 2 π 1 π¬ 0 π 0
Book Launch: Auto-Correct: The Fantasies and Failures of AI, Ethics, and the Driverless Car Β· Luma
A conversation with Maya Indira Ganesh on how driverless cars are reshaping governance, responsibility, and values
What can a driverless car tell us aboutβ¦
Pls come to my book launch on May 22, 5pm, at Newnham College Cambridge cohosted w @mctd.bsky.social and @cfi-cambridge.bsky.social and with stellar discussants @gsvoss.bsky.social and Jennifer Schooling Register here: lu.ma/9vm405sk
09.05.2025 13:37 β π 20 π 12 π¬ 4 π 2
This is such a fun example of LM weirdness (which also shows how they match form over fact!)
More linguistically: it looks like ending a query with "meaning" triggers the bot to accommodate the presupposition that the input contains an idiom! (Hard to run normal preposition tests here tho)
23.04.2025 21:03 β π 5 π 1 π¬ 0 π 0
Foundations of AI β Initiative for the Theoretical Sciences
Friday, 11 April 9:30 AM - 4:00 PM Rooms 9206/9207, Graduate Center CUNY Join us at the Initiative for Theoretical Sciences for an interdisciplinary exploration of the foundations of artificial i...
Happy to be at CUNY today to workshop about the theoretical foundations of AI! Fun to have several perspectives---math, philosophy, linguistics and more---together in one place!
w/@giannig.bsky.social @seiller.bsky.social J. Terilla et al.
itsatcuny.org/calendar/202...
11.04.2025 14:15 β π 7 π 1 π¬ 0 π 0
Fantastic news, congrats to you and to BU π
27.03.2025 03:25 β π 1 π 0 π¬ 0 π 0
Content I'm so here for π€©
18.03.2025 14:09 β π 2 π 0 π¬ 0 π 0
AILuminate: Introducing v1.0 of the AI Risk and Reliability Benchmark from MLCommons
The rapid advancement and deployment of AI systems have created an urgent need for standard safety-evaluation frameworks. This paper introduces AILuminate v1.0, the first comprehensive industry-standa...
Happy to share that the paper describing the AILuminate v1.0 benchmark is now out! arxiv.org/abs/2503.05731
The benchmark is designed with @mlcommons.org to assess LLM risk and reliability across 12 hazard categories. AILuminate is available for testing models and helping ensure safer deployment!
12.03.2025 14:50 β π 3 π 1 π¬ 0 π 0
using computational methods to understand the linguistic mechanisms of social problems | NLP, socioling, discourse-pragmatics | asst prof at UC Davis Linguistics
https://robvoigt.faculty.ucdavis.edu/
Every future imagined by a tech company is worse than the previous iterationβ¦or something like that.
chenzizhao.github.io unlearning natural stupidity while phding @cornelltech.bsky.social
Reporter covering AI for The Guardian. Senior Tarbell Fellow @ Tarbell Center for AI Journalism. SABEW award for international reporting, 2024.
Animation tidbits from around the world, including rare and underappreciated material you won't find elsewhere.
Newsletter: http://animationobsessive.substack.com
Blog: https://argmin.substack.com/
Webpage: https://people.eecs.berkeley.edu/~brecht/
CS PhD student at UT Austin in #NLP
Interested in language, reasoning, semantics and cognitive science. One day we'll have more efficient, interpretable and robust models!
Other interests: math, philosophy, cinema
https://www.juandiego-rodriguez.com/
Head of Research at OneProject.org. Author, Design Justice (https://design-justice.pubpub.org) + Out of the Shadows, Into the Streets. #FreePalestine. Trans agenda. She/they/ella/elle. More: https://www.schock.cc
Internet linguist. Wrote Because Internet, NYT bestseller about internet language. Co-hosts @lingthusiasm.bsky.social, a podcast that's enthusiastic about linguistics.
she/her π
Montreal en/fr π¨π¦
gretchenmcculloch.com
author. towards trans communist futures. Family Abolition: Capitalism and the Communizing of Care (Pluto, 2023) and Everything for Everyone: An Oral History of the New York Commune, 2052β2072 (Common Notions, 2022). Editor at Pinko
journalist @zeit.de, @woz.ch, @nytimes.com etc | US politics, social movements, unions | based in New York and Berlin | author Β»Uprising β Amerikas neue LinkeΒ« www.lukashermsmeier.com
Tech reporter at Business Insider. Previously at TechCrunch. Bylines in WSJ, Wired, Vice, Foreign Policy.
Signal: charlesrollet.12
Interp & analysis in NLP
Mostly π¦π·, slightly π¨π±
website: https://t.co/ml5yPJjZLO Natural Language Processing and Machine Learning researcher at the University of Cambridge. Member of the PaNLP group: https://www.panlp.org/ and fellow of Fitzwilliam College.
Professor at Wharton, studying AI and its implications for education, entrepreneurship, and work. Author of Co-Intelligence.
Book: https://a.co/d/bC2kSj1
Substack: https://www.oneusefulthing.org/
Web: https://mgmt.wharton.upenn.edu/profile/emollick
Philosopher writing about theory of mind, moral psychology, character judgment, social norms. I bore undergraduates for a living. All opinions are my own. #philosophy #philsky
he/him
https://sites.google.com/site/ewestraphilosophy/
Linguist in AI & CogSci π§ π©βπ»π€ PhD student @ ILLC, University of Amsterdam
π https://mdhk.net/
π https://scholar.social/@mdhk
π¦ https://twitter.com/mariannedhk
Computational semanticist at the University of Cambridge.
LΓΊ chiaΜh-pΓ‘--bΕe?
[bridged from https://lingo.lol/@AngloPeranakan on the fediverse by https://fed.brid.gy/ ]