Roger Levy @rplevy - Bluesky Profile

Building on Kyle's and Jenn's responses – it seems to me the analogy is: grammaticality is to BLiMP and SyntaxGym as truth is to COMPS and plausibility (albeit that's not binary) is to EWoK. So, to apply our framework to those datasets, perhaps one should swap truth/plausibility for grammaticality?

13.11.2025 01:57 — 👍 2 🔁 0 💬 1 📌 0

Screenshot of a figure with two panels, labeled (a) and (b). The caption reads: "Figure 1: (a) Illustration of messages (left) and strings (right) in toy domain. Blue = grammatical strings. Red = ungrammatical strings. (b) Surprisal (negative log probability) assigned to toy strings by GPT-2."

New work to appear @ TACL!

Language models (LMs) are remarkably good at generating novel well-formed sentences, leading to claims that they have mastered grammar.

Yet they often assign higher probability to ungrammatical strings than to grammatical strings.

How can both things be true? 🧵👇

10.11.2025 22:11 — 👍 90 🔁 20 💬 2 📌 3

thinking of calling this "The Illusion Illusion"

(more examples below)

01.12.2024 14:33 — 👍 1584 🔁 387 💬 60 📌 91

Helps to be a linguist!

14.11.2024 11:45 — 👍 4 🔁 0 💬 0 📌 0

Been listening a lot to Ella Jenkins the last couple of days. What wonderful music and performance!

11.11.2024 23:44 — 👍 7 🔁 1 💬 1 📌 0

Within the past two weeks I deleted apps for media companies owned by 67% of the world’s richest billionaires, and it felt great!

10.11.2024 12:47 — 👍 33 🔁 2 💬 1 📌 0

Not quite sure what you mean by a “complete” corpus. I do think the basic philosophical assumptions of frequentist probability are applicable to corpora, using the large-numbers-of-native-speakers thought experiment.

And productivity is a property of the asymptotic distribution, if I’m getting you.

10.11.2024 12:19 — 👍 0 🔁 0 💬 0 📌 0

If there were enough native speakers of the language living at once, you’d quickly get enough instances of the prefix for relative frequency estimation of the next token distribution. Too few humans are alive for this in practice, but that’s not a problem for theoretical validity of the construct!

09.11.2024 23:57 — 👍 1 🔁 0 💬 1 📌 0

You might be interested in this paper we did some time ago!

escholarship.org/content/qt69...

It supports your conjecture that, insofar as we think the “true distribution” is a valid theoretical construct (which I consider a highly defensible position), large-N Cloze would not give it to us.

09.11.2024 23:03 — 👍 7 🔁 0 💬 1 📌 0

Results of high stakes elections that happen only once every four years offer remarkable opportunities for overfitting theories of the electorate

08.11.2024 19:46 — 👍 10 🔁 1 💬 0 📌 0

The book The Patterns of Comics by Neil Cohn

Interior page from The Patterns of Comics by Neil Cohn

Back cover of The Patterns of Comics by Neil Cohn

It's my book's release day! The Patterns of Comics is now officially published, featuring an extended data-driven analysis of the structures used in 350+ comics from Asia, Europe, and North America analyzing diversity, regularity, and change over time www.visuallanguagelab.com/poc

28.12.2023 13:18 — 👍 180 🔁 59 💬 6 📌 2

screenshot of title and authors of paper + map with 18 colorful box callouts showing where datasets came from

GOOD MORNING BLUESKY!
Very excited about this new paper:

www.pnas.org/doi/10.1073/pnas.2300671120

Key Q: what predicts how much young kids (👶)talk?

How much 🗣 kids heard predicted how much 👶talked, but other factors, e.g. mom’s education, didn’t. #PsychSci #DevPsy 🗣💬

INCOMING SUMMARY🧵ALERT 1/14

13.12.2023 14:52 — 👍 157 🔁 79 💬 5 📌 1

#linguistics Bluesky: what are the best available quantitative measures of dialect/language mutual intelligibility? The more fine-grained, the better: I'm hoping to vividly illustrate at least one specific dialect continuum (e.g., the Romance languages of the Mediterranean coast)

10.12.2023 16:17 — 👍 3 🔁 1 💬 2 📌 0

Today in linguists are NOT KIDDING when we say that your capacity for language enables you to understand sentences that have never before been uttered in human history.

20.11.2023 20:54 — 👍 534 🔁 189 💬 3 📌 4

New postdoc opportunity to work jointly with @cantlonlab.bsky.social and me to understand cognition across species, age, and culture! cmu.wd5.myworkdayjobs.com/CMU/job/Pitt...

15.11.2023 17:26 — 👍 33 🔁 28 💬 0 📌 1

Absolutely, the Nature EiC has it completely backwards. Checking for errors and quality of data (and of math, code, and argumentation) is the most important work that reviewers can do.

11.11.2023 00:18 — 👍 18 🔁 1 💬 1 📌 0

Screenshot of portion of article linked to in post, where Nature EiC says that checking underlying data is not the job of peer review.

The quotes from Nature EiC Magdalena Skipper about whether journals should be checking for errors/data quality as part of peer review are quite surprising to me.

https://www.wsj.com/science/whats-wrong-with-peer-review-e5d2d428?st=dhrnljoa74fujcv&reflink=desktopwebshare_permalink

11.11.2023 00:11 — 👍 78 🔁 35 💬 14 📌 8

“This significant effect was found using a post hoc weighting procedure aligned with our overarching hypothesis”?!?

10.11.2023 13:05 — 👍 1 🔁 0 💬 0 📌 0

Snow geese fill the sky at sunset in Washington's Skagit Valley.

I am delighted to announce that the Department of Biology at the University of Washington is advertising for a tenure-track assistant professor position on the quantitative understanding of collective behavior.

I will be chairing the search; details are here: apply.interfolio.com/130336

07.11.2023 23:20 — 👍 266 🔁 151 💬 9 📌 6

Glushko Dissertation Prize - Cognitive Science Society

I think it’s amazing that Cognitive Science gives recent PhDs $10K in UNRESTRICTED CASH…right when folks are broke, exhausted, moving town…and need it most.

It’s almost Glushko season!

cognitivesciencesociety.org/glushko-diss...

07.11.2023 12:41 — 👍 12 🔁 8 💬 1 📌 0

LIT Lab Our goal is to understand the relationship between language and human thought. How does the language network in the brain interact with other systems to interpret meaning in the world? Can models...

First post and big news - I am starting as an Assistant Professor in Psychology at Georgia Tech in Jan 2024!

www.language-intelligence-thought.net

06.11.2023 16:52 — 👍 67 🔁 11 💬 4 📌 3

In a new TiCS article, @emaliemcmahon.bsky.social and I review a growing body of behavioral, neural, and computational evidence that social interactions are automatically extracted by the human visual system:

tinyurl.com/nhh2dhxt

#PsychSciSky #NeuroSkyence

05.10.2023 14:22 — 👍 63 🔁 30 💬 3 📌 1

While the world has its eyes on the Middle East, democratic conditions in Indonesia are looking grim. The Supreme Court has overruled the Constitution in order to allow the sitting president's son to stand as Vice Presidential candidate with a disgraced general with a stained human rights record

03.11.2023 23:06 — 👍 29 🔁 9 💬 1 📌 3

LIU LAB In the Look, Infer, and Understand (LIU) Lab at Johns Hopkins University, we are interested in how our minds and brains reason about the physical and social world. We study the developmental and neu...

I am reading PhD applications this year, with a special interest in students who would like to work on the topic of perceived danger. But open to all applicants who share some of my interests. Visit www.liulaboratory.org to see papers, lab values, and tips for application writing.

02.11.2023 17:27 — 👍 14 🔁 11 💬 0 📌 0

Supervisory Research Scientist (Interdisciplinary) This position is located at the Consumer Financial Protection Bureau (CFPB), Office of Research. The incumbent supervises and conducts independent, self-directed social/behavioral analysis on a variet...

The Consumer Financial Protection Bureau (CFPB) is hiring a section chief for the psych/”behavioral” section and I'd love to see some CogSci representation in there! My brother works there (trained as an economist) and it's an incredible gig doing research in the public interest.

03.11.2023 18:22 — 👍 6 🔁 9 💬 0 📌 0

Santa Fe Institute now has a Blue Sky account: @sfiscience.bsky.social

03.11.2023 17:43 — 👍 39 🔁 12 💬 1 📌 0

Spatial communication systems across languages reflect universal action constraints - Nature Human B... Coventry et al. show that spatial demonstratives—such as ‘this’ and ‘that’ in English—are selected on the basis of whether the speaker is able to reach the object or not, across 29 diverse...

A new cross-linguistic study on demonstratives by a team of psychologists and linguists: "Commonalities and differences across languages in spatial communication can be understood in terms of universal constraints on action shaping spatial language and cognition." www.nature.com/articles/s41...

31.10.2023 19:12 — 👍 16 🔁 4 💬 0 📌 1

PNAS Proceedings of the National Academy of Sciences (PNAS), a peer reviewed journal of the National Academy of Sciences (NAS) - an authoritative source of high-impact, original research that broadly spans...

New paper out!

"Large language models show human-like content biases in transmission chain experiments"
#CulturalEvolution #cssky 🧪

www.pnas.org/doi/10.1073/...

27.10.2023 07:27 — 👍 41 🔁 26 💬 1 📌 1

new work just dropped, see @stephan-meylan.bsky.social's "thread" below:)
#DevPsych #CogPsych #PsychSciSky #CogSciSky

26.10.2023 20:55 — 👍 19 🔁 3 💬 1 📌 0

Thrilled at publication of
@stephan-meylan.bsky.social's "How adults understand what young children say", featuring Bayesian noisy-channel inference, LLMs, & child speech datasets!

TL;DR: prior expectations of what kids *want to say* is crucial. (Knowing how kids mispronounce words is too.)

26.10.2023 21:00 — 👍 23 🔁 9 💬 0 📌 1

Roger Levy

Latest posts by rplevy.bsky.social on Bluesky

@rplevy is following 19 prominent accounts