Building on Kyle's and Jenn's responses β it seems to me the analogy is: grammaticality is to BLiMP and SyntaxGym as truth is to COMPS and plausibility (albeit that's not binary) is to EWoK. So, to apply our framework to those datasets, perhaps one should swap truth/plausibility for grammaticality?
13.11.2025 01:57 β π 2 π 0 π¬ 1 π 0
Screenshot of a figure with two panels, labeled (a) and (b). The caption reads: "Figure 1: (a) Illustration of messages (left) and strings (right) in toy domain. Blue = grammatical strings. Red = ungrammatical strings. (b) Surprisal (negative log probability) assigned to toy strings by GPT-2."
New work to appear @ TACL!
Language models (LMs) are remarkably good at generating novel well-formed sentences, leading to claims that they have mastered grammar.
Yet they often assign higher probability to ungrammatical strings than to grammatical strings.
How can both things be true? π§΅π
10.11.2025 22:11 β π 90 π 20 π¬ 2 π 3
thinking of calling this "The Illusion Illusion"
(more examples below)
01.12.2024 14:33 β π 1585 π 387 π¬ 60 π 91
Helps to be a linguist!
14.11.2024 11:45 β π 4 π 0 π¬ 0 π 0
Been listening a lot to Ella Jenkins the last couple of days. What wonderful music and performance!
11.11.2024 23:44 β π 7 π 1 π¬ 1 π 0
Within the past two weeks I deleted apps for media companies owned by 67% of the worldβs richest billionaires, and it felt great!
10.11.2024 12:47 β π 33 π 2 π¬ 1 π 0
Not quite sure what you mean by a βcompleteβ corpus. I do think the basic philosophical assumptions of frequentist probability are applicable to corpora, using the large-numbers-of-native-speakers thought experiment.
And productivity is a property of the asymptotic distribution, if Iβm getting you.
10.11.2024 12:19 β π 0 π 0 π¬ 0 π 0
If there were enough native speakers of the language living at once, youβd quickly get enough instances of the prefix for relative frequency estimation of the next token distribution. Too few humans are alive for this in practice, but thatβs not a problem for theoretical validity of the construct!
09.11.2024 23:57 β π 1 π 0 π¬ 1 π 0
You might be interested in this paper we did some time ago!
escholarship.org/content/qt69...
It supports your conjecture that, insofar as we think the βtrue distributionβ is a valid theoretical construct (which I consider a highly defensible position), large-N Cloze would not give it to us.
09.11.2024 23:03 β π 7 π 0 π¬ 1 π 0
Results of high stakes elections that happen only once every four years offer remarkable opportunities for overfitting theories of the electorate
08.11.2024 19:46 β π 10 π 1 π¬ 0 π 0
The book The Patterns of Comics by Neil Cohn
Interior page from The Patterns of Comics by Neil Cohn
Back cover of The Patterns of Comics by Neil Cohn
It's my book's release day! The Patterns of Comics is now officially published, featuring an extended data-driven analysis of the structures used in 350+ comics from Asia, Europe, and North America analyzing diversity, regularity, and change over time www.visuallanguagelab.com/poc
28.12.2023 13:18 β π 180 π 59 π¬ 6 π 2
screenshot of title and authors of paper + map with 18 colorful box callouts showing where datasets came from
GOOD MORNING BLUESKY!
Very excited about this new paper:
www.pnas.org/doi/10.1073/pnas.2300671120
Key Q: what predicts how much young kids (πΆ)talk?
How much π£ kids heard predicted how much πΆtalked, but other factors, e.g. momβs education, didnβt. #PsychSci #DevPsy π£π¬
INCOMING SUMMARYπ§΅ALERT 1/14
13.12.2023 14:52 β π 157 π 79 π¬ 5 π 1
#linguistics Bluesky: what are the best available quantitative measures of dialect/language mutual intelligibility? The more fine-grained, the better:Β I'm hoping to vividly illustrate at least one specific dialect continuum (e.g., the Romance languages of the Mediterranean coast)
10.12.2023 16:17 β π 3 π 1 π¬ 2 π 0
Today in linguists are NOT KIDDING when we say that your capacity for language enables you to understand sentences that have never before been uttered in human history.
20.11.2023 20:54 β π 534 π 189 π¬ 3 π 4
New postdoc opportunity to work jointly with @cantlonlab.bsky.social and me to understand cognition across species, age, and culture! cmu.wd5.myworkdayjobs.com/CMU/job/Pitt...
15.11.2023 17:26 β π 33 π 28 π¬ 0 π 1
Absolutely, the Nature EiC has it completely backwards. Checking for errors and quality of data (and of math, code, and argumentation) is the most important work that reviewers can do.
11.11.2023 00:18 β π 18 π 1 π¬ 1 π 0
Screenshot of portion of article linked to in post, where Nature EiC says that checking underlying data is not the job of peer review.
The quotes from Nature EiC Magdalena Skipper about whether journals should be checking for errors/data quality as part of peer review are quite surprising to me.
https://www.wsj.com/science/whats-wrong-with-peer-review-e5d2d428?st=dhrnljoa74fujcv&reflink=desktopwebshare_permalink
11.11.2023 00:11 β π 78 π 35 π¬ 14 π 8
βThis significant effect was found using a post hoc weighting procedure aligned with our overarching hypothesisβ?!?
10.11.2023 13:05 β π 1 π 0 π¬ 0 π 0
Snow geese fill the sky at sunset in Washington's Skagit Valley.
I am delighted to announce that the Department of Biology at the University of Washington is advertising for a tenure-track assistant professor position on the quantitative understanding of collective behavior.
I will be chairing the search; details are here: apply.interfolio.com/130336
07.11.2023 23:20 β π 266 π 151 π¬ 9 π 6
Glushko Dissertation Prize - Cognitive Science Society
I think itβs amazing that Cognitive Science gives recent PhDs $10K in UNRESTRICTED CASHβ¦right when folks are broke, exhausted, moving townβ¦and need it most.
Itβs almost Glushko season!
cognitivesciencesociety.org/glushko-diss...
07.11.2023 12:41 β π 12 π 8 π¬ 1 π 0
In a new TiCS article, @emaliemcmahon.bsky.social and I review a growing body of behavioral, neural, and computational evidence that social interactions are automatically extracted by the human visual system:
tinyurl.com/nhh2dhxt
#PsychSciSky #NeuroSkyence
05.10.2023 14:22 β π 63 π 30 π¬ 3 π 1
While the world has its eyes on the Middle East, democratic conditions in Indonesia are looking grim. The Supreme Court has overruled the Constitution in order to allow the sitting president's son to stand as Vice Presidential candidate with a disgraced general with a stained human rights record
03.11.2023 23:06 β π 29 π 9 π¬ 1 π 3
LIU LAB
In the Look, Infer, and Understand (LIU) Lab at Johns Hopkins University, we are interested in how our minds and brains reason about the physical and social world. We study the developmental and neu...
I am reading PhD applications this year, with a special interest in students who would like to work on the topic of perceived danger. But open to all applicants who share some of my interests. Visit www.liulaboratory.org to see papers, lab values, and tips for application writing.
02.11.2023 17:27 β π 14 π 11 π¬ 0 π 0
Supervisory Research Scientist (Interdisciplinary)
This position is located at the Consumer Financial Protection Bureau (CFPB), Office of Research. The incumbent supervises and conducts independent, self-directed social/behavioral analysis on a variet...
The Consumer Financial Protection Bureau (CFPB) is hiring a section chief for the psych/βbehavioralβ section and I'd love to see some CogSci representation in there! My brother works there (trained as an economist) and it's an incredible gig doing research in the public interest.
03.11.2023 18:22 β π 6 π 9 π¬ 0 π 0
Santa Fe Institute now has a Blue Sky account: @sfiscience.bsky.social
03.11.2023 17:43 β π 39 π 12 π¬ 1 π 0
new work just dropped, see @stephan-meylan.bsky.social's "thread" below:)
#DevPsych #CogPsych #PsychSciSky #CogSciSky
26.10.2023 20:55 β π 19 π 3 π¬ 1 π 0
Thrilled at publication of
@stephan-meylan.bsky.social's "How adults understand what young children say", featuring Bayesian noisy-channel inference, LLMs, & child speech datasets!
TL;DR: prior expectations of what kids *want to say* is crucial. (Knowing how kids mispronounce words is too.)
26.10.2023 21:00 β π 23 π 9 π¬ 0 π 1
Professor of English Linguistics, UCL
Here, I post on (English) language topics.
On Substack, I post on English Grammar: https://basaarts.substack.com/
#grammar #syntax #parsing
Computational Linguist, PhD, Lecturer at TU Dublin, Researcher, robertsmithresearch.wordpress.com
Sociolinguist, novelist, photographer, quilter, saxophonist and Boglehead. Associate Professor of Linguistics at UMich.
Bonnie Katz Tenenbaum Professor & Associate Dean of Education
@Stanfordeducation.Bsky.social. Af. Am. Studies & Linguistics by courtesy.
PI of the @StanfordBADLab.Bsky.social
Interdisciplinary community advancing language science through research & training in science, education, tech, & health β’ linktr.ee/umd_lsc
Professor of Psychology and Public Affairs at Princeton
Postdoctoral researcher at Duke University | PhD in bioanth from UCLA | primate behavior, communication, social bonds, emotions & learning. Based in Portland, Oregon and Durham, NC
Zealous modeler. Annoying statistician. Reluctant geometer. Support my writing at http://patreon.com/betanalpha. He/him.
Human/AI interaction. ML interpretability. Visualization as design, science, art. Professor at Harvard, and part-time at Google DeepMind.
Postdoc at MIT. Generative models, inference, AI for science. Prev: Princeton, Meta, NUS. liusulin.github.io
Visiting Researcher @NYU Courant, CILVR.
PhD student @TU Denmark, MLLS(https://mlls.dk).
Probabilistic ML/DL. Nth order Markovian. Support Manifolds and latents.
Previously intern @SonyAI in deep generative modelling.
web: http://uppalanshuk.github.io
NLP Research Scientist at IBM Research
CS prof at Penn, Amazon Scholar in AWS. Interested in ML theory and related topics, as well as photography and Gilbert and Sullivan. Website: www.cis.upenn.edu/~mkearns
Phonetics/prosody/linguistics/cogpsy academic in UK & amateur classical pianist πΉ & fountain pen enthusiast ποΈ
Senior Lecturer, Department of Psychology, University of York. Research interests include bilingualism, language production, and cognitive ageing.
Cognitive neuroscientist. Trying to work out how the brain understands language.
http://timvieira.github.io/blog