Codebook LLMs: Evaluating LLMs as Measurement Tools for Political Science Concepts | Political Analysis | Cambridge Core
Codebook LLMs: Evaluating LLMs as Measurement Tools for Political Science Concepts
Very excited that my paper with @katakeith.bsky.social is now out in @polanalysis.bsky.social. We investigate whether LLMs actually follow the instructions/definitions provided in codebooks, propose some diagnostics, and release a new evaluation dataset.
www.cambridge.org/core/journal...
19.09.2025 13:45 โ ๐ 30 ๐ 14 ๐ฌ 0 ๐ 2
Whoa...!! If social-science leaning at all maybe try other preprint servers? SocArXiv for example? We put one of our preprints there: osf.io/preprints/so...
27.08.2025 19:02 โ ๐ 3 ๐ 0 ๐ฌ 1 ๐ 0
Yes! I agree. It's so rare these days to see a keynote that is so thorough and full of new conceptualizations.
12.08.2025 02:12 โ ๐ 4 ๐ 0 ๐ฌ 0 ๐ 0
5300 attendees in person here at #acl2025 ๐ฎ
30.07.2025 15:31 โ ๐ 4 ๐ 0 ๐ฌ 0 ๐ 0
The #ACL2025 #ACL2025NLP feed is up and running! It matches both hashtags and any posts from or mentions of @aclmeeting.bsky.social
Pin it to your home ๐ and enjoy!
bsky.app/profile/did:...
17.07.2025 11:15 โ ๐ 48 ๐ 14 ๐ฌ 2 ๐ 0
Topic @adeldaoud.bsky.social and I were discussing today at lunch at #ic2s2 and want to ask here:
What are the โknown factsโ in the social sciences? Which relationships between at least two social variables have been empirically found to have large effects and replicated by multiple groups?
24.07.2025 12:57 โ ๐ 4 ๐ 2 ๐ฌ 2 ๐ 0
Under review! Happy to share a draft if you email me. Thanks!
23.07.2025 19:14 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
Thanks:)
23.07.2025 14:39 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
Highlighting this thread. Based on what I'm seeing at #ic2s2 this week, this line of work is hot (if a bit crowded), but I predict will only be more widely adopted by social scientists in the future.
23.07.2025 13:07 โ ๐ 11 ๐ 3 ๐ฌ 1 ๐ 0
Not as recent, but still LLM-based
"WANLI: Worker and AI Collaboration for Natural Language Inference Dataset Creation." GPT-3 composes new examples with similar patterns to challenging examples.
aclanthology.org/2022.finding...
23.07.2025 13:05 โ ๐ 3 ๐ 0 ๐ฌ 0 ๐ 0
I thought this was a clever and useful paper from Xiong, ... Hovy, El-Assady, Ash "Co-DETECT: Collaborative Discovery of Edge Cases in Text Classification." Using LLMs to help humans refine their codebooks (before codebooks are fixed for the true annotation stage) arxiv.org/pdf/2507.05010
23.07.2025 13:00 โ ๐ 7 ๐ 0 ๐ฌ 0 ๐ 0
We used active learning to create a human-annotated dataset of 1050 instances from FOMC transcriptsโlabeled for FOMC membersโ opinions and directional stance towards monetary policy. Preprint and dataset should be released publicly by the end of the summer but email me for an advanced copy.
23.07.2025 12:52 โ ๐ 2 ๐ 0 ๐ฌ 0 ๐ 0
Congrats to Alisa Kanganis (Williams College โ25) for presenting her thesis work at #ic2s2 today!
23.07.2025 12:52 โ ๐ 11 ๐ 0 ๐ฌ 3 ๐ 0
Yay! I'm there as well. Let's sync up.
20.07.2025 11:31 โ ๐ 2 ๐ 0 ๐ฌ 0 ๐ 0
A Co-op for Computing
Faculty are diving into the exciting, data-crunching, AI world of GPMoo.
Honored by the feature on my research, grant, and GPU cluster by the Williams magazine. today.williams.edu/magazine/a-c...
28.05.2025 01:41 โ ๐ 9 ๐ 1 ๐ฌ 0 ๐ 0
Personally, I find I have to burn a day answering all the questions (particularly for a dataset release). I think it should be condensed to the 5 most important ones.
20.05.2025 18:27 โ ๐ 2 ๐ 0 ๐ฌ 1 ๐ 0
A full room for @katakeith.bsky.social's talk on proximal causal inference with text data โจโจโจ
27.01.2025 23:19 โ ๐ 17 ๐ 1 ๐ฌ 1 ๐ 0
Mark your calendars for these upcoming events tied to SCI and its One-U Responsible AI Initiative! Visit rai.utah.edu/events for details.
@parasharmanish.bsky.social @katakeith.bsky.social @anamarasovic.bsky.social @freiling.bsky.social
24.01.2025 22:30 โ ๐ 7 ๐ 4 ๐ฌ 0 ๐ 0
Our semi-synthetic experiments use MIIMIC-III clinical notes and two open-weight LLMs and show that our method produces estimates with low bias.
11.12.2024 01:10 โ ๐ 2 ๐ 0 ๐ฌ 0 ๐ 0
For settings with an unobserved (but known) confounding variable, we propose a new causal inference method that uses two instances of pre-treatment text data, infers two proxies using two zero-shot models on the separate instances, and applies these proxies in the proximal g-formula.
11.12.2024 01:10 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0
Check out our #NeurIPS2024 poster (presented by my collaborators Jacob Chen and Rohit Bhattacharya) about โProximal Causal Inference With Text Dataโ at 5:30pm tomorrow (Weds)!
neurips.cc/virtual/2024...
11.12.2024 01:10 โ ๐ 12 ๐ 4 ๐ฌ 1 ๐ 0
Details - Assistant/Associate Professor - Natural Language Processing (NLP) | Human Resources | UMass Amherst
We're hiring new #nlp faculty this year!
Asst or Assoc Professors in NLP at UMass CICS --
careers.umass.edu/amherst/en-u...
19.11.2024 14:33 โ ๐ 66 ๐ 34 ๐ฌ 1 ๐ 0
I'm excited to share that we've released v1.0 of our podcast corpus, SPoRC, led by my PhD student Ben Litterer! This first dataset is a slice of time, comprising over one million episodes from May and June 2020, including transcripts, diarization, and extracted audio features.
15.11.2024 15:03 โ ๐ 52 ๐ 16 ๐ฌ 1 ๐ 4
All - Bluesky Directory
A curated collection of all things relating to the Blue Sky social media platform.
Starter packs are genius, but I was surprised there wasn't a list of them for people to find.
So I built it:
blueskydirectory.com/starter-pack...
The website monitors the packs being shared and adds the ones it finds to the database.
Missed your stater pack? Message me and I'll get it added.
11.11.2024 16:13 โ ๐ 6576 ๐ 2975 ๐ฌ 1123 ๐ 434
New here? Interested in AI/ML? Check out these great starter packs!
AI: go.bsky.app/SipA7it
RL: go.bsky.app/3WPHcHg
Women in AI: go.bsky.app/LaGDpqg
NLP: go.bsky.app/SngwGeS
AI and news: go.bsky.app/5sFqVNS
You can also search all starter packs here: blueskydirectory.com/starter-pack...
09.11.2024 09:13 โ ๐ 557 ๐ 213 ๐ฌ 67 ๐ 55
๐ซ ๐ซถ
06.11.2024 20:35 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
In my NLP class (www.cs.williams.edu/~kkeith/teac...) next week, we're talking about eval.
I'd like to have a large section of the lecture focus on contamination. Crowd-sourcing--please send me your favorite contamination papers! Thanks! ๐
06.11.2024 20:27 โ ๐ 16 ๐ 3 ๐ฌ 6 ๐ 0
go.bsky.app/PCckf3C
05.11.2024 21:39 โ ๐ 17 ๐ 11 ๐ฌ 1 ๐ 0
Yay! thanks
06.11.2024 14:01 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
A workshop series where we make zines that answer the question: what happens when we imagine otherwise?
Run by @dylnbkr.bsky.social and @paulinewee.bsky.social from @dairinstitute.bsky.social
Sign up for workshops:
>>> https://linktr.ee/dair.futures <<<
Researcher in Bayesian statistical machine learning at Harvard University's Dept. of Biostatistics. Website: mikewojnowicz@github.io
Asst Prof @ University of Washington Information School // PhD in English from WashU in St. Louis
Iโm interested in books, data, social media, and digital humanities.
They call me "Eyre Jordan" on the bball court ๐
https://melaniewalsh.org/
Reader in Computational Social Science at the University of Edinburgh. he/him
Studying NLP, CSS, and Human-AI interaction. PhD student @MIT. Previously at Microsoft FATE + CSS, Oxford Internet Institute, Stanford Symbolic Systems
hopeschroeder.com
NLP / CSS PhD at Berkeley I School. I develop computational methods to study culture as a social language.
Assistant Prof. @csaudk.bsky.social | Fellow @cphsodas.bsky.social
Previous: @icepfl.bsky.social @americanexpress @Xerox @Intel
Interests: ๐ฅพ๐๏ธ๐ดโโ๏ธ๐๏ธโโ๏ธ๐ธ
#NLProc #LLMs #AgenticAI #Causality #GraphML
https://www.cs.au.dk/~clan/people/aarora
@UTAustin
"Nullius in verba"
Discussion โ planetarycausalinference.org/posts
Jobs โ aidevlab.org/jobs
Waitress turned Congresswoman for the Bronx and Queens. Grassroots elected, small-dollar supported. A better world is possible.
ocasiocortez.com
assistant professor in information, university of michigan. i have big intellectual feelings about language and technology.
https://tisjune.github.io/
she/her
The 2025 Conference on Language Modeling will take place at the Palais des Congrรจs in Montreal, Canada from October 7-10, 2025
Official account of Science Vs!
We're a science podcast that pits facts against everything else.
Made by Wendy Zukerman, Rose Rimler, Meryl Horn, Michelle Dang, Ekedi Fausther-Keeys, Blythe Terrell, Bobby Lord & independent fact checkers
The 11th International Conference on Computational Social Science (IC2S2) will be held in Norrkรถping, Sweden, July 21-24, 2025.
Website: https://www.ic2s2-2025.org/
Internet measurement, tech policy, and privacy researcher. Asst. Prof. at UIowa.
โจNEW PROJECTโจ Support science across the country by writing an opinion piece in your hometown or local newspaper.
This account is run by @cantlonlab.bsky.social & @spiantado.bsky.social.
Join us at sciencehomecoming.com
Mobilizing the fight for science and democracy, because Science is for everyone ๐งช๐
The hub for science activism!
Learn more โฌ๏ธ
http://linktr.ee/standupforscience
PhD student in NLP at UMass Amherst; previously Google, Stanford, BITS Goa. she/her.