Andy Halterman's Avatar

Andy Halterman

@ahalterman.bsky.social

Assistant professor of political science at MSU. NLP, text, and conflict.

509 Followers  |  60 Following  |  6 Posts  |  Joined: 11.07.2023
Posts Following

Posts by Andy Halterman (@ahalterman.bsky.social)

Post image

Currently in FirstView: In “Codebook LLMs: Evaluating LLMs as Measurement Tools for Political Science Concepts,” @ahalterman.bsky.social and @katakeith.bsky.social show how “off-the-shelf” LLMs have limitations in faithfully following real-world codebook operationalizations.

27.11.2025 18:05 — 👍 1    🔁 1    💬 1    📌 0

Very short summary of this paper:

19.09.2025 19:25 — 👍 8    🔁 1    💬 0    📌 0
Codebook LLMs: Evaluating LLMs as Measurement Tools for Political Science Concepts | Political Analysis | Cambridge Core Codebook LLMs: Evaluating LLMs as Measurement Tools for Political Science Concepts

Very excited that my paper with @katakeith.bsky.social is now out in @polanalysis.bsky.social. We investigate whether LLMs actually follow the instructions/definitions provided in codebooks, propose some diagnostics, and release a new evaluation dataset.
www.cambridge.org/core/journal...

19.09.2025 13:45 — 👍 30    🔁 14    💬 0    📌 2
Post image

Currently in FirstView: “Synthetically generated text for supervised text analysis.” @ahalterman.bsky.social proposes using LLMs to generate synthetic training data for training smaller, traditional supervised text models.

28.04.2025 21:04 — 👍 3    🔁 4    💬 1    📌 1

I promise that training models on synthetic text is a better idea than it sounds. For the theoretically squeamish: think of this as model distillation (LLM --> small classifier). For the hardcore empiricists, there are a few F1-go-up plots.

31.01.2025 17:15 — 👍 1    🔁 0    💬 0    📌 0
Synthetically generated text for supervised text analysis | Political Analysis | Cambridge Core Synthetically generated text for supervised text analysis

New paper in Political Analysis on synthetic text data for training classifiers. Main idea: generate training examples with LLMs, then fit classifiers on synthetic (+real) text. Paper has validations and guidance.
Blog: andrewhalterman.com/post/synthet...
Paper: www.cambridge.org/core/journal...

31.01.2025 17:07 — 👍 14    🔁 4    💬 1    📌 1
A McSweeney's style parody.

TREADSTONE RECRUITMENT OR YOUR PHD PROGRAM?

Can you tell whether each quote is discussing a top-secret CIA black ops program or your department's graduate program?

1. “We'd hoped it might build into a good training platform, but quite honestly, for a strictly theoretical exercise, we thought it was far too expensive."

2. “You came to us. You volunteered. You said you'd do anything it takes."

3. "You haven't slept for a long time, have you? Have you made a decision? This can't go on, you know. You have to decide."

4. "The details? No. I mean, I was told it was voluntary. I don't know if that's true or not, but that's what I was told."

5. "Stop running from the truth. You chose to come here! You chose to stay! And no matter how much you want to forget it... eventually you're going to have to face how you chose to become [who you are]."

6.  "You could have left at any time. And you knew exactly what it meant for you if you chose to stay."

7. "You're not a liar are you? Or too weak to see this through?"

8. “Look, they took vulnerable subjects, okay? You mix that with the right pharmacology and some serious behavior modification..."

9.  “You made yourself into who you are."

A McSweeney's style parody. TREADSTONE RECRUITMENT OR YOUR PHD PROGRAM? Can you tell whether each quote is discussing a top-secret CIA black ops program or your department's graduate program? 1. “We'd hoped it might build into a good training platform, but quite honestly, for a strictly theoretical exercise, we thought it was far too expensive." 2. “You came to us. You volunteered. You said you'd do anything it takes." 3. "You haven't slept for a long time, have you? Have you made a decision? This can't go on, you know. You have to decide." 4. "The details? No. I mean, I was told it was voluntary. I don't know if that's true or not, but that's what I was told." 5. "Stop running from the truth. You chose to come here! You chose to stay! And no matter how much you want to forget it... eventually you're going to have to face how you chose to become [who you are]." 6. "You could have left at any time. And you knew exactly what it meant for you if you chose to stay." 7. "You're not a liar are you? Or too weak to see this through?" 8. “Look, they took vulnerable subjects, okay? You mix that with the right pharmacology and some serious behavior modification..." 9. “You made yourself into who you are."

03.12.2024 18:04 — 👍 4    🔁 2    💬 0    📌 0

I couldn't find a tutorial I liked to get students who know R up and running with Python, so I wrote my own! Part 1 is here: andrewhalterman.com/post/python_...
(And if you have a tutorial you like, please let me know!)

08.11.2023 19:09 — 👍 17    🔁 10    💬 3    📌 0
The "focus group" meme from I Think You Should Leave. The original text, "A great steering wheel that doesn't whiff out the window while I driving" is replaced with "A great steering wheel that whiffs out the window while I driving (Schelling 1966)"

The "focus group" meme from I Think You Should Leave. The original text, "A great steering wheel that doesn't whiff out the window while I driving" is replaced with "A great steering wheel that whiffs out the window while I driving (Schelling 1966)"

The optimal amount of dad joke cringe when teaching undergrads is > 0.

17.10.2023 12:46 — 👍 3    🔁 0    💬 0    📌 0