Georg Ahnert's Avatar

Georg Ahnert

@wanlo.bsky.social

PhD Student in Social Data Science at University of Mannheim | LLMs and Surveys | georgahnert.de

176 Followers  |  368 Following  |  14 Posts  |  Joined: 28.11.2023  |  2.1852

Latest posts by wanlo.bsky.social on Bluesky

πŸ‘‹ #ACL2025NLP πŸ‡¦πŸ‡Ή @marlutz.bsky.social and I are presenting our poster on demographic representativeness of LLMs today!

πŸ•¦ 10:30-12:00
πŸ“ Hall X5 (board 1 or 14 according to different sources 🧐)

Here’s the paper on ACL anthology: aclanthology.org/2025.finding...

Drop by!

29.07.2025 07:31 β€” πŸ‘ 20    πŸ” 7    πŸ’¬ 0    πŸ“Œ 1
Preview
IC2S2'25 NorrkΓΆping - YouTube This playlist contains all keynotes from IC2S2'25 in NorrkΓΆping, Sweden.

All the keynote recordings are available now, enjoy! www.youtube.com/playlist?lis...

25.07.2025 15:49 β€” πŸ‘ 40    πŸ” 19    πŸ’¬ 1    πŸ“Œ 1

Hereβ€˜s some of the slides πŸ‘‡ bsky.app/profile/mstr...

24.07.2025 16:20 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Chair for Data Science in the Economic and Social Sciences at University of Mannheim having lots of fun at #ic2s2 @janajung.bsky.social @wanlo.bsky.social @indiiigo.bsky.social @jrupprec.bsky.social @maximiliankreutner.bsky.social and Stefano Balietti

23.07.2025 15:22 β€” πŸ‘ 21    πŸ” 4    πŸ’¬ 0    πŸ“Œ 0
Laura Nelson on stage presenting her keynote at IC2S2. The slide lays out "A maturing field" of Computational Qualitative Research

Laura Nelson on stage presenting her keynote at IC2S2. The slide lays out "A maturing field" of Computational Qualitative Research

Really inspiring keynote by @lauraknelson.bsky.social this morning at #IC2S2 discussing when to model and when to generate societiesβ€”among many other themes in computational qualitative research!

23.07.2025 08:00 β€” πŸ‘ 12    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Before heading to ACL, I'm excited to be at #IC2S2 this week! 🌞

I'll present a related working paper on validating LLM social simulations at the ABM session on Tuesday (11 AM, Vingen 7): indiiigo.github.io/files/GABM_V...

(w/ @wanlo.bsky.social @mstrohm.bsky.social and @janalasser.bsky.social)

21.07.2025 20:00 β€” πŸ‘ 17    πŸ” 6    πŸ’¬ 0    πŸ“Œ 0
Screenshot of our paper "Missing the Margins: A Systematic Literature Review on the Demographic Representativeness of LLMs"

Screenshot of our paper "Missing the Margins: A Systematic Literature Review on the Demographic Representativeness of LLMs"

Details about what we annotated in our systematic review

Details about what we annotated in our systematic review

Do LLMs represent the people they're supposed simulate or provide personalized assistance to?

We review the current literature in our #ACL2025 Findings paper and investigating what researchers conclude about the demographic representativeness of LLMs:
osf.io/preprints/so...

1/

21.07.2025 10:11 β€” πŸ‘ 23    πŸ” 8    πŸ’¬ 1    πŸ“Œ 2
Post image

LLMs can understand political discourse, but can they actually predict votes of real politicians?

Excited to share my work at #IC2S2 this week!
I will present my poster on Tuesday between 1:30 and 3:30 p.m.

21.07.2025 10:37 β€” πŸ‘ 5    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0
The Swedish countryside as seen from a moving train, with a lake, a red and white house, and some cows.

The Swedish countryside as seen from a moving train, with a lake, a red and white house, and some cows.

The 15.07 train has a 30 min delay now but the landscapeβ€˜s quite pretty ;)

20.07.2025 16:19 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Really excited to also present this work at #IC2S2 next week in NorrkΓΆping! πŸŽ‰ I'd love to discuss how to produce LLM survey responses at my poster on Wed at 13:30 (Poster Session 2, Poster ID 68) πŸ“Š

18.07.2025 15:19 β€” πŸ‘ 17    πŸ” 6    πŸ’¬ 0    πŸ“Œ 0
Post image

LLMs can generate synthetic survey responses, e.g. for imputation, but how reliable are they? πŸ“‹

At #IC2S2, I'll be sharing our research on the robustness of AI-generated responses to perturbations and if they mirror human survey biases. πŸ€–
Come by my poster on Tuesday between 1:30 and 3:30 p.m.

18.07.2025 14:16 β€” πŸ‘ 7    πŸ” 3    πŸ’¬ 0    πŸ“Œ 0

Very excited to head to #IC2S2 next week! πŸŽ‰

In our project, we tested whether a psychological assessment can measure sexism in LLMs, and found that applying such tools to LLMs is not as straightforward as it seems.

Find me and my poster at Poster Session 1 (Tue 12:30-14:30) β€” hope to see you there

17.07.2025 19:14 β€” πŸ‘ 9    πŸ” 4    πŸ’¬ 0    πŸ“Œ 0
A research setup for the evaluation of Answer Production Methods for closed-ended survey responses from LLMs. An LLM is prompted with a survey and an optional instruction, before a Answer Production Method is applied. These methods range from token-probabilities to open-ended text generation + classification. I then evaluated them against human survey answers and calculate individual-level accuracy as well as distribution alignment for sub-populations.

A research setup for the evaluation of Answer Production Methods for closed-ended survey responses from LLMs. An LLM is prompted with a survey and an optional instruction, before a Answer Production Method is applied. These methods range from token-probabilities to open-ended text generation + classification. I then evaluated them against human survey answers and calculate individual-level accuracy as well as distribution alignment for sub-populations.

LLMs are trained to produce open-ended responses πŸ“, but most survey items require closed-ended responses instead πŸ“Š

This Wed 11:00–12:30 at #ESRA25, I'll discuss the large impact that Answer Production Methods have on prediction results + share recommendations for methods and parameters. πŸ‘ˆ

14.07.2025 13:34 β€” πŸ‘ 10    πŸ” 4    πŸ’¬ 0    πŸ“Œ 2

Thanks :) We have a BERT-based baseline model that labels individual tweetsβ€”but I agree, would be a very interesting comparison now that LLMs can increasingly handle super long contexts!

27.06.2025 09:53 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image Post image Post image

Day 3 morning sessions: language change and generative AI #ICWSM

26.06.2025 10:49 β€” πŸ‘ 7    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

Thanks a lot for the shoutout! Would be happy to talk about this and other ongoing projects on social simulation at #ICWSM next week πŸ™‚

16.06.2025 08:29 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Extracting Affect Aggregates from Longitudinal Social Media Data with Temporal Adapters for Large Language Models | Proceedings of the International AAAI Conference on Web and Social Media ...

Our conclusion: Temporal Adapters enable longitudinal analyses of affect aggregates from social media data by temporally aligning LLMs. ⏱️

Read the full paper: ojs.aaai.org/index.php/IC...

16.06.2025 08:23 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Our estimates with Llama 3 Temporal Adapters show a strong positive and significant correlation with collective frustration, fear, boredom, and sadness. Our results vary strongly between emotions, but they are in line with a baseline method's estimates.

Our estimates with Llama 3 Temporal Adapters show a strong positive and significant correlation with collective frustration, fear, boredom, and sadness. Our results vary strongly between emotions, but they are in line with a baseline method's estimates.

We also apply our method to the extraction of public attitudes towards Boris Johnson as a prime minister and towards the National Healthy Service, were we similarly find positive cross-correlation with survey data for some but not all answer options.

We also apply our method to the extraction of public attitudes towards Boris Johnson as a prime minister and towards the National Healthy Service, were we similarly find positive cross-correlation with survey data for some but not all answer options.

Results: From several collective emotions and public opinion, our longitudinal estimates show a strong positive and significant cross-correlation with survey data gathered by YouGov directly from human participants.

16.06.2025 08:23 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Overview of our method that shows how each week's Twitter data is used to train a separate Temporal Adapter, and how a weekly affect aggregate is then obtained from the LLM's token probabilities.

Overview of our method that shows how each week's Twitter data is used to train a separate Temporal Adapter, and how a weekly affect aggregate is then obtained from the LLM's token probabilities.

Method: We gather weekly text data from a panel of Twitter users and fine-tune Temporal Adapters for Llama 3 8B with it. πŸ¦™ We then prompt Llama with established survey questions, one week at a time, to extract longitudinal affect aggregates.

16.06.2025 08:23 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
A lineplot that shows how scared people in the UK were over time, during the first COVID-19 lockdown. Our method (Llama 3 Temporal Adapters) produces similar estimates of as the survey data gathered by YouGov.

A lineplot that shows how scared people in the UK were over time, during the first COVID-19 lockdown. Our method (Llama 3 Temporal Adapters) produces similar estimates of as the survey data gathered by YouGov.

Excited to present our paper with @maxpe.bsky.social, @dgarcia.eu, and @mstrohm.bsky.social next week at #ICWSM! ✨

We extend social simulation with LLMs to a longitudinal setting by fine-tuning Temporal Adaptersβ€”here's how: 🧡

16.06.2025 08:23 β€” πŸ‘ 17    πŸ” 5    πŸ’¬ 1    πŸ“Œ 2
Preview
DataFest πŸ‡©πŸ‡ͺ 2025 Call for applications 2025! πŸ“’ We are glad to announce our call for applications to join the 8th edition of DataFest Germany, which will take place at the Ludwig-Maximilians-UniversitΓ€t in Munich (Marc...

We're excited to announce #DataFest Germany 2025 at LMU Munich, March 28-30! In this #hackathon, students from diverse study programs compete for the best insights and visualizations from an exclusive dataset within 48 hours. More info: www.datafest.de/home

31.01.2025 09:43 β€” πŸ‘ 9    πŸ” 5    πŸ’¬ 0    πŸ“Œ 0

Great to see such strong arguments for using "open-weight" LLMs! Maybe setting random seeds could be added to the advice to practitioners? Most interfaces seem to support this nowβ€”huggingface, OpenAI, Ollama, vllm,…

19.12.2024 13:55 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Ready for another Computational Social Science Starter Pack?

Here is number 2! More amazing folks to follow! Many students and the next gen represented!

go.bsky.app/GoEyD7d

14.11.2024 23:42 β€” πŸ‘ 78    πŸ” 53    πŸ’¬ 33    πŸ“Œ 43

Sharing my first Computational Social Science starter pack! Will grow with time, feel free to nominate and self nominate!

go.bsky.app/CYmRvcK

13.11.2024 02:05 β€” πŸ‘ 98    πŸ” 41    πŸ’¬ 62    πŸ“Œ 3

Repost if you’ve participated in a Summer Institute in Computational Social Science. Let’s get #SICSS Bluesky going!

08.10.2023 19:49 β€” πŸ‘ 52    πŸ” 64    πŸ’¬ 0    πŸ“Œ 3

Thank you! I've got to admit that the above preprint is already my own work....

19.10.2024 14:28 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
Extracting Affect Aggregates from Longitudinal Social Media Data with Temporal Adapters for Large Language Models This paper proposes temporally aligned Large Language Models (LLMs) as a tool for longitudinal analysis of social media data. We fine-tune Temporal Adapters for Llama 3 8B on full timelines from a pan...

Maybe fine-tuning LLMs on digital traces of subpopulations could be interesting?

Might be biased though because I just worked on a paper where we explored this in a longitudinal setting with what we called Temporal Adapters for LLMs – arxiv.org/abs/2409.17990

19.10.2024 11:51 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

@wanlo is following 20 prominent accounts