Johannes B. Gruber's Avatar

Johannes B. Gruber

@jbgruber.bsky.social

Senior Researcher @gesis.org // Data Editor @polcommjournal.bsky.social πŸ”Ž political communication (#polsky + #commsky) with text analysis and #rstats (#opendata + #openscience) 🌏 JohannesBGruber.eu πŸ‘¨β€πŸ’» research software github.com/JBGruber

1,389 Followers  |  977 Following  |  450 Posts  |  Joined: 21.09.2023
Posts Following

Posts by Johannes B. Gruber (@jbgruber.bsky.social)

The sun is out, so 'enjoyed' a bit less internet on the weekend. How is it going in the world?! πŸ˜€

Ah, right...

03.03.2026 13:17 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

πŸ“’ Study out in Research & Politics!

We (@lukasbirkenmai1.bsky.social @wurthmann.bsky.social @msaeltzer.bsky.social) find that directly elected MPs talk more about local & deprived places than list MPs on social media.

πŸ” doi.org/10.1177/2053...

Some Highlights [1/3]

25.02.2026 17:33 β€” πŸ‘ 49    πŸ” 19    πŸ’¬ 4    πŸ“Œ 1

Β―\_(ツ)_/Β―

24.02.2026 21:04 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Mental model sketch to visualize the trade-off function to consider when choosing tools from the text-as-data toolbox. The plot shows each method’s capability range: the vertical span indicates the task complexity each method can handle, while the horizontal position shows the resources required to implement that method for tasks within its capability. The areas overlap because more sophisticated methods can also solve simpler tasks: LLMs can do everything dictionaries can do, just at higher cost. The β€œoptimal method line” represents the most efficient choice: the least resource-intensive method capable of handling a given task complexity.

Mental model sketch to visualize the trade-off function to consider when choosing tools from the text-as-data toolbox. The plot shows each method’s capability range: the vertical span indicates the task complexity each method can handle, while the horizontal position shows the resources required to implement that method for tasks within its capability. The areas overlap because more sophisticated methods can also solve simpler tasks: LLMs can do everything dictionaries can do, just at higher cost. The β€œoptimal method line” represents the most efficient choice: the least resource-intensive method capable of handling a given task complexity.

After showing it in several presentations already, the publication is where my mental model for choosing a #textasdata method finally found a home. Not perfect, but it has proven useful.

24.02.2026 11:17 β€” πŸ‘ 7    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
Newer, Larger, Better? A Critique of the Unreflective LLM Adoption in Communication Research The growing adoption of large language models (LLMs) in political communication research has prompted excitement but also concern. In this opinion piece, we offer an informed and critical overview ...

πŸ“’Publication alert πŸ“’

LLMs are an awesome technology, but they come with trade-offs!

Paul Balluff, @justinho.bsky.social, @seanhp.bsky.social, Alexis Palmer, @lrossi.bsky.social, Irina Shklovski, Chung-hong Chan and I commented on these trade-offs in #commsky research. Out now and open access!

24.02.2026 11:17 β€” πŸ‘ 20    πŸ” 9    πŸ’¬ 1    πŸ“Œ 0
clumsy, an utility for simulating broken network for Windows Vista / Windows 7 and above

There is solution for the internet too πŸ˜…!

15.02.2026 18:54 β€” πŸ‘ 5    πŸ” 1    πŸ’¬ 2    πŸ“Œ 0
Intercity Express TRAIN SOUND | ICE Fahrt GERΓ„USCH | 4+ HOURS | ASMR Meditation - Relaxation - Sleep
YouTube video by PureRelax4U Intercity Express TRAIN SOUND | ICE Fahrt GERΓ„USCH | 4+ HOURS | ASMR Meditation - Relaxation - Sleep

Lol there totally is youtu.be/0zxpSTU3FbE

15.02.2026 13:38 β€” πŸ‘ 5    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Maybe there is a ICE white noise sound. Could help you isolate one of the variables. If there is no uptick in productivity, you could put your office chair 9n a vibrating plate or buy a timer that randomly increases the remaining time.

15.02.2026 13:36 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

One of my favourite events of the year! Learn about new tools/methods/people and hack on some dataset/tool/project 😍

12.02.2026 08:17 β€” πŸ‘ 10    πŸ” 3    πŸ’¬ 0    πŸ“Œ 0

πŸš€ Yaay, it's time to register for the hackathon! πŸš€

10.02.2026 11:38 β€” πŸ‘ 8    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Post image

Join us for the #KODAQS #Toolbox #Talk tomorrow!

Yannik Peters will present #TextPrep, a tool to assess how preprocessing methods can improve the quality of social media data.

πŸ—“οΈ Thursday, 5 Feburary, 2026, 1-2 pm.
πŸ“Œ Online, register here: www.gesis.org/angebot/wiss...

04.02.2026 13:54 β€” πŸ‘ 3    πŸ” 3    πŸ’¬ 0    πŸ“Œ 0
Observing Opinions: What Are Language Models? by What is it about computational communication science? In this episode, we’re joined by Dr. Johannes Gruber from Vrije Universiteit Amsterdam to unpack the world of language models. Johannes explains what language models really are and how they shape how ...

I was interviewed for a podcast!

I talked with Wendo King'ang'i about (large) language models and opinion mining. It was recorded some time ago, so my affiliation is old, but the topic is still relevant, and I tried to make it as accessible as I could.

02.02.2026 08:56 β€” πŸ‘ 18    πŸ” 4    πŸ’¬ 0    πŸ“Œ 1
Outlook dialogue box asking if the user wants to stay signed in with the box ticked to not show this dialogue box again. The dialogue appears roughly every 12 hours, and ticking the box has no effect.

Outlook dialogue box asking if the user wants to stay signed in with the box ticked to not show this dialogue box again. The dialogue appears roughly every 12 hours, and ticking the box has no effect.

I think what annoys me most about the Microsoft products I am forced to work with is that they constantly try to gaslight me into thinking I had any control over what they do.

02.02.2026 08:51 β€” πŸ‘ 5    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
How to precompute package vignettes or pkgdown articles As of earlier this year, we are now automatically building binaries and pkgdown documentation for all rOpenSci packages. One issue we encountered is that some packages include vignettes that require s...

Have you considered precomputing vignettes? ropensci.org/blog/2019/12...

Not the cleanest approach, but I haven't found a better one.

30.01.2026 07:48 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image Post image Post image

partycoloR is now on CRAN! Started as a simple idea 6 years ago, now it's a full-featured package. Extract party colors and logos from Wikipedia with one line of code. It's already powering ParlGov Dashboard.

install.packages("partycoloR")

28.01.2026 08:20 β€” πŸ‘ 99    πŸ” 20    πŸ’¬ 0    πŸ“Œ 2
Preview
Mitarbeiter*in fΓΌr vertrauenswΓΌrdige Forschungsumgebungen (DSS-25) | Jobs bei GESIS – Leibniz-Institut fΓΌr Sozialwissenschaften GESIS ist eine der weltweit fΓΌhrenden Infrastruktureinrichtungen fΓΌr die Sozialwissenschaften und steht Forscher*innen mit Expertise und Infrastrukturangeboten auf allen Ebenen ihrer Forschungsprojekt...

There is a permanent position available in my team! You will help researchers gain access to highly sensitive data and analyse them in a secure environment: gesis.jobs.personio.de/job/2495658?...

Go for it and see you in Cologne ❀️🀍

26.01.2026 13:31 β€” πŸ‘ 10    πŸ” 8    πŸ’¬ 0    πŸ“Œ 0

3 rejections this year πŸ’”. But (mostly) very helpful comments and the chance to further develop these projects (maybe for #ica27 in Glasgow?). (And still one accepted presentation πŸ₯΅)

15.01.2026 12:58 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
2H Introduction to Web Scraping and Data Management for Social Scientists – Essex Summer School in Social Science Data Analysis

I will teach Web Scraping and Data Management at @essexsummerschool.bsky.social this year! Learn to:

- manage research data (in tables, files, databases etc.) πŸ“‚
- Systematically get data from the internet (APIs, scraping, browser automation) πŸ€–
- wrangle your data in #rstats to make it useful πŸ“ˆ

14.01.2026 14:13 β€” πŸ‘ 17    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0

Talking to someone recently provided me with a new perspective on LLM usage: if you don't give a f*ck about the quality and accuracy of the output/product, using LLMs is great πŸ˜…

12.01.2026 13:29 β€” πŸ‘ 5    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

If you do research with/about LLMs, image or audio models, or classic NLP methods, you should check out this call πŸ‘‡

#textasdata #NLP #llm #commsky #polsky

12.01.2026 08:54 β€” πŸ‘ 3    πŸ” 4    πŸ’¬ 0    πŸ“Œ 0

🧡 PhD position (75%) in Political Behavior / Political Communication / CSS
πŸ“ LMU Munich | ⏳ 3 years | πŸ—“ start March–May 2026

We’re hiring for DemocraGPT, a @bidt.bsky.social-funded project developing an AI-based training for difficult conversations in times of growing polarization

06.01.2026 09:32 β€” πŸ‘ 44    πŸ” 41    πŸ’¬ 2    πŸ“Œ 1

AAAAAAAAAAAAAAAAAAAAHHHHHHHHHHH

03.01.2026 16:11 β€” πŸ‘ 77    πŸ” 27    πŸ’¬ 0    πŸ“Œ 2

Not long ago I discussed with people what our red lines would be before not traveling to the US anymore. We landed on illegal occupation of another country, like Russia is attempting in Ukraine. We're there now.

03.01.2026 16:54 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

…Really undermines the sanctity of the FIFA Peace Prize, doesn’t it?

03.01.2026 14:45 β€” πŸ‘ 376    πŸ” 53    πŸ’¬ 10    πŸ“Œ 0

AAAAAAAAAAAAAAAAAAAAAAAHHHHHHHHH

03.01.2026 13:57 β€” πŸ‘ 53    πŸ” 25    πŸ’¬ 0    πŸ“Œ 0

😍 how did you get Arch on your phone? Is this the native terminal or termux?

31.12.2025 16:41 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
Applications via the EU's DSA Data Access Portal: An Introduction Researchers can submit requests to the EU's "Data Access Portal" to obtain non-public data from very large online platforms in certain cases. What information do researchers need to provide and what d...

January 16, 2026: Julian Jaursch from the German Digital Service Coordinator @bnetza.bsky.social will introduce the EU's #DSA Data Access Portal during the series "Show &β€―Tell – Social Media Data in Research Practice". Register here: events.gwdg.de/event/1259/ #digitalservicesact #platformdata

23.12.2025 09:48 β€” πŸ‘ 8    πŸ” 6    πŸ’¬ 1    πŸ“Œ 2
Post image

The Call for Papers and Panels for #COMPTEXT2026 in Birmingham (23-25 April) is out; feel free to circulate: shorturl.at/gRg0p!
Deadline: January 16!

17.12.2025 09:06 β€” πŸ‘ 21    πŸ” 15    πŸ’¬ 1    πŸ“Œ 4

Ich werde der Deutschen Bahn nie verzeihen, dass sie mich zu einem Menschen gemacht hat, der sich ΓΌber die Deutsche Bahn beschwert.

16.12.2025 07:05 β€” πŸ‘ 1785    πŸ” 145    πŸ’¬ 54    πŸ“Œ 11

Reflections on the day 1 of the age ban: experts sidelined again, nuance ignored, and the debate drowned by low-effort vox pops. Policy built on vibes not evidence gets sold as "protecting kids" with all the spectacle of a state propaganda march, loud, choreographed, & divorced from substance.

11.12.2025 00:53 β€” πŸ‘ 46    πŸ” 16    πŸ’¬ 2    πŸ“Œ 0