CLAUSE - Computational Linguistics @ Bielefeld University's Avatar

CLAUSE - Computational Linguistics @ Bielefeld University

@clausebielefeld.bsky.social

CompLing group (CLAUSE) at Bielefeld U (PI: Sina Zarrieß). We work on: NLG, Language & Vision, Pragmatics & Dialogue, HateSpeech, BabyLMs, DH, and more! clause-bielefeld.github.io

410 Followers  |  372 Following  |  40 Posts  |  Joined: 13.11.2024  |  2.0986

Latest posts by clausebielefeld.bsky.social on Bluesky

Stefan Hartmann presenting a slide

Stefan Hartmann presenting a slide

Happening right now — @stefanhartmann.bsky.social presenting an extremely interesting case study on snowclones like »x is the new y«. 🗣️

22.01.2026 13:50 — 👍 13    🔁 1    💬 0    📌 1

Tomorrow!

21.01.2026 10:37 — 👍 7    🔁 1    💬 0    📌 0
Post image

I have just returned from a week-long visit to Bielefeld University! Thank you very much for hosting me Sina Zarrieß and @ozgealacam.bsky.social 😊 @clausebielefeld.bsky.social

18.01.2026 14:43 — 👍 8    🔁 2    💬 1    📌 0
Post image

This week we’re having @ecekt.bsky.social as our guest in Bielefeld. She gave a highly timely talk on language+vision models, how they process images under noise conditions, and about how to train a highly effective multimodal BabyLM with model merging. 🗣️👀💻

13.01.2026 10:42 — 👍 12    🔁 1    💬 0    📌 1
Post image

For years since the GPT-2 paper, emergent in-context learning (ICL) from 'next-token' training has been treated as something deeply tied to 𝐡𝐮𝐦𝐚𝐧 𝐥𝐚𝐧𝐠𝐮𝐚𝐠𝐞. But … is it?

18.11.2025 17:27 — 👍 2    🔁 2    💬 1    📌 1
AI generated image

AI generated image

Am I evil? Am I likeable?

Need a 10 minutes break? Like Fantasy? Loath it? Take part in our study and help us by rating images of fictional characters here:
bixprag.lili.uni-bielefeld.de/publix/0aSWK...

19.11.2025 10:25 — 👍 2    🔁 5    💬 0    📌 0
Post image

For this week’s group colloquium, we invited Loulou Kosmala from Paris-Est Créteil University. She gave a talk on multimodal feedback during all types of conversation, from real life to virtual, from learners to adults, from L1 to L2, and more! 🤩

11.11.2025 10:44 — 👍 3    🔁 0    💬 0    📌 0
Dialogue Is Not Enough to Make a Communicative BabyLM
(But Neither Is Developmentally Inspired Reinforcement Learning)
Francesca Padovani1∗ Bastian Bunzeck2∗ Manar Ali2 Omar Momen2
Arianna Bisazza1 Hendrik Buschmeier2 Sina Zarrieß2
1Center for Language and Cognition (CLCG), University of Groningen
2CRC 1646 – Linguistic Creativity in Communication, Bielefeld University
f.padovani@rug.nl bastian.bunzeck@uni-bielefeld.de

Dialogue Is Not Enough to Make a Communicative BabyLM (But Neither Is Developmentally Inspired Reinforcement Learning) Francesca Padovani1∗ Bastian Bunzeck2∗ Manar Ali2 Omar Momen2 Arianna Bisazza1 Hendrik Buschmeier2 Sina Zarrieß2 1Center for Language and Cognition (CLCG), University of Groningen 2CRC 1646 – Linguistic Creativity in Communication, Bielefeld University f.padovani@rug.nl bastian.bunzeck@uni-bielefeld.de

As part of this year's BabyLM challenge, we (researchers from @gronlp.bsky.social and @clausebielefeld.bsky.social diverged from established pretraining paradigm by training only on dialogue data from CHILDES.

28.10.2025 12:53 — 👍 16    🔁 3    💬 1    📌 0

Preprint alert! We release BabyBabelLM, a multilingual benchmark of developmentally plausible training data. I was responsible for German and Polish data as well as various child-directed wikis. Immensely rewarding project with exceptionally cool co-authors. 🥳🚀

14.10.2025 17:19 — 👍 11    🔁 3    💬 0    📌 1
Post image

𝐃𝐨 𝐲𝐨𝐮 𝐫𝐞𝐚𝐥𝐥𝐲 𝐰𝐚𝐧𝐭 𝐭𝐨 𝐬𝐞𝐞 𝐰𝐡𝐚𝐭 𝐦𝐮𝐥𝐭𝐢𝐥𝐢𝐧𝐠𝐮𝐚𝐥 𝐞𝐟𝐟𝐨𝐫𝐭 𝐥𝐨𝐨𝐤𝐬 𝐥𝐢𝐤𝐞? 🇨🇳🇮🇩🇸🇪

Here’s the proof! 𝐁𝐚𝐛𝐲𝐁𝐚𝐛𝐞𝐥𝐋𝐌 is the first Multilingual Benchmark of Developmentally Plausible Training Data available for 45 languages to the NLP community 🎉

arxiv.org/abs/2510.10159

14.10.2025 17:01 — 👍 42    🔁 16    💬 2    📌 1

Happening in an hour! 🥳

23.09.2025 13:36 — 👍 1    🔁 0    💬 0    📌 0

If you are at #IWCS, then you should not miss Sanne‘s talk ”Not Just Who or What: Modeling the Interaction of Linguistic and Annotator Variation in Hateful Word Interpretation“ (Sanne Hoeken, Özge Alacam, Dong Nguyen, Massimo Poesio, Sina Zarrieß), tomorrow at 16:30! 🕟
@sannehoeken.bsky.social

22.09.2025 10:15 — 👍 4    🔁 1    💬 0    📌 1
Sina in front of a slide with different size circles

Sina in front of a slide with different size circles

Sina Zarieß is giving the KONVENS keynote on training BabyLMs #nlproc
The slide shows the number of words a 12yo human has seen in their lifetime compared to the numbers of words typical language models have seen in training #llm

11.09.2025 11:43 — 👍 6    🔁 3    💬 0    📌 0
Post image

Happening now: Sina‘s keynote on our BabyLM work. 🥳

11.09.2025 11:34 — 👍 5    🔁 0    💬 0    📌 1
Post image

Great first day at #KONVENS2015 today. Looking forward to another engaging day with a keynote by Sina Zarrieß tomorrow 🤓
@clausebielefeld.bsky.social

10.09.2025 20:36 — 👍 2    🔁 1    💬 1    📌 0

Don’t miss Sina‘s keynote on BabyLMs at #konvens tomorrow!

10.09.2025 11:09 — 👍 3    🔁 0    💬 0    📌 0
Post image

Final Keynote of #semdial by David Schlangen on ”Meaningful Interaction with Unreal Speakers?“ 😇💬

05.09.2025 09:32 — 👍 2    🔁 0    💬 1    📌 0

Final day at #semdial2025 #bialogue — four more presentations, one key note and hopefully many engaging discussions. Let's go!

05.09.2025 06:11 — 👍 0    🔁 1    💬 0    📌 0
Post image

Second #semdial keynote by Robert Hawkins on ”Foraging for common ground“

04.09.2025 14:03 — 👍 3    🔁 0    💬 0    📌 0
Post image

Day 2 of #semdial starts with a session on LMs and dialogue systems 🤩

04.09.2025 06:40 — 👍 3    🔁 0    💬 0    📌 0
Post image

Actually yes! Dialogue differs distinctly from monologues in terms of phonetic features and in the production of novel phonetic forms!

03.09.2025 09:41 — 👍 2    🔁 0    💬 0    📌 0
Post image

Leonie Schade asks whether it takes two to do an articulatory tango 😁

03.09.2025 09:24 — 👍 6    🔁 1    💬 1    📌 0

And the second talk features contributions by our PI Sina Zarrieß. 🤩

03.09.2025 08:35 — 👍 6    🔁 0    💬 1    📌 0

#semdial has begun 💬

03.09.2025 07:33 — 👍 1    🔁 0    💬 0    📌 0
Post image

#semdial is about to begin 🥳

03.09.2025 07:01 — 👍 2    🔁 2    💬 1    📌 0

Program: semdial2025.github.io/program/
Proceedings: purl.org/semdial/2025...

02.09.2025 20:11 — 👍 0    🔁 0    💬 0    📌 0
Post image

#semdial2025, the long-awaited #bialogue conference starts tomorrow! We are looking forward to three wonderful conference days, featuring three exciting keynotes, and many oral and poster presentations on the semantics and pragmatics of dialogue. 👄💬
Check out the program and proceedings below. 👇

02.09.2025 20:10 — 👍 3    🔁 0    💬 1    📌 1
Post image

Let’s go!

01.08.2025 10:00 — 👍 3    🔁 0    💬 0    📌 0

Is simpler child-directed language easier to learn?

Check out our CoNLL paper "Do Construction Distributions Shape Formal Language Learning in German BabyLMs?"

@conll-conf.bsky.social

01.08.2025 09:24 — 👍 2    🔁 2    💬 1    📌 0
Preview
Components of Creativity: Language Model-based Predictors for Clustering and Switching in Verbal Fluency Sina Zarrieß, Simeon Junker, Judith Sieker, Özge Alacam. Proceedings of the 29th Conference on Computational Natural Language Learning. 2025.

Find the paper here: aclanthology.org/2025.conll-1...

01.08.2025 09:14 — 👍 3    🔁 0    💬 1    📌 0

@clausebielefeld is following 19 prominent accounts