Pieter Delobelle @pieter.ai

Pieter Delobelle

@pieter.ai.bsky.social

LLM engineer at Aleph Alpha | 👨‍💻 Fairness in LLMs and Dutch NLP | Prev. apple, PhD & postdoc from KU Leuven pieter.ai

81 Followers | 158 Following | 8 Posts | Joined: 21.12.2024 | 1.6141

Latest posts by pieter.ai on Bluesky

SHADES: Towards a Multilingual Assessment of Stereotypes in Large Language Models Margaret Mitchell, Giuseppe Attanasio, Ioana Baldini, Miruna Clinciu, Jordan Clive, Pieter Delobelle, Manan Dey, Sil Hamilton, Timm Dill, Jad Doughman, Ritam Dutt, Avijit Ghosh, Jessica Zosa Forde, Ca...

It will be presented by @mmitchell.bsky.social and the paper can be read here:

aclanthology.org/2025.naacl-l...

02.05.2025 12:13 — 👍 5 🔁 1 💬 0 📌 0

Proud that our work on multilingual bias evals made it into @wired.com! The paper is being presented today at #NAACL2025.

📃 SHADES: Towards a Multilingual Assessment of Stereotypes in Large Language Models
📅 Session K (2/5 at 12:00) @ Ballroom B

02.05.2025 12:10 — 👍 7 🔁 0 💬 1 📌 0

Picture of a crowd listening to our awesome introduction of the session “towards tokenizer-free end to end architectures”

If you are at #ICLR25 and care about tokenizers, drop by Aleph Alpha’s Birds of a Feather session – happening now at Opal 103.

24.04.2025 04:41 — 👍 3 🔁 0 💬 0 📌 0

The end of GEITje 1 At the pressing request of Stichting BREIN, GEITje is no longer available as of today. All model files have been removed from my HuggingFace repositories1. GEITje was a Dutch-language large open langu...

So while I believe our use for tweety (and even my RobBERT model trained in 2019) is well within the law, it is a worrying precedent set by Brein.

geitje’s blog post here: goingdutch.ai/en/posts/gei...

30.01.2025 12:47 — 👍 0 🔁 0 💬 0 📌 0

.. instead of uni-backed Dutch LLMs like Fietje-2b by @bramvanroy.bsky.social (KUL) or our tweety-7b-dutch (KUL & UGent).

How copyright applies to LLMs is not so clearcut (it protects works from unauthorised distribution), since LLMs do not repeat training data unless severely oversampled.

30.01.2025 12:47 — 👍 1 🔁 0 💬 1 📌 0

I just found out that Stichting Brein took down GEITje, a Dutch 7B LLM made by Edwin Rijgersberg as a hobby project.

While its training corpus was indeed copyrighted (Gigacorpus), it is interesting that Brein went after a hobby project first.. 🧵 1/3

30.01.2025 12:47 — 👍 1 🔁 0 💬 1 📌 0

Parallia/Fairly-Multilingual-ModernBERT-Embed-BE · Hugging Face We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Not super multilingual, but for Dutch, German, French and English (all Belgian languages 🇧🇪) there is is this variant: huggingface.co/Parallia/Fai...

10.01.2025 14:38 — 👍 1 🔁 0 💬 0 📌 0

academic poster presenting the results of the research project.

TweetyIta and ItaEval are a language model and evaluation benchmark for Italian tasks. What's more, they are 100% community-driven and born within RiTA (rita-nlp.org). @asantilli.bsky.social will present the poster on Dec 5, 16:30-17:30.

+ Pieter Delobelle, Moreno La Quatra, @bsavoldi.bsky.social

04.12.2024 14:43 — 👍 6 🔁 4 💬 1 📌 1

Computer Science Conference Deadlines Map Interactive world map of Computer Science, AI, and ML conference deadlines

Unsure where to submit your next research paper to now that aideadlin.es is not updated anymore? And let’s be honest, is the location not as important as the conference itself?

🗺️ Check out my latest side-project: deadlines.pieter.ai

23.12.2024 14:39 — 👍 13 🔁 4 💬 0 📌 0

Meet our researchers from the DTAI lab at KU Leuven!

Using this starter pack, you can keep up with all the AI research from our PhD students, post-docs, professors and alumni 🦋

22.11.2024 14:57 — 👍 18 🔁 8 💬 2 📌 0

@pieter.ai is following 20 prominent accounts

Fengyi Zhu
@yvonneyvonnus

Linguistics lover |Engineering student| love accents, phonetics, semantics, mathematical linguistics/computational linguistics and cognitive science.| love comedies, music, photography. Languages toolkit: Chinese, Japanese, English BISU➡️➡️KU Leuven :))

Margaret Mitchell
@mmitchell

Researcher trying to shape AI towards positive outcomes. ML & Ethics +birds. Generally trying to do the right thing. TIME 100 | TED speaker | Senate testimony provider | Navigating public life as a recluse. Former: Google, Microsoft; Current: Hugging Face

Eva Hofman
@vaemanhof

👑Redacteur technologie en data bij De Groene Amsterdammer https://www.groene.nl/auteur/eva-hofman 🍸JOSEPHINE bij uitgeverij Pluim https://uitgeverijpluim.nl/josephine

Carolin Holtermann
@carolin-holtermann

Ph.D. Candidate in NLP focusing at @ds-hamburg.bsky.social on Multilinguality and Multiculturality #NLProc

michael veale
@michae.lv

assoc. prof, @laws.ucl.ac.uk, technology, policy, society, whimsical latvian top level domain names. michae.lv and fediverse https://someone.elses.computer/@mikarv 🏳️‍🌈

Jonas Schouterden
@joschout

Software Engineer at Google. ML PhD from DTAI. Alumnus Computer Science KU Leuven.

Patrick Haller
@phmaker

PhD student | parameter- and sample-efficient language modeling | at HU Berlin

Alan Akbik
@alanakbik

Professor of Machine Learning / NLP at Humboldt-Universität zu Berlin

Aidan Clark
@aidanclark

I train models @ OpenAI. Previously Research at DeepMind. Hae sententiae verbaque mihi soli sunt.

Edwin Rijgersberg
@edwinrijgersberg.nl

Machine learning engineer. I occasionally write about AI and its applications specifically for the Dutch language. Of course, views expressed are solely my own.

Workshop on Online Abuse and Harms
@woahworkshop

Workshop on Online Abuse and Harms (WOAH) to be held at ACL 2025 in Vienna. This account is managed by @florplaza.bsky.social https://www.workshopononlineabuse.com/

@evavnmssnhv

steph
@urschrei

Academic at the ⋂ of cities, technology, and climate adaptation. Reluctant polygon enthusiast. Sometimes I work on computational geometry and spatial data libraries, which I promise almost never to discuss.

Stella Biderman
@stellaathena

I make sure that OpenAI et al. aren't the only people who are able to study large scale AI systems.

hessian.AI
@hessianai

hessian.AI conducts cutting-edge AI research, provides computing infrastructure & services, supports start-up projects, ensures the transfer to business and society and thus strengthens the AI ecosystem in Hesse & beyond. https://hessian.ai/legal-notice

Vilém Zouhar
@zouharvi

PhD student @ ETH Zürich | all aspects of NLP but mostly evaluation and MT | go vegan | https://vilda.net

Erik Arakelyan
@kirekara

Researcher @Nvidia | PhD from @CopeNLU | Formerly doing magic at @Amazon Alexa AI and @ARM. ML MSc graduate from @UCL. Research is the name of the game. ᓚᘏᗢ http://osoblanco.github.io

Anna Rogers
@annarogers

Associate professor at IT University of Copenhagen: NLP, language models, interpretability, AI & society. Co-editor-in-chief of ACL Rolling Review. #NLProc #NLP

Isabelle Augenstein
@iaugenstein

Professor at the University of Copenhagen. Explainable AI, Natural Language Processing, ML. Head of copenlu.bsky.social lab. #NLProc #NLP #XAI http://isabelleaugenstein.github.io/

Marzena Karpinska
@markar

#nlp researcher interested in evaluation including: multilingual models, long-form input/output, processing/generation of creative texts previous: postdoc @ umass_nlp phd from utokyo https://marzenakrp.github.io/