Pieter Delobelle's Avatar

Pieter Delobelle

@pieter.ai.bsky.social

LLM engineer at Aleph Alpha | ๐Ÿ‘จโ€๐Ÿ’ป Fairness in LLMs and Dutch NLP | Prev. apple, PhD & postdoc from KU Leuven pieter.ai

81 Followers  |  158 Following  |  8 Posts  |  Joined: 21.12.2024  |  1.6141

Latest posts by pieter.ai on Bluesky

Preview
SHADES: Towards a Multilingual Assessment of Stereotypes in Large Language Models Margaret Mitchell, Giuseppe Attanasio, Ioana Baldini, Miruna Clinciu, Jordan Clive, Pieter Delobelle, Manan Dey, Sil Hamilton, Timm Dill, Jad Doughman, Ritam Dutt, Avijit Ghosh, Jessica Zosa Forde, Ca...

It will be presented by @mmitchell.bsky.social and the paper can be read here:

aclanthology.org/2025.naacl-l...

02.05.2025 12:13 โ€” ๐Ÿ‘ 5    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Proud that our work on multilingual bias evals made it into @wired.com! The paper is being presented today at #NAACL2025.

๐Ÿ“ƒ SHADES: Towards a Multilingual Assessment of Stereotypes in Large Language Models
๐Ÿ“… Session K (2/5 at 12:00) @ Ballroom B

02.05.2025 12:10 โ€” ๐Ÿ‘ 7    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Picture of a crowd listening to our awesome introduction of the session โ€œtowards tokenizer-free end to end architecturesโ€

Picture of a crowd listening to our awesome introduction of the session โ€œtowards tokenizer-free end to end architecturesโ€

If you are at #ICLR25 and care about tokenizers, drop by Aleph Alphaโ€™s Birds of a Feather session โ€“ happening now at Opal 103.

24.04.2025 04:41 โ€” ๐Ÿ‘ 3    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
The end of GEITje 1 At the pressing request of Stichting BREIN, GEITje is no longer available as of today. All model files have been removed from my HuggingFace repositories1. GEITje was a Dutch-language large open langu...

So while I believe our use for tweety (and even my RobBERT model trained in 2019) is well within the law, it is a worrying precedent set by Brein.

geitjeโ€™s blog post here: goingdutch.ai/en/posts/gei...

30.01.2025 12:47 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

.. instead of uni-backed Dutch LLMs like Fietje-2b by @bramvanroy.bsky.social (KUL) or our tweety-7b-dutch (KUL & UGent).

How copyright applies to LLMs is not so clearcut (it protects works from unauthorised distribution), since LLMs do not repeat training data unless severely oversampled.

30.01.2025 12:47 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

I just found out that Stichting Brein took down GEITje, a Dutch 7B LLM made by Edwin Rijgersberg as a hobby project.

While its training corpus was indeed copyrighted (Gigacorpus), it is interesting that Brein went after a hobby project first.. ๐Ÿงต 1/3

30.01.2025 12:47 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
Parallia/Fairly-Multilingual-ModernBERT-Embed-BE ยท Hugging Face Weโ€™re on a journey to advance and democratize artificial intelligence through open source and open science.

Not super multilingual, but for Dutch, German, French and English (all Belgian languages ๐Ÿ‡ง๐Ÿ‡ช) there is is this variant: huggingface.co/Parallia/Fai...

10.01.2025 14:38 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
academic poster presenting the results of the research project.

academic poster presenting the results of the research project.

TweetyIta and ItaEval are a language model and evaluation benchmark for Italian tasks. What's more, they are 100% community-driven and born within RiTA (rita-nlp.org). @asantilli.bsky.social will present the poster on Dec 5, 16:30-17:30.

+ Pieter Delobelle, Moreno La Quatra, @bsavoldi.bsky.social

04.12.2024 14:43 โ€” ๐Ÿ‘ 6    ๐Ÿ” 4    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 1
Preview
Computer Science Conference Deadlines Map Interactive world map of Computer Science, AI, and ML conference deadlines

Unsure where to submit your next research paper to now that aideadlin.es is not updated anymore? And letโ€™s be honest, is the location not as important as the conference itself?

๐Ÿ—บ๏ธ Check out my latest side-project: deadlines.pieter.ai

23.12.2024 14:39 โ€” ๐Ÿ‘ 13    ๐Ÿ” 4    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Meet our researchers from the DTAI lab at KU Leuven!

Using this starter pack, you can keep up with all the AI research from our PhD students, post-docs, professors and alumni ๐Ÿฆ‹

22.11.2024 14:57 โ€” ๐Ÿ‘ 18    ๐Ÿ” 8    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 0

@pieter.ai is following 20 prominent accounts