Gemma Boleda's Avatar

Gemma Boleda

@gboleda.bsky.social

linguistics x artificial intelligence x cognitive science | computational linguistics, NLP | COLT Research Group @colt-upf.bsky.social, ICREA @icreacommunity.bsky.social, Universitat Pompeu Fabra @upf.edu, @traduccioupf.bsky.social gboleda.github.io

154 Followers  |  109 Following  |  9 Posts  |  Joined: 11.11.2024
Posts Following

Posts by Gemma Boleda (@gboleda.bsky.social)

Post image

Behind every medical advance, social study or cultural insight, there is research.

#ICREAcall supports researchers with 20 permanent positions in Catalonia's research system:ย 

๐Ÿง‘โ€๐Ÿคโ€๐Ÿง‘ Social & Behavioural Sciencesย 
๐Ÿ“š Humanitiesย 
๐Ÿ”ฌ Life Sciences & Medicine

https://www.icrea.cat/icrea-call/

25.02.2026 08:31 โ€” ๐Ÿ‘ 4    ๐Ÿ” 3    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Per molts anys i grร cies #comunitatICREA / Happy anniversary and thank you ICREA. Becoming part of this community of researchers was a dream come true.

08.02.2026 21:18 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image Post image Post image Post image

๐ŸŽ‰ Avui รฉs un dia especial! Celebrem els #25anysICREA al CCCB.

Ens trobem mรฉs de 320 persones per celebrar un quart de segle de recerca d'excelยทlรจncia que ha transformat Catalunya i l'ha projectat al mรณn.

Avui, mรฉs que mai, tots #SomICREA.

06.02.2026 10:32 โ€” ๐Ÿ‘ 17    ๐Ÿ” 9    ๐Ÿ’ฌ 3    ๐Ÿ“Œ 7
Sample ManyNames images with associated names, in English and Mandarin Chinese

Sample ManyNames images with associated names, in English and Mandarin Chinese

Releasing v. 2.3 of ManyNames, an object naming dataset with 25K objects in real world images (English, plus partial coverage in Catalan and Mandarin Chinese). Check it out!

amore-upf.github.io/manynames/

(New in this version: further data cleaning, speaker ID, more lexical info)

15.01.2026 14:19 โ€” ๐Ÿ‘ 4    ๐Ÿ” 3    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Presented at #DLBCN, a very nice yearly event showcasing what is done around Deep Learning in Barcelona. Come to the next edition!

23.12.2025 19:31 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Thereโ€™s more to Neural Nets than big fat LLMs!

Weโ€™ve built a NN-agent framework to simulate how people choose the best word in a given communication context (i.e. pragmatic naming behavior).

With @yuqing0304.bsky.social, @ecesuurker.bsky.social, Tessa Verhoef, @gboleda.bsky.social

06.11.2025 21:07 โ€” ๐Ÿ‘ 4    ๐Ÿ” 2    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 1
Leibniz MMS Days 2026 - Abstract G. Boleda

Happy to announce a keynote lecture by @gboleda.bsky.social on "Why are Large Language Models so good at language?" at our Leibniz MMS Days next March (registration open until 7 January):
www.wias-berlin.de/workshops/MM...
www.wias-berlin.de/workshops/MM...

12.12.2025 11:31 โ€” ๐Ÿ‘ 5    ๐Ÿ” 2    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Ever wondered how our words change their meanings over time, and why languages keep both broad terms (โ€œdogโ€) and specific ones (โ€œDalmatianโ€)?
Our new paper asks that question, but instead of asking humans, we ask neural agents ๐Ÿค–
๐Ÿงต๐Ÿ‘‡

06.11.2025 13:52 โ€” ๐Ÿ‘ 4    ๐Ÿ” 2    ๐Ÿ’ฌ 8    ๐Ÿ“Œ 1

It's partially the former, partially the latter. We understand some aspects of how LLMs work, but there's A TON that we still don't understand. This is a super active area of research at the moment (keywords: explainable AI, interpretability).

15.10.2025 16:32 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Sigmoid function. Non-linearities in neural network allow it to behave in distributed and near-symbolic fashions.

Sigmoid function. Non-linearities in neural network allow it to behave in distributed and near-symbolic fashions.

New paper! ๐Ÿšจ I argue that LLMs represent a synthesis between distributed and symbolic approaches to language, because, when exposed to language, they develop highly symbolic representations and processing mechanisms in addition to distributed ones.
arxiv.org/abs/2502.11856

30.09.2025 13:15 โ€” ๐Ÿ‘ 27    ๐Ÿ” 11    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

CoNLL is over! Hereโ€™s most of the organizing team, next to the Danube in Vienna (missing
โ€ช@nvshrao.bsky.social and Snigdha Chaturvedi). #conll2025 @conll-conf.bsky.social @microth.bsky.social @emcheng.bsky.social

03.08.2025 13:18 โ€” ๐Ÿ‘ 7    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Announcing the COLT Symposium on June 2nd!

๐—˜๐—บ๐—ฒ๐—ฟ๐—ด๐—ฒ๐—ป๐˜ ๐—ณ๐—ฒ๐—ฎ๐˜๐˜‚๐—ฟ๐—ฒ๐˜€ ๐—ผ๐—ณ ๐—น๐—ฎ๐—ป๐—ด๐˜‚๐—ฎ๐—ด๐—ฒ ๐—ถ๐—ป ๐—บ๐—ถ๐—ป๐—ฑ๐˜€ ๐—ฎ๐—ป๐—ฑ ๐—บ๐—ฎ๐—ฐ๐—ต๐—ถ๐—ป๐—ฒ๐˜€

What properties of language are emerging from work in experimental and theoretical linguistics, neuroscience & LLM interpretability?

Info: tinyurl.com/colt-site
Register: tinyurl.com/colt-register

๐Ÿงต1/3

13.05.2025 09:00 โ€” ๐Ÿ‘ 4    ๐Ÿ” 2    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 2

๐Ÿšจ Guest Speaker Alert! ๐Ÿšจ

Weโ€™re thrilled to announce that #CoNLL2025 will feature: ๐Ÿฅ

Raquel Fernรกndez (University of Amsterdam)
&
Jean-Rรฉmi King (@jeanremiking.bsky.social, CNRS / Meta AI)!
๐ŸŽคโœจ

Check out their awesome work!๐Ÿ‘‡

04.03.2025 16:02 โ€” ๐Ÿ‘ 9    ๐Ÿ” 4    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 1
Post image

๐Ÿ“ข Upcoming Seminar

Words are weird? On the role of lexical ambiguity in language
๐Ÿ—ฃ Gemma Boleda (Universitat Pompeu Fabra, Spain)
Why is language so ambiguous? Discover how ambiguity balances cognitive simplicity and communicative complexity through large-scale studies.
๐Ÿ“ UniMiB, Room U6-01C, Milan

03.03.2025 13:41 โ€” ๐Ÿ‘ 13    ๐Ÿ” 6    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 0
Preview
LLMs as a synthesis between symbolic and continuous approaches to language Since the middle of the 20th century, a fierce battle is being fought between symbolic and continuous approaches to language and cognition. The success of deep learning models, and LLMs in particular,...

new pre-print: LLMs as a synthesis between symbolic and continuous approaches to language arxiv.org/abs/2502.11856

24.02.2025 16:29 โ€” ๐Ÿ‘ 15    ๐Ÿ” 2    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 2
Computational Linguistics Seminar Series at ILLC

๐Ÿ“ข The Computational Linguistics Seminar series: the Interplay between Language and Reasoning is scheduled for Thursday, February 6th, 2025 (16:30) and will feature Raffaella Bernardi, University of Trento โœจ

๐Ÿ“ L0.06 of LAB42 UvA (live streaming on Zoom ๐ŸŒ)

๐Ÿ“Ž projects.illc.uva.nl/LaCo/CLS/

04.02.2025 15:33 โ€” ๐Ÿ‘ 2    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
CoNLL 2025 | CoNLL

CoNLL 2025 Call for Papers ๐Ÿ˜€!
#CoNLL2025
conll.org
๐Ÿ”ด Co-located w/ ACL 2025 (July 31 - August 1)
โšช๏ธ This year CoNLL will only accept direct submissions (ddl: March 14 2025)
โšซ๏ธ CoNLL will accept both non-archival and archival submissions!

07.02.2025 21:28 โ€” ๐Ÿ‘ 2    ๐Ÿ” 2    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
CoNLL 2025 | CoNLL

This year, CoNLL will be accepting *non-archival* (as well as archival) submissions! www.conll.org #CoNLL2025

Follow CoNLL at
@conll-conf.bsky.social

05.02.2025 14:15 โ€” ๐Ÿ‘ 1    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

๐Ÿ”ŠNew EMNLP paper from Eleonora Gualdoni & @gboleda.bsky.social !

Why do objects have many names?

Human lexicons contain different words that speakers can use to refer to the same object, e.g., purple or magenta for the same color.

We investigate using tools from efficient coding...๐Ÿงต

1/3

02.12.2024 10:38 โ€” ๐Ÿ‘ 27    ๐Ÿ” 7    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Gemma Boleda

Can I be added? (Gemma Boleda, Barcelona, gboleda.github.io). Thanks!

11.11.2024 13:16 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0