Andrea Piergentili @apierg - Bluesky Profile

Great seminar last week by Alessandra Teresa Cignarella @cerveza-inglesa.bsky.social
"6,000 Papers, One New Dataset, and Several Confused LLMs: A Story About Stereotype Detection in NLP"
#NLP #StereotypeDetection

18.02.2026 09:21 — 👍 3 🔁 3 💬 0 📌 0

Our #PickOfTheWeek by @bsavoldi.bsky.social: "Attention to Non-Adopters" by @kaitlynzhou.bsky.social, @gligoric.bsky.social, @myra.bsky.social, @mlam.bsky.social, @vyoma-raman.bsky.social, Boluwatife Aminu, Caeley Woo, Michael Brockman, @hannah-cha.bsky.social, @jurafsky.bsky.social (2025).

18.02.2026 09:29 — 👍 3 🔁 3 💬 0 📌 0

Last week at the @fbk-mt.bsky.social seminars, we hosted Elizabeth Salesky from Google DeepMind, presenting her work on "Translation and Language Modeling with Pixels"

#NLProc #tokenization #MT

21.01.2026 11:05 — 👍 12 🔁 7 💬 0 📌 0

Glitter: A Multi-Sentence, Multi-Reference Benchmark for Gender-Fair German Machine Translation A Pranav, Janiça Hackenbuchner, Giuseppe Attanasio, Manuel Lardelli, Anne Lauscher. Findings of the Association for Computational Linguistics: EMNLP 2025. 2025.

Impressive work by the Glitter team: a new human-made benchmark for German gender-inclusive MT with long passages and multiple inclusive approaches + experiments showing that MT systems and LLMs still fall short in generating inclusive outputs.

aclanthology.org/2025.finding... ✨

07.01.2026 14:16 — 👍 2 🔁 1 💬 0 📌 1

It was great having Hosein Mohebbi (@hmohebbi.bsky.social) speak about interpretability for speech Transformers at our #MTSeminars! Thanks for the insights 🎤 #NLP #XAI

09.12.2025 10:17 — 👍 7 🔁 5 💬 0 📌 0

Jobs | Science and Technology Hub - Trento | A Researcher in Responsible and Trustworthy NLP

🚀 JOB ALERT 3: The FBK's MT Unit is hiring!

Join us as a Researcher in Responsible & Trustworthy NLP and advance ethical, fair, and transparent language technologies. If you care about building safe and accountable AI systems, you can apply here:
👉 jobs.fbk.eu/Annunci/Offe...

05.12.2025 10:14 — 👍 6 🔁 6 💬 0 📌 0

@bsavoldi.bsky.social presenting our new multilingual benchmark for evaluating LLMs on gender-neutral translation.

Catch our paper at #EMNLP2025
ℹ️ arxiv.org/pdf/2501.09409

#lt2025fbk

28.10.2025 10:44 — 👍 4 🔁 1 💬 0 📌 0

LT Highlights @ FBK 2025

🚀 Join us for the LT@FBK day 2025! Discover cutting-edge research and highlights in speech and language technologies from Fondazione Bruno Kessler (FBK)

📅 October 28, 2025
📍FBK, Trento
ℹ️ lt-highlights.fbk.eu

21.10.2025 10:15 — 👍 3 🔁 1 💬 0 📌 0

Last but definitely not least: @bsavoldi.bsky.social presenting joint work with @apierg.bsky.social @matteo-negri.bsky.social @luisabentivogli.bsky.social on scalable gender neutral translation evaluation using LLM-as-a-judge at #GITT2025

23.06.2025 14:07 — 👍 11 🔁 3 💬 6 📌 1

Agree to Disagree? A Meta-Evaluation of LLM Misgendering Numerous methods have been proposed to measure LLM misgendering, including probability-based evaluations (e.g., automatically with templatic sentences) and generation-based evaluations (e.g., with automatic heuristics or human validation). However, it has gone unexamined whether these evaluation methods have convergent validity, that is, whether their results align. Therefore, we conduct a systematic meta-evaluation of these methods across three existing datasets for LLM misgendering. We propose a method to transform each dataset to enable parallel probability- and generation-based evaluation. Then, by automatically evaluating a suite of 6 models from 3 families, we find that these methods can disagree with each other at the instance, dataset, and model levels, conflicting on 20.2% of evaluation instances. Finally, with a human evaluation of 2400 LLM generations, we show that misgendering behaviour is complex and goes far beyond pronouns, which automatic evaluations are not currently designed to capture, suggesting essential disagreement with human evaluations. Based on our findings, we provide recommendations for future evaluations of LLM misgendering. Our results are also more widely relevant, as they call into question broader methodological conventions in LLM evaluation, which often assume that different evaluation methods agree.

Super interesting paper by Subramonian et al: "Agree to Disagree? A Meta-Evaluation of LLM Misgendering" arxiv.org/abs/2504.17075
Turns out, misgendering is messier than just pronouns. I'd love to see this analysis extended to grammatical gender languages! #LLM #AI #ethics @fbk-mt.bsky.social

04.06.2025 14:09 — 👍 5 🔁 0 💬 0 📌 1

Qualtrics Survey | Qualtrics Experience Management The most powerful, simple and trusted way to gather experience data. Start your journey to experience management and try a free account today.

🔍 Stiamo studiando come l'AI viene usata in Italia e per farlo abbiamo costruito un sondaggio!

👉 bit.ly/sondaggio_ai...

(è anonimo, richiede ~10 minuti, e se partecipi o lo fai girare ci aiuti un sacco🙏)

Ci interessa anche raggiungere persone che non si occupano e non sono esperte di AI!

03.06.2025 10:24 — 👍 16 🔁 18 💬 1 📌 0

FAMA - a FBK-MT Collection The First Large-Scale Open-Science Speech Foundation Model for English and Italian

🚀 New tech report out! Meet FAMA, our open-science speech foundation model family for both ASR and ST in 🇬🇧 English and 🇮🇹 Italian.

The models are live and ready to try on @hf.co:
🔗 huggingface.co/collections/...

📄 Preprint: arxiv.org/abs/2505.22759

#ASR #ST #OpenScience #MultilingualAI

30.05.2025 15:35 — 👍 7 🔁 3 💬 0 📌 0

Will do 🫡

09.05.2025 08:48 — 👍 1 🔁 0 💬 0 📌 0

a woman is standing in front of a bookshelf in a bookstore and talking about research . ALT: a woman is standing in front of a bookshelf in a bookstore and talking about research .

👀 Wanted: #Italian or #Dutch native speakers to take a survey on audiovisual translation for a master thesis student: watch a short video, answer some questions, help academic research 😎
⏩ Sharing = nice! ❤️
NL link: ugent.qualtrics.com/jfe/form/SV_...
IT link: ugent.qualtrics.com/jfe/form/SV_...

09.05.2025 08:29 — 👍 6 🔁 8 💬 1 📌 0

a man in a suit is making a funny face with the words dreams are expensive behind him ALT: a man in a suit is making a funny face with the words dreams are expensive behind him

💭Dreaming of attending #GITT2025 but need a little extra 💸 boost?
📣 Bursary applications to support participation are now open at tinyurl.com/gitt25
📆 Deadline May 9th
🙏Thanks to our incredible sponsors DCA at Tilburg University tinyurl.com/tudca25 and FLW at Ghent University www.ugent.be/lw/en

29.04.2025 14:03 — 👍 7 🔁 7 💬 1 📌 0

Reserved topic scholarships | Doctoral Program - Information Engineering and Computer Science

📢 Come and join our group!
We offer a fully funded 3-year PhD position:

📔 Automatic translation with large multimodal models: iecs.unitn.it/education/ad...

📍Full details for application: iecs.unitn.it/education/ad...

📅 Deadline May 12, 2025

#NLProc #FBK

22.04.2025 10:14 — 👍 9 🔁 9 💬 1 📌 0

An LLM-as-a-judge Approach for Scalable Gender-Neutral Translation Evaluation Gender-neutral translation (GNT) aims to avoid expressing the gender of human referents when the source text lacks explicit cues about the gender of those referents. Evaluating GNT automatically is pa...

Happy to announce that our paper 'An LLM-as-a-judge Approach for Scalable Gender-Neutral Translation Evaluation' was accepted at @gitt-workshop.bsky.social ! 🙌

Check it out: arxiv.org/abs/2504.11934 🔥

Co-authors (🫶🏻): @bsavoldi.bsky.social, @matteo-negri.bsky.social, @luisabentivogli.bsky.social

17.04.2025 16:29 — 👍 11 🔁 3 💬 0 📌 0

Adding Chocolate to Mint: Mitigating Metric Interference in Machine Translation As automatic metrics become increasingly stronger and widely adopted, the risk of unintentionally "gaming the metric" during model development rises. This issue is caused by metric interference (Mint)...

Brilliant and necessary work by Pombal et al. about metric interference in MT system development and evaluation: arxiv.org/abs/2503.08327

Are we developing better systems or are we just gaming the metrics? And how do we address this?
Super (m)interesting! 👀

19.03.2025 15:25 — 👍 10 🔁 1 💬 0 📌 1

While we look forward to a sunny Geneva, why wait to join the conversation?

We’ve created a starter pack for our #GITT2025 friends!
🕵️ Follow researchers working on gender bias in MT
💬 Stay up to date and dive into the discussion!

All info at sites.google.com/tilburgunive...

28.02.2025 09:22 — 👍 21 🔁 16 💬 1 📌 1

BREAKING NEWS: CDC orders mass retraction and revision of submitted research across all science and medicine journals. Banned terms must be scrubbed. Any unpublished manuscript mentioning certain topics, including gender and "LGBT," must be pulled or revised.

BREAKING NEWS: CDC orders mass retraction and revision of submitted research across all science and medicine journals. Banned terms must be scrubbed.

Goes beyond MMWR +other CDC pubs. Applies to research already submitted to top medical journals.

Take a look.
open.substack.com/pub/insideme...

01.02.2025 21:20 — 👍 7677 🔁 4305 💬 573 📌 1631

🙌 All members of our group are now on Bluesky! 🙌

You can find all of us in this starter pack 👇

16.01.2025 09:51 — 👍 6 🔁 5 💬 0 📌 0

Looking ahead to 2025, my goal is to keep the momentum and build on this year’s lessons: being more intentional about time management, becoming a better collaborator, and and carving out time for deep, focused work.

27.12.2024 15:35 — 👍 1 🔁 0 💬 0 📌 0

The rest of the year was spent on testing new things, new collaborations, reading and reviewing papers, and traveling around for conferences. No doubt this has been the year where I learned the most so far, and 99% of the learning happened because I had access to some amazing (and patient) people.

27.12.2024 15:35 — 👍 1 🔁 0 💬 1 📌 0

I also developed a demo showcasing gender-neutral translation with LLMs, which I had the chance to present at FBK’s Digital Industry Center Demo Day. Unfortunately the demo is not open to the public for now, but here is a photo of @bsavoldi.bsky.social and me presenting it ✌️

27.12.2024 15:35 — 👍 1 🔁 0 💬 1 📌 0

Two key resources enabled the research progress we made this year: GeNTE (2023) and Neo-GATE (2024). They are benchmarks for the conservative and the innovative approach respectively and are both freely available on Hugging Face:

huggingface.co/datasets/FBK...
huggingface.co/datasets/FBK...

27.12.2024 15:35 — 👍 3 🔁 0 💬 1 📌 0

My research topic is gender-inclusive MT, and this year we explored two directions: the "conservative" one with gender-neutral translation and the "innovative" one, using neomorphemes (like ə and *, in Italian). I worked on papers published at venues ranging from top conferences to local workshops.

27.12.2024 15:35 — 👍 1 🔁 0 💬 1 📌 0

but look i made you some is written in white on a black background ALT: but look i made you some is written in white on a black background

With 2024 wrapping up, and given how little I’ve posted here (or anywhere, really), I thought I’d share a quick recap of my year and finally make some ✨content✨

27.12.2024 15:35 — 👍 3 🔁 0 💬 1 📌 0

Our @apierg.bsky.social presenting our #calamita challenges at #CLiCit2024: machine translation and gender-fair generation.

Poster session upcoming, see you there!

For more details:
👉 MagneT: clic2024.ilc.cnr.it/wp-content/u...
👉 GFG: clic2024.ilc.cnr.it/wp-content/u...

06.12.2024 16:22 — 👍 9 🔁 2 💬 0 📌 0

Our very own @dennisfucci.bsky.social presenting the challenges of Explainability for Speech Models at #CLiCit2024. If you’re interested, check out the paper 👉 clic2024.ilc.cnr.it/wp-content/u...
#NLProc

05.12.2024 16:04 — 👍 9 🔁 1 💬 0 📌 0

Andrea Piergentili

Latest posts by apierg.bsky.social on Bluesky

@apierg is following 20 prominent accounts