GITT 2025's Avatar

GITT 2025

@gitt-workshop.bsky.social

Workshop on Gender-Inclusive Translation Technologies. 3rd edition happening at MT Summit 2025! Website: https://sites.google.com/tilburguniversity.edu/gitt2025

63 Followers  |  75 Following  |  82 Posts  |  Joined: 18.12.2024  |  2.349

Latest posts by gitt-workshop.bsky.social on Bluesky

Post image

And that's a wrap on a very hot πŸ₯΅ yet very rewarding #GITT2025
πŸ’­ We need more languages, better quality estimation, and intersectionality
😍 Thanks to everyone who was there, presented, contributed to the discussion, and helped with practicalities.
🌞 Enjoy the rest of @mtsummit2025.bsky.social!

24.06.2025 05:51 β€” πŸ‘ 12    πŸ” 4    πŸ’¬ 0    πŸ“Œ 1
Post image Post image

Results show GPT4-o and Qwen 72B outperforming the baseline classifier and improved accuracy for intermediate reasoning steps. More details in the paper - have a read! πŸ‘€ #GITT2025

23.06.2025 14:22 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Experiments using mGeNTE as reference arxiv.org/abs/2501.09409

23.06.2025 14:17 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image Post image

RQs: can LLMs identify neutral versus gendered translation? How do we improve the accuracy? Tests on 3 language pairs (English into Italian, Spanish, and German) on sentence and phrase level in monolingual and crosslingual scenarios

23.06.2025 14:15 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Work on LLM-as-judge suggests this could be useful for GNT evaluation as well

23.06.2025 14:11 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

As already raised by our keynote, quality estimation is a great industry need as there is often no time for human evaluation in production, this is particularly tricky for GNT

23.06.2025 14:10 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Not one single solution for gender neutral translation (GNT) (tradeoff adequacy/fluency?) making GNT a complex evaluation task

23.06.2025 14:09 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Last but definitely not least: @bsavoldi.bsky.social presenting joint work with @apierg.bsky.social @matteo-negri.bsky.social @luisabentivogli.bsky.social on scalable gender neutral translation evaluation using LLM-as-a-judge at #GITT2025

23.06.2025 14:07 β€” πŸ‘ 10    πŸ” 3    πŸ’¬ 6    πŸ“Œ 1
Post image Post image Post image Post image

Poster session now happening at #GITT2025 Some really exciting research and potluck discussions happening! πŸ”₯

23.06.2025 12:53 β€” πŸ‘ 7    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Post image 23.06.2025 12:13 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Summary for inclusive AI: representation, transparency, community input, iteration, respect #GITT2025

23.06.2025 12:12 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

What should the industry do to be more inclusive? Rely on NB and inclusive language experts, connect with the community, stay updated, train the people (this can be you!), test and iterate

23.06.2025 12:11 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Gender tags can be used in training! Postprocessing can help in time sensitive situations, but can be dangerous too, not always useful

23.06.2025 12:09 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image Post image Post image

Some attempted strategies: style indication alone not enough, few shot and style guide additions helped

23.06.2025 12:07 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

What about prompting? Industry requires everything to work within the pipeline, within the system. You can't have prompts leading to different outputs, it cannot be wrong, it cannot contain hallucinations

23.06.2025 12:05 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image Post image

Microsoft custom translator was finetuned with +/- 5k-10k segments. Because of the specificity of the problem, limited data is enough for improvements. "overfitting in my favor"

23.06.2025 12:02 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image Post image

Perfect data doesn't exist, but data was created with support of experts and generation. Opposite strategies from what you'd want for non-inclusive datasets! Multiple solutions suddenly useful and even necessary

23.06.2025 12:00 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Great crowd participation! πŸ‘ŒπŸ”₯ Training, teaching, finetuning, data, prompting and more as potential solutions

23.06.2025 11:58 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Share your thoughts! #GITT2025 how do we shape AI for inclusive language?

23.06.2025 11:55 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Issues for MT: lack of training data (historical data was not inclusive, no representation), association bias, post-editing always necessary, publishing MT output as is can be problematic, tricky for gendered languages, evaluation metrics used in the industry, but might not work for inclusivity

23.06.2025 11:55 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image Post image

Important to consult expert matters and the community. Some specific examples: having the inclusive schwa character on the keyboard for Italian, explicit representation of nonbinary characters, e.g. Dragon Age the Veilguard

23.06.2025 11:51 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

In big corporations, teams had to be hired specifically for inclusivity, output had to be tested, guidelines needed to be written and constantly updated with rapidly changing language and character representation evolved (games = community, people need to find themselves)

23.06.2025 11:49 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image Post image

Inclusivity has become increasingly important in the industry, especially for younger generations. The path towards inclusive language has a few different steps and inclusivity goes beyond just gender > echoing the idea of intersectionality from the opening notes!

23.06.2025 11:42 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Good keynotes use inspirations quotes from others. Great keynotes use inspirational quotes from themselves πŸ‘Œ

23.06.2025 11:39 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Already working on the industry when SMT was the default, not the easiest to use for localisation back then, all the way to NMT and LLM today. "video games today are like a work of art" > not necessarily the easiest match for MT #GITT2025

23.06.2025 11:39 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

πŸ”‘πŸŽ΅ Our keynote speaker Cristina Anselmi sharing localisation industry perspectives on optimising AI for inclusivity 🀩

23.06.2025 11:36 β€” πŸ‘ 5    πŸ” 0    πŸ’¬ 16    πŸ“Œ 0
Post image Post image

Evaluation clearly still shows issues and how challenging this is in real world scenarios. Hillary and colleagues are open to collaborate with anyone willing to help tackle this issue in future work 🀝

23.06.2025 10:08 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Some response to actively ambiguous sentences, but no consistency across languages and systems.

23.06.2025 10:05 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image Post image

Baseline: neutral translation when gender is known shows variance within languages across systems. Ambiguous scenarios show strong default masculine translation, no increase in neutral translations

23.06.2025 10:03 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image Post image

Subset of this test suite specifically selected for gender neutral / ambiguous scenarios, 3 languages (Spanish, Czech, Icelandic) with different strategies

23.06.2025 09:59 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

@gitt-workshop is following 20 prominent accounts