And that's a wrap on a very hot π₯΅ yet very rewarding #GITT2025
π We need more languages, better quality estimation, and intersectionality
π Thanks to everyone who was there, presented, contributed to the discussion, and helped with practicalities.
π Enjoy the rest of @mtsummit2025.bsky.social!
24.06.2025 05:51 β π 12 π 4 π¬ 0 π 1
Results show GPT4-o and Qwen 72B outperforming the baseline classifier and improved accuracy for intermediate reasoning steps. More details in the paper - have a read! π #GITT2025
23.06.2025 14:22 β π 2 π 0 π¬ 0 π 0
Experiments using mGeNTE as reference arxiv.org/abs/2501.09409
23.06.2025 14:17 β π 2 π 0 π¬ 0 π 0
RQs: can LLMs identify neutral versus gendered translation? How do we improve the accuracy? Tests on 3 language pairs (English into Italian, Spanish, and German) on sentence and phrase level in monolingual and crosslingual scenarios
23.06.2025 14:15 β π 2 π 0 π¬ 0 π 0
Work on LLM-as-judge suggests this could be useful for GNT evaluation as well
23.06.2025 14:11 β π 2 π 0 π¬ 0 π 0
As already raised by our keynote, quality estimation is a great industry need as there is often no time for human evaluation in production, this is particularly tricky for GNT
23.06.2025 14:10 β π 2 π 0 π¬ 0 π 0
Not one single solution for gender neutral translation (GNT) (tradeoff adequacy/fluency?) making GNT a complex evaluation task
23.06.2025 14:09 β π 2 π 0 π¬ 0 π 0
Last but definitely not least: @bsavoldi.bsky.social presenting joint work with @apierg.bsky.social @matteo-negri.bsky.social @luisabentivogli.bsky.social on scalable gender neutral translation evaluation using LLM-as-a-judge at #GITT2025
23.06.2025 14:07 β π 10 π 3 π¬ 6 π 1
23.06.2025 12:13 β π 1 π 0 π¬ 0 π 0
Summary for inclusive AI: representation, transparency, community input, iteration, respect #GITT2025
23.06.2025 12:12 β π 1 π 0 π¬ 0 π 0
What should the industry do to be more inclusive? Rely on NB and inclusive language experts, connect with the community, stay updated, train the people (this can be you!), test and iterate
23.06.2025 12:11 β π 0 π 0 π¬ 0 π 0
Gender tags can be used in training! Postprocessing can help in time sensitive situations, but can be dangerous too, not always useful
23.06.2025 12:09 β π 0 π 0 π¬ 0 π 0
What about prompting? Industry requires everything to work within the pipeline, within the system. You can't have prompts leading to different outputs, it cannot be wrong, it cannot contain hallucinations
23.06.2025 12:05 β π 0 π 0 π¬ 0 π 0
Microsoft custom translator was finetuned with +/- 5k-10k segments. Because of the specificity of the problem, limited data is enough for improvements. "overfitting in my favor"
23.06.2025 12:02 β π 0 π 0 π¬ 0 π 0
Perfect data doesn't exist, but data was created with support of experts and generation. Opposite strategies from what you'd want for non-inclusive datasets! Multiple solutions suddenly useful and even necessary
23.06.2025 12:00 β π 1 π 0 π¬ 0 π 0
Great crowd participation! ππ₯ Training, teaching, finetuning, data, prompting and more as potential solutions
23.06.2025 11:58 β π 1 π 0 π¬ 0 π 0
Share your thoughts! #GITT2025 how do we shape AI for inclusive language?
23.06.2025 11:55 β π 0 π 0 π¬ 0 π 0
Issues for MT: lack of training data (historical data was not inclusive, no representation), association bias, post-editing always necessary, publishing MT output as is can be problematic, tricky for gendered languages, evaluation metrics used in the industry, but might not work for inclusivity
23.06.2025 11:55 β π 0 π 0 π¬ 0 π 0
Important to consult expert matters and the community. Some specific examples: having the inclusive schwa character on the keyboard for Italian, explicit representation of nonbinary characters, e.g. Dragon Age the Veilguard
23.06.2025 11:51 β π 0 π 0 π¬ 0 π 0
In big corporations, teams had to be hired specifically for inclusivity, output had to be tested, guidelines needed to be written and constantly updated with rapidly changing language and character representation evolved (games = community, people need to find themselves)
23.06.2025 11:49 β π 0 π 0 π¬ 0 π 0
Inclusivity has become increasingly important in the industry, especially for younger generations. The path towards inclusive language has a few different steps and inclusivity goes beyond just gender > echoing the idea of intersectionality from the opening notes!
23.06.2025 11:42 β π 1 π 0 π¬ 0 π 0
Good keynotes use inspirations quotes from others. Great keynotes use inspirational quotes from themselves π
23.06.2025 11:39 β π 1 π 0 π¬ 0 π 0
Already working on the industry when SMT was the default, not the easiest to use for localisation back then, all the way to NMT and LLM today. "video games today are like a work of art" > not necessarily the easiest match for MT #GITT2025
23.06.2025 11:39 β π 1 π 0 π¬ 0 π 0
ππ΅ Our keynote speaker Cristina Anselmi sharing localisation industry perspectives on optimising AI for inclusivity π€©
23.06.2025 11:36 β π 5 π 0 π¬ 16 π 0
Evaluation clearly still shows issues and how challenging this is in real world scenarios. Hillary and colleagues are open to collaborate with anyone willing to help tackle this issue in future work π€
23.06.2025 10:08 β π 1 π 0 π¬ 0 π 0
Some response to actively ambiguous sentences, but no consistency across languages and systems.
23.06.2025 10:05 β π 1 π 0 π¬ 0 π 0
Baseline: neutral translation when gender is known shows variance within languages across systems. Ambiguous scenarios show strong default masculine translation, no increase in neutral translations
23.06.2025 10:03 β π 1 π 0 π¬ 0 π 0
Subset of this test suite specifically selected for gender neutral / ambiguous scenarios, 3 languages (Spanish, Czech, Icelandic) with different strategies
23.06.2025 09:59 β π 1 π 0 π¬ 0 π 0
Janice M. Jenkins Collegiate Professor of Computer Science at U. Michigan, Director Michigan AI Lab, Former ACL President, AAAI Fellow, ACM Fellow. Researcher #NLProc #AI
π https://web.eecs.umich.edu/~mihalcea/
Technical lead at Hub of Computing and Data Science, University of Hamburg
CS PhD candidate @UCBerkeley. Interested in multilingual and low-resourced language NLP + HCI. @SIGHPC CDS Fellow. Interned @MBZUAI. Current intern at DAIR
Website: https://hhnigatu.github.io
Postdoc at MBZUAI | Ph.D from IPN| Research Scientist at Lelapa AI| Working on Low-resource NLP
Responsible, and Inclusive #NLProc. Senior RS @ Google Research. she/her
tpu go brr @deep-mind.bsky.social @uwcse.bsky.social | varying proportions of AI and mediocre jokes (not mutually exclusive) | she/her/hers
Assistant Professor at Bar-Ilan University
https://yanaiela.github.io/
USC Graduate Student | USC ISI NLP Researcher | 3x Apple Intern | Self-proclaimed Michelin 3-star Foodie | she/her
Doktorsnemi à mÑltækni / Computer science PhD student with a focus on natural language processing. Heyrðu à mér ef þig vantar prófarkarlesara ||
HΓΊn/she
Sociolinguistics & language ideologies & trans stuff & cats. Assistant prof at Elon University in North Carolina
they/them
Senior Research Scientist @fbk-mt.bsky.social - he/him
Researcher @ St. PΓΆlten UAS
Cofounder AKMatriX.org
mathematician β’ computer scientist β’ linguist β’ progressive
https://l17r.eu
queer crip in tech research, hearing sign language enthusiast
Human-centered AI #HCAI, NLP & ML. Director TRAILS (Trustworthy AI in Law & Society) and AIM (AI Interdisciplinary Institute at Maryland). Formerly Microsoft Research NYC. Fun: π§π§βπ³π§β·οΈποΈ. he/him.
Associate Professor in Computer Science at the University of Maryland. Human-Centered Natural Language Processing & Machine Translation
PhD Student, Ex- U.S. Federal Govβt Data Scientist
NLP, Linguistics, Cognitive Science, AI, ML, etc.
Job currently: Research Scientist (NYC)
Job formerly: NYU Linguistics, MSU Linguistics