Multilingual Representation Workshop @ EMNLP 2025 @mrl-workshop

Announcing the two Best Paper Awards for this year’s workshop. Congratulations to all the authors for your great work!

09.11.2025 08:26 — 👍 1 🔁 0 💬 0 📌 0

Finally, we recognize the Hawaiian submission to the shared task. Thank you for your contributions!

09.11.2025 08:14 — 👍 1 🔁 0 💬 0 📌 0

The second is 7PERFECTION, a dataset for seven Nigerian languages!

09.11.2025 08:13 — 👍 1 🔁 0 💬 1 📌 0

The first of our best contribution awards goes to AraPIQA!

09.11.2025 08:12 — 👍 1 🔁 0 💬 1 📌 0

We would like to recognize three honorable mention submissions. Great work!

09.11.2025 08:11 — 👍 1 🔁 0 💬 1 📌 0

This afternoon, @catherinearnett.bsky.social presented the result of the shared task and presented the best contribution awards. Congratulations to all contributors!

09.11.2025 08:08 — 👍 3 🔁 0 💬 1 📌 0

@aliceatkaist.bsky.social joins us to give the final keynote of the day about code switching in multilingual language models!

09.11.2025 06:03 — 👍 1 🔁 0 💬 0 📌 0

Join us in hall C3 at posters 137-168 for our in-person poster session. Join us online on Zoom via Underline for our virtual poster session!

09.11.2025 03:00 — 👍 1 🔁 1 💬 1 📌 0

Now Pontus Stenetorp shares an oral history of UK-LLM!

09.11.2025 01:54 — 👍 0 🔁 0 💬 1 📌 0

Research poster of the paper "Sub-1B Language Models for Low-resource Languages: Training Strategies and Insights for Basque."

Tomorrow, I'll be presenting (virtually) our research from @orainlp.bsky.social on pre-training SLMs for low-resource languages as a poster during the @mrl-workshop.bsky.social.

Come check it out!

📝 aclanthology.org/2025.mrl-mai...

08.11.2025 11:20 — 👍 9 🔁 3 💬 1 📌 0

@kellymarchisio.bsky.social from Cohere presents “Building Multilingual LLMs in Industry”, sharing insights on training multilinguality at scale!

09.11.2025 01:25 — 👍 1 🔁 1 💬 1 📌 0

We have kicked off proceedings with some brief opening remarks from @catherinearnett.bsky.social

09.11.2025 01:25 — 👍 3 🔁 1 💬 1 📌 0

We are kicking off this year’s workshop in Suzhou at #EMNLP2025! Come join us in room A106-107 or online!

09.11.2025 01:24 — 👍 1 🔁 0 💬 0 📌 0

Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures To date, there exist almost no culturally-specific evaluation benchmarks for large language models (LLMs) that cover a large number of languages and cultures. In this paper, we present Global PIQA, a ...

Preprint: arxiv.org/abs/2510.24081
Dataset: huggingface.co/datasets/mrl...

29.10.2025 15:50 — 👍 2 🔁 0 💬 0 📌 1

Global PIQA Contributor Interest Form Thanks for your interest in contributing to Global PIQA! Please fill out the form and we will contact you with details about how to get involved!

It’s not too late to get involved! Until early 2026, we will be accepting submissions for languages not already represented in Global PIQA. If you’re interested, please fill out this form and we will contact you with details!
docs.google.com/forms/d/e/1F...

29.10.2025 15:50 — 👍 3 🔁 0 💬 1 📌 1

There are seven languages where even the best proprietary LLM scores less than 80% (chance: 50%). Sub-Saharan African languages lag behind Western European languages by ~15%. Thus Global PIQA highlights languages which are very poorly served by large, proprietary models.

29.10.2025 15:50 — 👍 2 🔁 0 💬 1 📌 0

The top proprietary models achieve ~90% accuracy, which falls short of human accuracy (~95%). The best open models perform significantly worse, with the best open model performance from Gemma 3 (27B) at 82.4%.

29.10.2025 15:50 — 👍 2 🔁 0 💬 1 📌 0

This dataset is created and owned by the contributors, all of whom were offered authorship. We believe this is more fair to annotators and is likely to result in a higher-quality dataset, as it is constructed by the NLP researchers who will use it.

29.10.2025 15:50 — 👍 3 🔁 0 💬 1 📌 0

Global PIQA includes subsets for 116 unique language varieties. These cover five continents, 14 language families, and 23 writing systems. Over 50% of examples reference local foods, customs, traditions, or other culturally-specific elements.

29.10.2025 15:50 — 👍 3 🔁 0 💬 1 📌 0

Introducing Global PIQA, a new multilingual benchmark for 100+ languages. This benchmark is the outcome of this year’s MRL shared task, in collaboration with 300+ researchers from 65 countries. This dataset evaluates physical commonsense reasoning in culturally relevant contexts.

29.10.2025 15:50 — 👍 22 🔁 10 💬 1 📌 5

We are in need of some emergency reviewers for MRL. If you are available, please fill out this form!

12.09.2025 18:31 — 👍 0 🔁 1 💬 0 📌 0

5TH MULTILINGUAL REPRESENTATION LEARNING (MRL) WORKSHOP @EMNLP 2025 SIGTYP

sigtyp.github.io/ws2025-mrl.h...

07.09.2025 14:53 — 👍 1 🔁 0 💬 0 📌 0

This year MRL is also accepting papers that have been submitted to ARR and have received reviews and a metareview! Submit your papers by September 23rd! See the workshop website for details on how to submit ⬇️

07.09.2025 14:53 — 👍 1 🔁 0 💬 1 📌 0

Correct, you can submit ARR papers that already have their reviews by Sep 23. More instructions soon!

TBD on whether the workshop will be hybrid.

25.08.2025 15:56 — 👍 1 🔁 0 💬 0 📌 0

We extended the deadline by one day, so you have until the end of today (Aug 24) AoE to submit! Good luck!

24.08.2025 22:08 — 👍 0 🔁 1 💬 0 📌 0

MRL 2025 Shared Task Info Meeting MRL 2025 Shared Task at EMNLP: Multilingual Physical Commonsense Reasoning Datasets Info meeting, 2025/08/14 Contact: mrl2025-workshop@googlegroups.com Last updated: 2025/08/14

Check out more information, including answers to FAQs: docs.google.com/presentation...

18.08.2025 15:52 — 👍 0 🔁 0 💬 0 📌 0

MRL Shared Task volunteers Call for participation to create physical reasoning datasets for various languages! While there has been much progress in developing benchmarks for diverse languages, we still have very few multiling...

If you plan to participate, fill in this google form so we can better plan the shared task: forms.gle/zxhpCfL6wvBz...

18.08.2025 15:52 — 👍 0 🔁 0 💬 1 📌 0

We have over 200 volunteers now for 90+ languages! We are hoping to expand the diversity of our language coverage and are still looking for participants who speak these languages. Check out how to get involved below, and please help us spread the word!

18.08.2025 15:52 — 👍 3 🔁 3 💬 1 📌 0

The deadline for MRL at #EMNLP2025 is next week!

⏰ Submission Deadline: August 23rd (AoE)

🔗 CfP: sigtyp.github.io/ws2025-mrl.h...

12.08.2025 17:00 — 👍 2 🔁 1 💬 0 📌 1

MRL 2025 Shared Task on Multilingual Physical Reasoning Datasets

See the shared task page for more information: sigtyp.github.io/st2025-mrl.h...

05.08.2025 15:17 — 👍 1 🔁 0 💬 0 📌 0

Multilingual Representation Workshop @ EMNLP 2025

Latest posts by mrl-workshop.bsky.social on Bluesky

@mrl-workshop is following 20 prominent accounts