Scaling Low-Resource MT via Synthetic Data Generation with LLMs
We investigate the potential of LLM-generated synthetic data for improving low-resource Machine Translation (MT). Focusing on seven diverse target languages, we construct a document-level synthetic co...
See you next week at EMNLP!
We will be presenting our work: Scaling Low-Resource MT via Synthetic Data Generation with LLMs
Poster Session 13
Fri, Nov 7, 10:30-12:00 - Hall C
Check it out! arxiv.org/abs/2505.14423
@helsinki-nlp.bsky.social @cambridgenlp.bsky.social @emnlpmeeting.bsky.social
28.10.2025 08:16
Last week I was at @aclmeeting.bsky.social! Lots of friendly faces, great work and amazing art ✨ We presented HPLT v2 datasets together with @very-laurie.bsky.social. Read our paper here: aclanthology.org/2025.acl-lon...
03.08.2025 09:06
I'm part of this! There's also a paper: arxiv.org/abs/2503.10267
17.03.2025 13:27
Come to Helsinki for the 18th MT Marathon! Sponsored by EAMT @ufal-cuni.bsky.social
18.03.2025 13:10
That's a wrap for @nodalida.bsky.social! Short, nice and intense. I presented our work on efficient MT @helsinki-nlp.bsky.social within the #HPLT project ⚡
05.03.2025 09:59
HPLT - High Performance Language Technologies
A space that combines petabytes of natural language data with large-scale model training
New parallel data set: we've just released HPLT v2.0, a parallel data set of 50 languages paired with English, 380M sentence pairs in total, extracted from the Internet Archive and Common Crawl. hplt-project.org/datasets/v2.0
28.02.2025 13:34
Just added our HPLT fast translation models to a new TranslateLocally repository! Translate on your own machine: fast, private, and easy. shorturl.at/R2vzw
20+ models for diverse languages. Learn more about them next week at @nodalida.bsky.social!
24.02.2025 09:04
I am so excited to share with you all the 2025 edition of our #AmericasNLP workshop!
Do not hesitate to submit your amazing research paper on indigenous and low-resource languages.
Submission deadline: March 7, 2025
turing.iimas.unam.mx/americasnlp/...
30.01.2025 15:17
There is an open position for a postdoc in our lab in this call. The focus will be on scalable modular NLP with funding from the ERC PoC project MARMoT
07.02.2025 14:12
PhD student @CambridgeLTL; Previously @DLAB @EPFL; Interested in NLP and CSS. Apple Scholar, Gates Scholar.
AmericasNLP 2025 will be co-located with NAACL in Albuquerque, USA. We're looking forward to seeing you all there!
turing.iimas.unam.mx/americasnlp/
A world-class research hub in AI and machine learning, in partnership with universities, RDI organizations and businesses in Finland. We are the 2nd institute in the @ellis.eu network.
ellisinstitute.fi
Natural Language Processing @ University of Zurich
PhD student in NLP at GMU w/ Antonios Anastasopoulos. Focus: L2 acquisition, low-resource NLP, psycholinguistics. Passionate about empowering heritage speakers. Berkeley '19
The Language Technology Group (LTG) at the University of Oslo, Norway, does research on a range of topics in Natural Language Processing (NLP), including language modeling for Norwegian and other languages.
Data Science research group on AI ethics and multicultural NLP, led by @a-lauscher.bsky.social.
https://mcgill-nlp.github.io/people/
MaiNLP research lab at CIS, LMU Munich directed by Barbara Plank @barbaraplank.bsky.social
Natural Language Processing | Artificial Intelligence | Computational Linguistics | Human-centric NLP
We are the #nlproc group at Uppsala University! We focus our research on computational modeling of natural language and practical applications involving natural language processing (NLP).
https://www.uu.se/en/department/linguistics-and-p
Assistant Professor at SCU: #NLP #AI. PhD and Postdoc in CS at Umich. Intern at Meta and Amazon. Co-host at ACL Mentorship.
Researcher and Research projects leader in the area of Computational Linguistics, NLP and AI.
This is the account for the NLP community at Imperial College London! Looking forward to sharing our NLP research with you.
Assistant Professor at Universidade da Coruña. Structured Prediction. Low-resource Languages. Computational Social Science.
NLP/Computational Linguistics PhD @lecslab.bsky.social and @bouldernlp.bsky.social
typologically robust multilingual NLP
technology for language documentation
https://covetedfish.github.io/
Center for Language and Speech Processing at Johns Hopkins University
#NLProc #MachineLearning #AI http://tinyurl.com/clspy2ube
The AI community building the future!
Training materials for setting up and using a research infrastructure based on Jupyter notebooks: https://cusy.io/en/seminars