The CfP for the SRW at ACL 2026 is out!
09.01.2026 07:58 β π 3 π 0 π¬ 0 π 0
Scaling Low-Resource MT via Synthetic Data Generation with LLMs
We investigate the potential of LLM-generated synthetic data for improving low-resource Machine Translation (MT). Focusing on seven diverse target languages, we construct a document-level synthetic co...
See you next week at EMNLP!
We will be presenting our work: Scaling Low-Resource MT via Synthetic Data Generation with LLMs
π Poster Session 13
π
Fri, Nov 7, 10:30-12:00 - Hall C
π Check it out! arxiv.org/abs/2505.14423
@helsinki-nlp.bsky.social @cambridgenlp.bsky.social @emnlpmeeting.bsky.social
28.10.2025 08:16 β π 8 π 2 π¬ 0 π 0
Last week I was at @aclmeeting.bsky.social ! Lots of friendly faces, great work and amazing art β¨οΈ We presented HPLT v2 datasets together with @very-laurie.bsky.social π Read our paper here: aclanthology.org/2025.acl-lon...
03.08.2025 09:06 β π 5 π 0 π¬ 0 π 0
I'm part of this! There's also a paper: arxiv.org/abs/2503.10267
17.03.2025 13:27 β π 6 π 3 π¬ 0 π 0
Come to Helsinki for the 18th MT Marathon! Sponsored by EAMT @ufal-cuni.bsky.social
18.03.2025 13:10 β π 8 π 5 π¬ 0 π 1
That's a wrap for @nodalida.bsky.social ! Short, nice and intense. I presented our work on efficient MT @helsinki-nlp.bsky.social within the #HPLT projectβ‘οΈ
05.03.2025 09:59 β π 6 π 0 π¬ 0 π 0
HPLT - High Performance Language Technologies
A space that combines petabytes of natural language data with large-scale model training
** New parallel data set ** . We've just released HPLT v2.0, a parallel data set of 50 languages paired with English, 380M sentence pairs in total. Extracted from the Internet Archive and Common Crawl hplt-project.org/datasets/v2.0
28.02.2025 13:34 β π 4 π 3 π¬ 1 π 1
π Just added our HPLT fast translation models to a new TranslateLocally repository! Translate on your own machineβfast, private, and easy. shorturl.at/R2vzw
20+ models for diverse languagesβlearn more about them next week at @nodalida.bsky.social!
24.02.2025 09:04 β π 5 π 0 π¬ 0 π 0
I am so excited to share with you all the 2025 edition of our #AmericasNLP workshop!
Do not hesitate in submitting your amazing research paper on indigenous and low resource languages.
Submission deadline: March 7, 2025
turing.iimas.unam.mx/americasnlp/...
30.01.2025 15:17 β π 29 π 15 π¬ 0 π 0
There is an open position for a postdoc in our lab in this call. The focus will be on scalable modular NLP with funding from the ERC PoC project MARMoT
07.02.2025 14:12 β π 1 π 3 π¬ 0 π 0
NLP PhD candidate @ University of Edinburgh
Computational Linguistics | Typology | Morphology | Multimodal NLP | Cognitive Science
(Interpretability + Neurosymbolic models sometimes)
PhD supervised by Tim RocktΓ€schel and Ed Grefenstette, part time at Cohere. Language and LLMs. Spent time at FAIR, Google, and NYU (with Brenden Lake). She/her.
Postdoc at βͺMila & McGill University π¨π¦ with a PhD in NLP from the University of Edinburgh π΄σ §σ ’σ ³σ £σ ΄σ Ώ memorization vs generalization x (non-)compositionality. she/her π©βπ» π³π±
a mediocre combination of a mediocre AI scientist, a mediocre physicist, a mediocre chemist, a mediocre manager and a mediocre professor.
see more at https://kyunghyuncho.me/
ALMAnaCH, the Inria Paris NLP research team.
seeks to understand language.
Head of Cohere Labs
@Cohere_Labs @Cohere
PhD from @UvA_Amsterdam
https://marziehf.github.io/
Postdoctoral researcher at the Institute for Logic, Language and Computation at the University of Amsterdam.
Previously PhD Student at NLPNorth at the IT University of Copenhagen, with internships at AWS, Parameter Lab, Pacmed.
dennisulmer.eu
Teaching and writing media studies at CU Boulder. Helping to build a cooperative fediverse with Social.coop. Fan of democratic experiences and divine mysteries. Co-leading metagov.org, start.coop, wagingnonviolence.org.
NLP assistant prof at KU Leuven, PI @lagom-nlp.bsky.social. I like syntax more than most people. Also multilingual NLP, interpretability, mountains and beer. (She/her)
PhD student at Aalborg University, mostly working on NLP and linguistic typology
#NLProc research group @itu.dk (Copenhagen, Denmark)
π nlpnorth.github.io
@Cohere.com's non-profit research lab and open science initiative that seeks to solve complex machine learning problems. Join us in exploring the unknown, together. https://cohere.com/research
PhD student at Aalborg University π©π° doing multilingual NLP, computational linguistics, language variation, and interpretability research π
Loves reading, bouldering π§, hiking, and playing music
Full Professor at Aalborg University, Copenhagen π©π°
Postdoc @rug.nl with Arianna Bisazza.
Interested in NLP, interpretability, syntax, language acquisition and typology.
I make sure that OpenAI et al. aren't the only people who are able to study large scale AI systems.
Associate professor at @liu.se πΈπͺ, site development lead for @aclanthology.org, editor-in-chief at @nejlt.bsky.social. Mildly obscure #NLP researcher.
I like coffee and board games.
π https://marcel.bollmann.me/
Influencing the world since 1583. Follow our other social channels: https://edin.ac/3CJvzdv
π₯ LLMs together (co-created model merging, BabyLM, textArena.ai)
π₯ Spreading science over hype in #ML & #NLP
Proud shareLM㪠Donor
@IBMResearch & @MIT_CSAIL
ACL Rolling Review (https://aclrollingreview.org)
Tweets by the ARR Communications / Support Team