Ona de Gibert's Avatar

Ona de Gibert

@onadegibert.bsky.social

PhD Student @HelsinkiNLP / Low-resource, Machine Translation, Knowledge Distillation, Multilinguality

138 Followers  |  240 Following  |  6 Posts  |  Joined: 27.11.2024  |  1.6341

Latest posts by onadegibert.bsky.social on Bluesky

Preview
Scaling Low-Resource MT via Synthetic Data Generation with LLMs We investigate the potential of LLM-generated synthetic data for improving low-resource Machine Translation (MT). Focusing on seven diverse target languages, we construct a document-level synthetic co...

See you next week at EMNLP!
We will be presenting our work: Scaling Low-Resource MT via Synthetic Data Generation with LLMs

πŸ“ Poster Session 13
πŸ“… Fri, Nov 7, 10:30-12:00 - Hall C
πŸ“– Check it out! arxiv.org/abs/2505.14423

@helsinki-nlp.bsky.social @cambridgenlp.bsky.social @emnlpmeeting.bsky.social

28.10.2025 08:16 β€” πŸ‘ 8    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0
Post image Post image

Last week I was at @aclmeeting.bsky.social ! Lots of friendly faces, great work and amazing art ✨️ We presented HPLT v2 datasets together with @very-laurie.bsky.social πŸŽ‰ Read our paper here: aclanthology.org/2025.acl-lon...

03.08.2025 09:06 β€” πŸ‘ 5    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image Post image Post image

NAACL was a blast πŸ’₯ Presented the findings of our Shared Tasks at @americasnlp.bsky.social, had a chance to reconnect with old friends, make new ones, and get excited about research I'm passionate about. #NAACL25

05.05.2025 15:08 β€” πŸ‘ 8    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

I'm part of this! There's also a paper: arxiv.org/abs/2503.10267

17.03.2025 13:27 β€” πŸ‘ 6    πŸ” 3    πŸ’¬ 0    πŸ“Œ 0

Come to Helsinki for the 18th MT Marathon! Sponsored by EAMT @ufal-cuni.bsky.social

18.03.2025 13:10 β€” πŸ‘ 8    πŸ” 5    πŸ’¬ 0    πŸ“Œ 1
Post image Post image

That's a wrap for @nodalida.bsky.social ! Short, nice and intense. I presented our work on efficient MT @helsinki-nlp.bsky.social within the #HPLT project⚑️

05.03.2025 09:59 β€” πŸ‘ 6    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
HPLT - High Performance Language Technologies A space that combines petabytes of natural language data with large-scale model training

** New parallel data set ** . We've just released HPLT v2.0, a parallel data set of 50 languages paired with English, 380M sentence pairs in total. Extracted from the Internet Archive and Common Crawl hplt-project.org/datasets/v2.0

28.02.2025 13:34 β€” πŸ‘ 4    πŸ” 3    πŸ’¬ 1    πŸ“Œ 1
Post image

πŸš€ Just added our HPLT fast translation models to a new TranslateLocally repository! Translate on your own machineβ€”fast, private, and easy. shorturl.at/R2vzw
20+ models for diverse languagesβ€”learn more about them next week at @nodalida.bsky.social!

24.02.2025 09:04 β€” πŸ‘ 5    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

I am so excited to share with you all the 2025 edition of our #AmericasNLP workshop!
Do not hesitate in submitting your amazing research paper on indigenous and low resource languages.
Submission deadline: March 7, 2025
turing.iimas.unam.mx/americasnlp/...

30.01.2025 15:17 β€” πŸ‘ 29    πŸ” 15    πŸ’¬ 0    πŸ“Œ 0

There is an open position for a postdoc in our lab in this call. The focus will be on scalable modular NLP with funding from the ERC PoC project MARMoT

07.02.2025 14:12 β€” πŸ‘ 1    πŸ” 3    πŸ’¬ 0    πŸ“Œ 0

@onadegibert is following 20 prominent accounts