Martijn Bartelds's Avatar

Martijn Bartelds

@mbartelds.bsky.social

Postdoctoral Scholar Stanford NLP

410 Followers  |  124 Following  |  14 Posts  |  Joined: 16.11.2024  |  1.8486

Latest posts by mbartelds.bsky.social on Bluesky

✨Meet OLMoASR✨ By pairing our curated 1M-hour dataset with a powerful architecture, we've built open ASR models that achieve competitive performance with models like Whisper. We're open-sourcing data, code and models to help the community build more robust and transparent ASR.

29.08.2025 16:21 β€” πŸ‘ 12    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Speech and Language Processing Speech and Language Processing

Now that school is starting for lots of folks, it's time for a new release of Speech and Language Processing! Jim and I added all sorts of material for the August 2025 release! With slides to match! Check it out here: web.stanford.edu/~jurafsky/sl...

24.08.2025 19:28 β€” πŸ‘ 151    πŸ” 59    πŸ’¬ 2    πŸ“Œ 5

Big THANK YOU to the amazing #Interspeech2025 Organizing Committee! πŸ’™

🎀 Odette Scharenborg, Catharine Oertel, Khiet Truong
πŸ’° Martijn Bartelds
🌐 DragoΘ™ BΔƒlan
πŸ—‚οΈ Saskia Peters
🀝 Ginny Ruiter, Marie Louise Verhagen, Natascha Voskuijl

14.07.2025 14:26 β€” πŸ‘ 10    πŸ” 3    πŸ’¬ 1    πŸ“Œ 0

Congratulations!! That’s wonderful!! πŸŽ‰πŸΎ

02.07.2025 17:18 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Congrats!!! πŸŽ‰

29.04.2025 22:46 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

CTC-DRO can be applied to ASR with minimal computational costs, and offers the potential for reducing group disparities in other domains with similar challenges.

πŸ“„ Read our paper: arxiv.org/pdf/2502.017...
πŸ’» Get the code: github.com/Bartelds/ctc...

12.03.2025 15:29 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

The result:
πŸ“Š Worst-language error ↓ up to 47.1%
πŸ“Š Average error ↓ up to 32.9%

CTC-DRO works seamlessly with existing self-supervised speech models through ESPnet πŸš€

12.03.2025 15:29 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

We present CTC-DRO, which addresses the shortcomings of the group DRO objective by:
βœ… Input length-matched batching to mitigate CTC’s scaling issues
βœ… Smoothing the group weight update to prevent overemphasis on consistently high-loss groups

12.03.2025 15:29 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Why? Group DRO needs comparable training losses between languages. But in ASR, CTC-based losses vary due to differences in speech length, speakers, and acoustics. This creates spurious differences across language groups.

Result? Worse performance.

We need a new approach πŸš€

12.03.2025 15:29 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

CTC-based fine-tuning has been successful in multilingual ASR benchmarks but it doesn't fix language performance gaps. Group DRO could help by focusing on worst-performing languages, but it does not work ❌

12.03.2025 15:29 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Video thumbnail

πŸŽ™οΈ Speech recognition is great - if you speak the right language.

Our new @stanfordnlp.bsky.social paper introduces CTC-DRO, a training method that reduces worst-language errors by up to 47.1%.

Work w/ Ananjan, Moussa, @jurafsky.bsky.social, Tatsu Hashimoto and Karen Livescu.

Here’s how it works 🧡

12.03.2025 15:29 β€” πŸ‘ 11    πŸ” 3    πŸ’¬ 1    πŸ“Œ 0
Post image

I am excited to announce that I will join the University of Zurich as an assistant professor in August this year! I am looking for PhD students and postdocs starting from the fall.

My research interests include optimization, federated learning, machine learning, privacy, and unlearning.

06.03.2025 02:17 β€” πŸ‘ 28    πŸ” 5    πŸ’¬ 1    πŸ“Œ 1

πŸ“’ Join us for the Conversational AI Reading Group meeting on Thursday, January 16th, 11 AM-12 PM EST.
Martijn Bartelds will present "Improving Universal Access to Modern Speech Technology".
Details here: poonehmousavi.github.io/rg

13.01.2025 16:19 β€” πŸ‘ 2    πŸ” 3    πŸ’¬ 0    πŸ“Œ 0
Speech and Language Processing Speech and Language Processing

Happy New Year everyone! Jim and I just put up our January 2025 release of Speech and Language Processing! Check it out here: web.stanford.edu/~jurafsky/sl...

12.01.2025 20:44 β€” πŸ‘ 152    πŸ” 50    πŸ’¬ 1    πŸ“Œ 1
Group picture of people in the Stanford NLP Group gathered in front of the shores of Lake Tahoe.

Group picture of people in the Stanford NLP Group gathered in front of the shores of Lake Tahoe.

Natural Language Processingβ€”artificial intelligence that uses human languageβ€”has been on a roll lately. You’ve probably noticed! So the Stanford NLP Group has been growing, and diversifying into lots of new topics, including agents, language model programs, and socially aware #NLP.

nlp.stanford.edu

04.12.2024 17:14 β€” πŸ‘ 53    πŸ” 8    πŸ’¬ 1    πŸ“Œ 0

Excited to announce the launch of our ML-SUPERB 2.0 challenge @interspeech.bsky.social 2025! Join us in pushing the boundaries of multilingual ASR and LID! πŸš€

πŸ’» multilingual.superbbenchmark.org

04.12.2024 18:09 β€” πŸ‘ 8    πŸ” 3    πŸ’¬ 0    πŸ“Œ 0
Multimodal Information Based Speech Processing (MISP) 2025 Challenge

Hi speech people, super exciting news here!

We are running another "Multimodal information based speech (MISP)" Challenge at @interspeech.bsky.social

Participate!
Spread the word!

More info πŸ‘‡
mispchallenge.github.io/mispchalleng...

25.11.2024 11:25 β€” πŸ‘ 15    πŸ” 7    πŸ’¬ 0    πŸ“Œ 0

made this thing, reply to be added
go.bsky.app/AKGJ82V

22.11.2024 00:26 β€” πŸ‘ 12    πŸ” 1    πŸ’¬ 6    πŸ“Œ 0

πŸ™‹β€β™‚οΈ

22.11.2024 00:27 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Mentioning this post from @cjziems.bsky.social, listing some starter packs: bsky.app/profile/cjzi...

20.11.2024 19:02 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

I've started putting together a starter pack with people working on Speech Technology and Speech Science: go.bsky.app/BQ7mbkA

(Self-)nominations welcome!

19.11.2024 11:13 β€” πŸ‘ 82    πŸ” 34    πŸ’¬ 44    πŸ“Œ 3

πŸ™‹β€β™‚οΈ

20.11.2024 15:28 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

I wanted to contribute to "Starter Pack Season" with one for Stanford NLP+HCI: go.bsky.app/VZBhuJ5

Here are some other great starter packs:

- CSS: go.bsky.app/GoEyD7d + go.bsky.app/CYmRvcK
- NLP: go.bsky.app/SngwGeS + go.bsky.app/JgneRQk
- HCI: go.bsky.app/p3TLwt
- Women in AI: go.bsky.app/LaGDpqg

15.11.2024 19:20 β€” πŸ‘ 25    πŸ” 10    πŸ’¬ 2    πŸ“Œ 2

πŸ‘‹

17.11.2024 18:30 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

@mbartelds is following 20 prominent accounts