Tanel Alumäe @tanelalumae - Bluesky Profile

Latest posts by tanelalumae.bsky.social on Bluesky

TalTech Systems for the PROCESS Signal Processing Grand Challenge The PROCESS Challenge aims to detect cognitive decline, including early stages like mild cognitive impairment, through spontaneous speech. This paper describes TalTech’s systems prepared for the chall...

Last year, our lab ventured into a new domain: detecting cognitive decline from speech recordings. We competed in the
ICASSP PROCESS Challenge and secured 2nd place in the MMSE prediction task (out of ~30 teams)! 🏆 Our paper is now published: ieeexplore.ieee.org/abstract/doc...

12.03.2025 09:24 — 👍 0 🔁 0 💬 0 📌 0

Coat of arms of Ukraine.

I just donated to u24.gov.ua

01.03.2025 13:01 — 👍 25 🔁 8 💬 0 📌 2

University lecturer in Finno-Ugric linguistics University lecturer in Finno-Ugric linguistics

The University of Helsinki is looking for a lecturer in Finno-Ugric linguistics: jobs.helsinki.fi/job/Helsinki...

29.01.2025 13:20 — 👍 11 🔁 7 💬 0 📌 0

"And I'll see the day that anyone gives us #1 without being forced to do so ..."

There are many LLM projects that are open about training and evaluation data, such as AllenAI OLMo, several EU projects (EuroGPT, HPLT), and several Huggingface projects. I don't think anybody forced them to do so.

30.01.2025 11:19 — 👍 2 🔁 0 💬 0 📌 0

Joint speech and text machine translation for up to 100 languages - Nature SEAMLESSM4T is a single machine translation tool that supports speech-to-speech translation, speech-to-text translation, text-to-speech translation, text-to-text translation and automatic speech recog...

@nature.com asked me to write a short comment piece about the SeamlessM4T paper from @metaai.bsky.social
(nature.com/articles/s41...), here it is: nature.com/articles/d41.... I think SeamlessM4T is still the best publicly available multilingual ASR/speech-translation model.

16.01.2025 14:04 — 👍 1 🔁 0 💬 0 📌 0

Great challenge but very little time...

What is the maximum length of a test utterance (important considering limited GPU RAM on the test server)?

Is ASR CER case sensitive? Are spaces taken into account when computing CER?

04.12.2024 22:15 — 👍 0 🔁 0 💬 0 📌 0

What is the maximum length of a test utterance (important considering limited GPU RAM on the test server)?

Is ASR CER case sensitive? Are spaces taken into account when computing CER?

01.12.2024 10:52 — 👍 0 🔁 0 💬 0 📌 0

Very interesting challenge! Unfortunately there is very little time, considering that participants would have to prepare some kind of container that decodes the test data on the Dynabench server.
Some questions follow...

01.12.2024 10:45 — 👍 1 🔁 0 💬 1 📌 0

@tanelalumae is following 20 prominent accounts

BUT Speech
@butspeech

We do impactful research and raise new leading scientific personalities in the field of speech processing.

Jeff Dean
@jeffdean

Google Chief Scientist, Gemini Lead. Opinions stated here are my own, not those of Google. Gemini, TensorFlow, MapReduce, Bigtable, Spanner, ML things, ...

Marianne de Heer Kloots
@mdhk.net

Linguist in AI & CogSci 🧠👩‍💻🤖 PhD student @ ILLC, University of Amsterdam 🌐 https://mdhk.net/ 🐘 https://scholar.social/@mdhk 🐦 https://twitter.com/mariannedhk

Aaricia
@aariciah

Linguist. Inclusive speech tech. Cats.

Romain Serizel
@rserizel

Associate professor at Université de Lorraine. Doing research is speech and audio processing.

Nori Jacoby
@norijacoby

Assistant professor at Cornell Psychology Department. CoCoCo Lab (Cornell Computational Cognition Lab) @co3lab.bsky.social. I am recruiting!

Daan van Esch
@daanvanesch.nl

I work on speech and language technologies at Google. I like languages, history, maps, traveling, cycling, and buying way too many books.

Jonathan Le Roux
@jonathanleroux

Speech and audio research scientist @MERL. saneworkshop.org co-founder. IguanaTex developer. 🌐 jonathanleroux.org 🐙 github.com/Jonathan-LeRoux/ 🎓 scholar.google.com/citations?user=aUpxty8AAAAJ&hl=en

Catherine Lai
@catlai

Lecturer in speech and language technology, CSTR, University of Edinburgh. https://homepages.inf.ed.ac.uk/clai/

Michaela Watkins
@michaelaw

PhD candidate in linguistics (phonetics / phonology) at the University of Amsterdam Looking at Seoul Korean, voice quality analysis, symbolic neural networks 🇳🇱 🇬🇧

Noel Nguyen
@noelnguyen

Speech/cognitive scientist, Professor at Aix-Marseille University, with an interest for speech perception in social interactions, week-end road cyclist

Gasper Begus
@begus

Assoc. Professor at UC Berkeley Artificial and biological intelligence and language Linguistics Lead at Project CETI 🐳 PI Berkeley Biological and Artificial Language Lab 🗣️ College Principal of Bowles Hall 🏰 https://www.gasperbegus.com

Laura Gwilliams
@lauragwilliams

Language processing - Neuroscience - Machine Learning - Assistant Professor at Stanford University - She/Her - 🏳️‍🌈

Maureen de Seyssel
@maureendeseyssel

machine learning researcher @Apple | PhD from @CoML_ENS | speech, ml and cognition.

Eleanor Chodroff
@echodroff

Cognitive scientist, linguist, phonetician at the University of Zurich Dept. of Computational Linguistics

Auditory-Visual Speech Association (AVISA)
@avsp

The official(ish) account of the Auditory-VIsual Speech Association (AVISA) AV 👄 👓 speech references, but mostly what interests me avisa.loria.fr

Simon King
@simonking

So we’re all on this now?!

Dan Mirman
@danmirman

Professor of Brain and Language and Director of Postgraduate Programmes, Department of Psychology, University of Edinburgh. EiC Psychonomic Bulletin & Review. I study language processing and disorders.

William N. Havard
@williamnhavard

🇬🇧🇫🇷🏴󠁧󠁢󠁷󠁬󠁳󠁿 Computational Linguist Language learner : ייִדיש ,العربية, Norsk, Esperanto — Photography

Francisco Teixeira
@fsteixeira