Tanel Alumäe's Avatar

Tanel Alumäe

@tanelalumae.bsky.social

Associate Professor of Speech Processing Tallinn University of Technology, Estonia

197 Followers  |  115 Following  |  6 Posts  |  Joined: 09.11.2023  |  1.7867

Latest posts by tanelalumae.bsky.social on Bluesky

Preview
TalTech Systems for the PROCESS Signal Processing Grand Challenge The PROCESS Challenge aims to detect cognitive decline, including early stages like mild cognitive impairment, through spontaneous speech. This paper describes TalTech’s systems prepared for the chall...

Last year, our lab ventured into a new domain: detecting cognitive decline from speech recordings. We competed in the
ICASSP PROCESS Challenge and secured 2nd place in the MMSE prediction task (out of ~30 teams)! 🏆 Our paper is now published: ieeexplore.ieee.org/abstract/doc...

12.03.2025 09:24 — 👍 0    🔁 0    💬 0    📌 0
Coat of arms of Ukraine.

Coat of arms of Ukraine.

I just donated to u24.gov.ua

01.03.2025 13:01 — 👍 25    🔁 8    💬 0    📌 2
Preview
University lecturer in Finno-Ugric linguistics University lecturer in Finno-Ugric linguistics

The University of Helsinki is looking for a lecturer in Finno-Ugric linguistics: jobs.helsinki.fi/job/Helsinki...

29.01.2025 13:20 — 👍 11    🔁 7    💬 0    📌 0

"And I'll see the day that anyone gives us #1 without being forced to do so ..."

There are many LLM projects that are open about training and evaluation data, such as AllenAI OLMo, several EU projects (EuroGPT, HPLT), and several Huggingface projects. I don't think anybody forced them to do so.

30.01.2025 11:19 — 👍 2    🔁 0    💬 0    📌 0
Preview
Joint speech and text machine translation for up to 100 languages - Nature SEAMLESSM4T is a single machine translation tool that supports speech-to-speech translation, speech-to-text translation, text-to-speech translation, text-to-text translation and automatic speech recog...

@nature.com asked me to write a short comment piece about the SeamlessM4T paper from @metaai.bsky.social
(nature.com/articles/s41...), here it is: nature.com/articles/d41.... I think SeamlessM4T is still the best publicly available multilingual ASR/speech-translation model.

16.01.2025 14:04 — 👍 1    🔁 0    💬 0    📌 0

Great challenge but very little time...

What is the maximum length of a test utterance (important considering limited GPU RAM on the test server)?

Is ASR CER case sensitive? Are spaces taken into account when computing CER?

04.12.2024 22:15 — 👍 0    🔁 0    💬 0    📌 0

What is the maximum length of a test utterance (important considering limited GPU RAM on the test server)?

Is ASR CER case sensitive? Are spaces taken into account when computing CER?

01.12.2024 10:52 — 👍 0    🔁 0    💬 0    📌 0

Very interesting challenge! Unfortunately there is very little time, considering that participants would have to prepare some kind of container that decodes the test data on the Dynabench server.
Some questions follow...

01.12.2024 10:45 — 👍 1    🔁 0    💬 1    📌 0

@tanelalumae is following 20 prominent accounts