BUT Speech's Avatar

BUT Speech

@butspeech.bsky.social

We do impactful research and raise new leading scientific personalities in the field of speech processing.

12 Followers  |  1 Following  |  31 Posts  |  Joined: 18.01.2025  |  2.13

Latest posts by butspeech.bsky.social on Bluesky

Preview
Czech AI โ€” CNAIP Czech AI. One Map, the Complete Czech AI Ecosystem. Explore an overview of all the key players in the domestic AI market

The Czech National AI Platform is a joint initiative of the public, private, and non-profit sectors.
BUT Speech@FIT is proud to be a part of it. We recommend also having a look at the other Czech AI companies and University labs! www.cnaip.cz/en/czech-ai

20.10.2025 15:43 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Thu July 24 11:00, we will have the last plenary talk of #JSALT2025 - Jordan Boyd-Graber Ying [University of Maryland] will present "Helpful AI Models: You can't always get what you want, but you might get what you needโ€" You can also watch it on YT: youtube.com/playlist?lis...

24.07.2025 08:38 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Tue July 22 11:00, we will have another plenary talk of #JSALT2025 - Xavier Serra [UPF Barcelona] will speak about Methodologies for Music Understanding and Generation in the Context of Trustworthy AI. You can also watch it on YT: youtube.com/playlist?lis...

jsalt2025.fit.vut.cz/plenary-lect...

22.07.2025 09:04 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Today, July 18, at 11:00, Herve Bredin [pyannoteAI, France] will give the 5th Plenary talk at the JSALT workshop "Speaker diarization, a love loss story", see jsalt2025.fit.vut.cz/plenary-lect... for details.

18.07.2025 09:02 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

๐Ÿ“ข Barbara Schuppler (TU Graz) gives the 2nd #JSALT plenary tomorrow, Tue July 1, 11:00 in Room E112: "Cross-layer models for conversational speech recognition in low-resourced scenarios". Join in person or on YouTube: www.youtube.com/playlist?lis... ๐ŸŽค๐Ÿ“บ
jsalt2025.fit.vut.cz/plenary-lect...

01.07.2025 08:33 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Honza is giving a lecture at Charles University in Prague (Faculty of Mathematics and Physics, MFF) today. If you want to attend, note that it takes place in the new buildings of MFF in Troja, not the historical one in Mala Strana.
www.mff.cuni.cz/en/research-...

28.05.2025 15:43 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

We have great pleasure to invite you to a talk of an excellent Czech scientist, Professor at #EPFL, Lenka Zdeborovรก. We have never seen a talk treating machine learning as a problem of statistical physics! Tuesday, May 20, 2025 at 13:00 in lecture room E112 and online www.youtube.com/live/FCvPhHm...

19.05.2025 16:11 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Several speech students participated at the FIT Conference of innovations, technology and science - Excel@FIT. Congratulations to Sathvik, Dominik, and Ondrej for winning Excel prizes!
excel.fit.vutbr.cz/vysledky/

09.05.2025 08:05 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Congratulations to Lin Zhang for her new post-doc position at CLSP at Johns Hopkins University! She is working on anti-spoofing and anonymization with Nicholas Andrews and Matthew Wiesner. She also collaborates closely with Sanjeev Khudanpur, Leibny Paola Garcรญa-Perera, and Kevin Duh.

09.05.2025 05:26 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Over the weekend we to plan for the upcoming #JSALT25 workshop for the topic "Advancing Expert-Level Reasoning and Understanding in Large Audio Language Models". Two days of intense brain storming ๐Ÿง  and planning powered by extra portions of coffee โ˜•๏ธ.
jsalt2025.fit.vut.cz/summer-works...

09.05.2025 05:25 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Glad to announce another married man in the group - Pradyoth's wedding with Sameeksha took place in Mangalore on Thu 3rd April (before ICASSP) in presence of 3.5k guests! All the best!

25.04.2025 07:44 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image Post image Post image Post image

And after ICASSP, Johan. Lukas, Alex, Martas and Santosh even made it to the local newspapers after their visit to the Ramappa Temple UNESCO heritage site!

25.04.2025 07:43 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

The networking activities around ICASSP continued after Meeami: on Monday 7 April, Honza took part in an official visit to IIIT Hyderabad, met its director and spent nice time with Anil Kumar Vuppala and his colleagues and students in Language Technologies Research Center (LTRC).

25.04.2025 07:43 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

A lot of interesting discussions happened during and after the presentations, and also during the amazing lunch at ITC Peshawar. We thank Meeami for hosting us.

07.04.2025 06:55 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Santosh presented the team's work on Aligning foundation models for (1) speech to text translation (2) dialogue state tracking from speech.

07.04.2025 06:55 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Alex presented the team's work on (1) Target speaker ASR with Whisper, (2) Robust ASR via internal language model regularisation, (3) Speech foundation models for European languages using open and legally accessible datasets.

07.04.2025 06:55 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image Post image

The Speech@FIT research group continues its industry collaboration at global scale, with Santosh and Alex recently visiting Meeami Technologies in Hyderabad.

07.04.2025 06:55 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Leveraging Self-Supervised Learning for Speaker Diarization, by Jiangyu Han et al. ieeexplore.ieee.org/stamp/stamp....
utilizes SSL models to alleviate the problem of data scarcity for neural speaker diarization.
Apr 9: 5:00 pm - 6:30 pm, Lecture, Room: MRG.04, Johan Rohdin

02.04.2025 13:20 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Our papers to be presented at ICASSP in Hyderabad!

Target Speaker ASR with Whisper, ieeexplore.ieee.org/document/108...
Introduces a novel approach to training target-speaker ASR systems utilizing frame-level diarization outputs.
Apr 11: 2:00 pm - 3:30 pm, Poster 2E, presented by Alexander Polok

02.04.2025 13:20 โ€” ๐Ÿ‘ 0    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

Yesterday, we hosted folks from @butspeech.bsky.social, Phonexia, Phrase, and MAMA AI at the first meeting of the Linguistics, AI, Speech, and Language Technologies project, which is funded by @msmtcr and the EU's Programme Johannes Amos Comenius.

26.02.2025 16:11 โ€” ๐Ÿ‘ 6    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

๐Ÿ”— Competition details: www.nexdata.ai/competition/...
This work builds on DiCoW, our diarization-conditioned ASR modelโ€”learn more in our paper:
๐Ÿ”— arxiv.org/abs/2501.00114
๐Ÿ–ฅ๏ธ Codebase available on GitHub:
๐Ÿ”— github.com/BUTSpeechFIT...
[4/4]

24.03.2025 20:00 โ€” ๐Ÿ‘ 0    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

๐Ÿ” Why should you try it?
โœ… Strong starting point for multilingual conversational ASR research
โœ… Open for experimentation, adaptation, and fine-tuning
โœ… Join us in pushing the boundaries of robust, multilingual speech recognition
๐Ÿš€ Test and improve multilingual conversational ASR
[3/4]

24.03.2025 20:00 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

๐Ÿ“Š Baseline WER (No Domain Adaptation Yet, Oracle diarization):
๐Ÿ‡บ๐Ÿ‡ธ English (American): 9.4%
๐Ÿ‡ฎ๐Ÿ‡ณ English (Indian): 15.1%
๐Ÿ‡ต๐Ÿ‡ญ English (Filipino): 11.3%
๐Ÿ‡ฉ๐Ÿ‡ช German: 19.7%
๐Ÿ†• Now supports transcription of multiple speakers speaking different languages! ๐ŸŒ๐Ÿ—ฃ๏ธ
[2/4]

24.03.2025 20:00 โ€” ๐Ÿ‘ 0    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

๐Ÿ—ฃ๏ธ Are you participating in the Interspeech 2025 Workshop on Multilingual Conversational Speech Language Models organised by Nexdataใ€ๆ—งDatatangๆ ชๅผไผš็คพๅ…ฌๅผใ€‘?

Weโ€™ve released our baseline model for the communityโ€”ready for you to explore and build upon!
๐Ÿ”— Try it here: pccnect.fit.vutbr.cz/gradio-demo/
[1/4]

24.03.2025 20:00 โ€” ๐Ÿ‘ 0    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

๐Ÿ“Š Baseline WER (No Domain Adaptation Yet, Oracle diarization):
๐Ÿ‡บ๐Ÿ‡ธ English (American): 9.4%
๐Ÿ‡ฎ๐Ÿ‡ณ English (Indian): 15.1%
๐Ÿ‡ต๐Ÿ‡ญ English (Filipino): 11.3%
๐Ÿ‡ฉ๐Ÿ‡ช German: 19.7%
๐Ÿ†• Now supports transcription of multiple speakers speaking different languages! ๐ŸŒ๐Ÿ—ฃ๏ธ
[2/4]

24.03.2025 19:57 โ€” ๐Ÿ‘ 1    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Committee

Committee

Beer bill

Beer bill

Congratulations to Dr. Karel ! The defense as well as the "one" in the evening were serious and successful.

14.03.2025 13:39 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Speech group members, past members and affiliates gathered for some skiing (probably the last this season) in Stuhleck. We were more but it was impossible to get everyone in one photo - some skiers were simply too fast!

14.03.2025 13:33 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Exciting news! ๐ŸŽ‰Karel Benesโ€™s PhD defense on "Language models supporting imperfect handwriting and speech recognition systems" is next Monday, March 10, 2025, at 10:00 in room G108 at FIT. Come or connect to support Karel! ๐Ÿ™Œ
teams.microsoft.com/l/meetup-joi...

10.03.2025 07:21 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

๐Ÿš€ From 2019โ€“2023, we worked on NEUREM3, exploring neural representations in multi-modal & multilingual modeling. ๐ŸŽ™๏ธProud of our achievements in speaker recognition, diarization & target speaker extraction at BUT! Our final report is now public. ๐Ÿ“–
www.fit.vut.cz/research/gro...

15.02.2025 15:43 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Excited to join AirbusDefence and 7 EU partners in the EDF project ARCHER! ARCHER will organize several rounds of human language technologies evaluations for defence domain scenarios. Our team will focus on building strong ASR and OCR baselines ๐Ÿ’ช!
defence-industry-space.ec.europa.eu/document/dow...

08.02.2025 19:32 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

@butspeech is following 1 prominent accounts