BUT Speech's Avatar

BUT Speech

@butspeech.bsky.social

We do impactful research and raise new leading scientific personalities in the field of speech processing.

11 Followers  |  1 Following  |  42 Posts  |  Joined: 18.01.2025
Posts Following

Posts by BUT Speech (@butspeech.bsky.social)

Post image

Alex Polok has started an internship at CMUโ€™s
WAV Lab @wavlab.bsky.social , continuing DiCoW research on diarization-conditioned target-speaker ASR with Whisper (JSALT 2025).
Next: extending to SpeechLLMs (DiXtral) and joint diarization + TS-ASR toward end-to-end speaker-aware models.

16.01.2026 22:24 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Ladislav Moลกner will defend his PhD dissertation โ€œFar-Field Speaker Verification Incorporating Multichannel Processingโ€ on Wed 14 Jan at 09:00 CET.
๐Ÿ“ FIT BUT, G108 or ๐Ÿ’ป via MS Teams.

Official announcement & abstract: www.fit.vut.cz/fit/info/dd/...

MS-Teams: teams.microsoft.com/l/meetup-joi...

13.01.2026 09:33 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Exam period is in full swing at our Faculty of IT ๐Ÿ“š On Fri, Jan 9, over 500 2nd-year bachelor students took the Signals & Systems (ISS) exam. Honza promised reference fixesโ€ฆ but seems he delegated them (again!) to his cat Kaja ๐Ÿ˜ผ

12.01.2026 13:00 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

On Jan 13, Prachi Singh will give an online talk on Speaker Diarization as part of a Faculty Development Programme by UPES, Dehradun ๐Ÿ‡ฎ๐Ÿ‡ณ. The session covers current trends with hands-on insights. See the LinkedIN page for details:
www.linkedin.com/feed/update/...

12.01.2026 12:59 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Homepage Where AIโ€™s brightest minds connect to exchange ideas, challenge perspectives, and shape the future of artificial intelligence.

Honza will be speaking at the 2nd MBZUAI Speech & NLP Symposium in Abu Dhabi, organized by Hanan Aldarmaki, Bashar Alhafni, Preslav Nakov and Thamar Solorio from the Mohamed bin Zayed University of Artificial Intelligence.

05.01.2026 21:38 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

All the best for 2026 from the Brno crew!

05.01.2026 21:37 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

After presenting at USC (thanks Shri Narayanan), Alex Polok & Sathvik Udupa are sharing their ASRU posters in Honolulu on Dec 8. 10:30-12:00 and 14:00-15:30 respectively. Donโ€™t miss themโ€”and Sara Barahona presenting our VoxCeleb work too!

08.12.2025 19:49 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Hynek Heล™manskรฝ received the 2025 Czech AI Award at Vzlet, Prague! Born in Novรฉ Mฤ›sto na Moravฤ›, with a career spanning OGI, JHU and Google, heโ€™s known for RASTA/PLP and early NN advocacy. Congrats to Hynek and thanks to Czech AI Platform & JIC!

07.12.2025 14:28 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Last week we welcomed Prof. Jan G. ล vec for a great talk on how the human voice worksโ€”from source-filter basics to advanced biomechanics. Big thanks to FIT BUTโ€™s Studentsโ€™ Union for the venue and A/V support!
vgs-it.fit.vutbr.cz/2025/11/04/j...

04.12.2025 14:04 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
6 Phonexia founders + attorney (left)

6 Phonexia founders + attorney (left)

Great news from Phonexia: new growth chapter with Crescendo Equity Partners as 100% owner! Founded in 2006 by 6 Speech@FIT members, Phonexia will see accelerated expansion and R&D investment in speech tech for security/defense and business sectors.

03.12.2025 15:42 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Lin Zhang, longtime Brno collaborator now at JHU, will present the IEEE SPS talk on partially fake speech on Nov 20, 2025, 9:30 AM ET. Also see her Interspeech paper โ€œPartialEdit.โ€
landing.signalprocessingsociety.org/ieee-sps-web...
www.isca-archive.org/interspeech_...

20.11.2025 10:24 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
Czech AI โ€” CNAIP Czech AI. One Map, the Complete Czech AI Ecosystem. Explore an overview of all the key players in the domestic AI market

The Czech National AI Platform is a joint initiative of the public, private, and non-profit sectors.
BUT Speech@FIT is proud to be a part of it. We recommend also having a look at the other Czech AI companies and University labs! www.cnaip.cz/en/czech-ai

20.10.2025 15:43 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Thu July 24 11:00, we will have the last plenary talk of #JSALT2025 - Jordan Boyd-Graber Ying [University of Maryland] will present "Helpful AI Models: You can't always get what you want, but you might get what you needโ€" You can also watch it on YT: youtube.com/playlist?lis...

24.07.2025 08:38 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Tue July 22 11:00, we will have another plenary talk of #JSALT2025 - Xavier Serra [UPF Barcelona] will speak about Methodologies for Music Understanding and Generation in the Context of Trustworthy AI. You can also watch it on YT: youtube.com/playlist?lis...

jsalt2025.fit.vut.cz/plenary-lect...

22.07.2025 09:04 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Today, July 18, at 11:00, Herve Bredin [pyannoteAI, France] will give the 5th Plenary talk at the JSALT workshop "Speaker diarization, a love loss story", see jsalt2025.fit.vut.cz/plenary-lect... for details.

18.07.2025 09:02 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

๐Ÿ“ข Barbara Schuppler (TU Graz) gives the 2nd #JSALT plenary tomorrow, Tue July 1, 11:00 in Room E112: "Cross-layer models for conversational speech recognition in low-resourced scenarios". Join in person or on YouTube: www.youtube.com/playlist?lis... ๐ŸŽค๐Ÿ“บ
jsalt2025.fit.vut.cz/plenary-lect...

01.07.2025 08:33 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Honza is giving a lecture at Charles University in Prague (Faculty of Mathematics and Physics, MFF) today. If you want to attend, note that it takes place in the new buildings of MFF in Troja, not the historical one in Mala Strana.
www.mff.cuni.cz/en/research-...

28.05.2025 15:43 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

We have great pleasure to invite you to a talk of an excellent Czech scientist, Professor at #EPFL, Lenka Zdeborovรก. We have never seen a talk treating machine learning as a problem of statistical physics! Tuesday, May 20, 2025 at 13:00 in lecture room E112 and online www.youtube.com/live/FCvPhHm...

19.05.2025 16:11 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Several speech students participated at the FIT Conference of innovations, technology and science - Excel@FIT. Congratulations to Sathvik, Dominik, and Ondrej for winning Excel prizes!
excel.fit.vutbr.cz/vysledky/

09.05.2025 08:05 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Congratulations to Lin Zhang for her new post-doc position at CLSP at Johns Hopkins University! She is working on anti-spoofing and anonymization with Nicholas Andrews and Matthew Wiesner. She also collaborates closely with Sanjeev Khudanpur, Leibny Paola Garcรญa-Perera, and Kevin Duh.

09.05.2025 05:26 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Over the weekend we to plan for the upcoming #JSALT25 workshop for the topic "Advancing Expert-Level Reasoning and Understanding in Large Audio Language Models". Two days of intense brain storming ๐Ÿง  and planning powered by extra portions of coffee โ˜•๏ธ.
jsalt2025.fit.vut.cz/summer-works...

09.05.2025 05:25 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Glad to announce another married man in the group - Pradyoth's wedding with Sameeksha took place in Mangalore on Thu 3rd April (before ICASSP) in presence of 3.5k guests! All the best!

25.04.2025 07:44 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image Post image Post image Post image

And after ICASSP, Johan. Lukas, Alex, Martas and Santosh even made it to the local newspapers after their visit to the Ramappa Temple UNESCO heritage site!

25.04.2025 07:43 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

The networking activities around ICASSP continued after Meeami: on Monday 7 April, Honza took part in an official visit to IIIT Hyderabad, met its director and spent nice time with Anil Kumar Vuppala and his colleagues and students in Language Technologies Research Center (LTRC).

25.04.2025 07:43 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

A lot of interesting discussions happened during and after the presentations, and also during the amazing lunch at ITC Peshawar. We thank Meeami for hosting us.

07.04.2025 06:55 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Santosh presented the team's work on Aligning foundation models for (1) speech to text translation (2) dialogue state tracking from speech.

07.04.2025 06:55 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Alex presented the team's work on (1) Target speaker ASR with Whisper, (2) Robust ASR via internal language model regularisation, (3) Speech foundation models for European languages using open and legally accessible datasets.

07.04.2025 06:55 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image Post image

The Speech@FIT research group continues its industry collaboration at global scale, with Santosh and Alex recently visiting Meeami Technologies in Hyderabad.

07.04.2025 06:55 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Leveraging Self-Supervised Learning for Speaker Diarization, by Jiangyu Han et al. ieeexplore.ieee.org/stamp/stamp....
utilizes SSL models to alleviate the problem of data scarcity for neural speaker diarization.
Apr 9: 5:00 pm - 6:30 pm, Lecture, Room: MRG.04, Johan Rohdin

02.04.2025 13:20 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Our papers to be presented at ICASSP in Hyderabad!

Target Speaker ASR with Whisper, ieeexplore.ieee.org/document/108...
Introduces a novel approach to training target-speaker ASR systems utilizing frame-level diarization outputs.
Apr 11: 2:00 pm - 3:30 pm, Poster 2E, presented by Alexander Polok

02.04.2025 13:20 โ€” ๐Ÿ‘ 0    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0