Alex Polok has started an internship at CMUโs
WAV Lab @wavlab.bsky.social , continuing DiCoW research on diarization-conditioned target-speaker ASR with Whisper (JSALT 2025).
Next: extending to SpeechLLMs (DiXtral) and joint diarization + TS-ASR toward end-to-end speaker-aware models.
16.01.2026 22:24 โ
๐ 0
๐ 0
๐ฌ 0
๐ 0
Ladislav Moลกner will defend his PhD dissertation โFar-Field Speaker Verification Incorporating Multichannel Processingโ on Wed 14 Jan at 09:00 CET.
๐ FIT BUT, G108 or ๐ป via MS Teams.
Official announcement & abstract: www.fit.vut.cz/fit/info/dd/...
MS-Teams: teams.microsoft.com/l/meetup-joi...
13.01.2026 09:33 โ
๐ 0
๐ 0
๐ฌ 0
๐ 0
Exam period is in full swing at our Faculty of IT ๐ On Fri, Jan 9, over 500 2nd-year bachelor students took the Signals & Systems (ISS) exam. Honza promised reference fixesโฆ but seems he delegated them (again!) to his cat Kaja ๐ผ
12.01.2026 13:00 โ
๐ 0
๐ 0
๐ฌ 0
๐ 0
On Jan 13, Prachi Singh will give an online talk on Speaker Diarization as part of a Faculty Development Programme by UPES, Dehradun ๐ฎ๐ณ. The session covers current trends with hands-on insights. See the LinkedIN page for details:
www.linkedin.com/feed/update/...
12.01.2026 12:59 โ
๐ 0
๐ 0
๐ฌ 0
๐ 0
Homepage
Where AIโs brightest minds connect to exchange ideas, challenge perspectives, and shape the future of artificial intelligence.
Honza will be speaking at the 2nd MBZUAI Speech & NLP Symposium in Abu Dhabi, organized by Hanan Aldarmaki, Bashar Alhafni, Preslav Nakov and Thamar Solorio from the Mohamed bin Zayed University of Artificial Intelligence.
05.01.2026 21:38 โ
๐ 1
๐ 0
๐ฌ 0
๐ 0
All the best for 2026 from the Brno crew!
05.01.2026 21:37 โ
๐ 0
๐ 0
๐ฌ 0
๐ 0
After presenting at USC (thanks Shri Narayanan), Alex Polok & Sathvik Udupa are sharing their ASRU posters in Honolulu on Dec 8. 10:30-12:00 and 14:00-15:30 respectively. Donโt miss themโand Sara Barahona presenting our VoxCeleb work too!
08.12.2025 19:49 โ
๐ 0
๐ 0
๐ฌ 0
๐ 0
Hynek Heลmanskรฝ received the 2025 Czech AI Award at Vzlet, Prague! Born in Novรฉ Mฤsto na Moravฤ, with a career spanning OGI, JHU and Google, heโs known for RASTA/PLP and early NN advocacy. Congrats to Hynek and thanks to Czech AI Platform & JIC!
07.12.2025 14:28 โ
๐ 0
๐ 0
๐ฌ 0
๐ 0
Last week we welcomed Prof. Jan G. ล vec for a great talk on how the human voice worksโfrom source-filter basics to advanced biomechanics. Big thanks to FIT BUTโs Studentsโ Union for the venue and A/V support!
vgs-it.fit.vutbr.cz/2025/11/04/j...
04.12.2025 14:04 โ
๐ 0
๐ 0
๐ฌ 0
๐ 0
6 Phonexia founders + attorney (left)
Great news from Phonexia: new growth chapter with Crescendo Equity Partners as 100% owner! Founded in 2006 by 6 Speech@FIT members, Phonexia will see accelerated expansion and R&D investment in speech tech for security/defense and business sectors.
03.12.2025 15:42 โ
๐ 0
๐ 0
๐ฌ 0
๐ 0
Lin Zhang, longtime Brno collaborator now at JHU, will present the IEEE SPS talk on partially fake speech on Nov 20, 2025, 9:30 AM ET. Also see her Interspeech paper โPartialEdit.โ
landing.signalprocessingsociety.org/ieee-sps-web...
www.isca-archive.org/interspeech_...
20.11.2025 10:24 โ
๐ 0
๐ 0
๐ฌ 0
๐ 0
Czech AI โ CNAIP
Czech AI. One Map, the Complete Czech AI Ecosystem. Explore an overview of all the key players in the domestic AI market
The Czech National AI Platform is a joint initiative of the public, private, and non-profit sectors.
BUT Speech@FIT is proud to be a part of it. We recommend also having a look at the other Czech AI companies and University labs! www.cnaip.cz/en/czech-ai
20.10.2025 15:43 โ
๐ 0
๐ 0
๐ฌ 0
๐ 0
Thu July 24 11:00, we will have the last plenary talk of #JSALT2025 - Jordan Boyd-Graber Ying [University of Maryland] will present "Helpful AI Models: You can't always get what you want, but you might get what you needโ" You can also watch it on YT: youtube.com/playlist?lis...
24.07.2025 08:38 โ
๐ 0
๐ 0
๐ฌ 0
๐ 0
Tue July 22 11:00, we will have another plenary talk of #JSALT2025 - Xavier Serra [UPF Barcelona] will speak about Methodologies for Music Understanding and Generation in the Context of Trustworthy AI. You can also watch it on YT: youtube.com/playlist?lis...
jsalt2025.fit.vut.cz/plenary-lect...
22.07.2025 09:04 โ
๐ 0
๐ 0
๐ฌ 0
๐ 0
Today, July 18, at 11:00, Herve Bredin [pyannoteAI, France] will give the 5th Plenary talk at the JSALT workshop "Speaker diarization, a love loss story", see jsalt2025.fit.vut.cz/plenary-lect... for details.
18.07.2025 09:02 โ
๐ 0
๐ 0
๐ฌ 0
๐ 0
๐ข Barbara Schuppler (TU Graz) gives the 2nd #JSALT plenary tomorrow, Tue July 1, 11:00 in Room E112: "Cross-layer models for conversational speech recognition in low-resourced scenarios". Join in person or on YouTube: www.youtube.com/playlist?lis... ๐ค๐บ
jsalt2025.fit.vut.cz/plenary-lect...
01.07.2025 08:33 โ
๐ 0
๐ 0
๐ฌ 0
๐ 0
Honza is giving a lecture at Charles University in Prague (Faculty of Mathematics and Physics, MFF) today. If you want to attend, note that it takes place in the new buildings of MFF in Troja, not the historical one in Mala Strana.
www.mff.cuni.cz/en/research-...
28.05.2025 15:43 โ
๐ 0
๐ 0
๐ฌ 0
๐ 0
We have great pleasure to invite you to a talk of an excellent Czech scientist, Professor at #EPFL, Lenka Zdeborovรก. We have never seen a talk treating machine learning as a problem of statistical physics! Tuesday, May 20, 2025 at 13:00 in lecture room E112 and online www.youtube.com/live/FCvPhHm...
19.05.2025 16:11 โ
๐ 0
๐ 0
๐ฌ 0
๐ 0
Several speech students participated at the FIT Conference of innovations, technology and science - Excel@FIT. Congratulations to Sathvik, Dominik, and Ondrej for winning Excel prizes!
excel.fit.vutbr.cz/vysledky/
09.05.2025 08:05 โ
๐ 0
๐ 0
๐ฌ 0
๐ 0
Congratulations to Lin Zhang for her new post-doc position at CLSP at Johns Hopkins University! She is working on anti-spoofing and anonymization with Nicholas Andrews and Matthew Wiesner. She also collaborates closely with Sanjeev Khudanpur, Leibny Paola Garcรญa-Perera, and Kevin Duh.
09.05.2025 05:26 โ
๐ 0
๐ 0
๐ฌ 0
๐ 0
Over the weekend we to plan for the upcoming #JSALT25 workshop for the topic "Advancing Expert-Level Reasoning and Understanding in Large Audio Language Models". Two days of intense brain storming ๐ง and planning powered by extra portions of coffee โ๏ธ.
jsalt2025.fit.vut.cz/summer-works...
09.05.2025 05:25 โ
๐ 0
๐ 0
๐ฌ 0
๐ 0
Glad to announce another married man in the group - Pradyoth's wedding with Sameeksha took place in Mangalore on Thu 3rd April (before ICASSP) in presence of 3.5k guests! All the best!
25.04.2025 07:44 โ
๐ 0
๐ 0
๐ฌ 0
๐ 0
The networking activities around ICASSP continued after Meeami: on Monday 7 April, Honza took part in an official visit to IIIT Hyderabad, met its director and spent nice time with Anil Kumar Vuppala and his colleagues and students in Language Technologies Research Center (LTRC).
25.04.2025 07:43 โ
๐ 0
๐ 0
๐ฌ 1
๐ 0
A lot of interesting discussions happened during and after the presentations, and also during the amazing lunch at ITC Peshawar. We thank Meeami for hosting us.
07.04.2025 06:55 โ
๐ 0
๐ 0
๐ฌ 0
๐ 0
Santosh presented the team's work on Aligning foundation models for (1) speech to text translation (2) dialogue state tracking from speech.
07.04.2025 06:55 โ
๐ 0
๐ 0
๐ฌ 1
๐ 0
Alex presented the team's work on (1) Target speaker ASR with Whisper, (2) Robust ASR via internal language model regularisation, (3) Speech foundation models for European languages using open and legally accessible datasets.
07.04.2025 06:55 โ
๐ 0
๐ 0
๐ฌ 1
๐ 0
The Speech@FIT research group continues its industry collaboration at global scale, with Santosh and Alex recently visiting Meeami Technologies in Hyderabad.
07.04.2025 06:55 โ
๐ 1
๐ 0
๐ฌ 1
๐ 0
Leveraging Self-Supervised Learning for Speaker Diarization, by Jiangyu Han et al. ieeexplore.ieee.org/stamp/stamp....
utilizes SSL models to alleviate the problem of data scarcity for neural speaker diarization.
Apr 9: 5:00 pm - 6:30 pm, Lecture, Room: MRG.04, Johan Rohdin
02.04.2025 13:20 โ
๐ 1
๐ 0
๐ฌ 0
๐ 0
Our papers to be presented at ICASSP in Hyderabad!
Target Speaker ASR with Whisper, ieeexplore.ieee.org/document/108...
Introduces a novel approach to training target-speaker ASR systems utilizing frame-level diarization outputs.
Apr 11: 2:00 pm - 3:30 pm, Poster 2E, presented by Alexander Polok
02.04.2025 13:20 โ
๐ 0
๐ 1
๐ฌ 1
๐ 0