 
                        
                Czech AI โ CNAIP
                Czech AI. One Map, the Complete Czech AI Ecosystem. Explore an overview of all the key players in the domestic AI market
            
        
    
    
            The Czech National AI Platform is a joint initiative of the public, private, and non-profit sectors. 
BUT Speech@FIT is proud to be a part of it. We recommend also having a look at the other Czech AI companies and University labs! www.cnaip.cz/en/czech-ai
               
            
            
                20.10.2025 15:43 โ ๐ 0    ๐ 0    ๐ฌ 0    ๐ 0                      
            
         
            
        
            
            
            
            
                                                 
                                                
    
    
    
    
            Thu July 24 11:00, we will have the last plenary talk of #JSALT2025 - Jordan Boyd-Graber Ying [University of Maryland] will present "Helpful AI Models: You can't always get what you want, but you might get what you needโ" You can also watch it on YT: youtube.com/playlist?lis...
               
            
            
                24.07.2025 08:38 โ ๐ 0    ๐ 0    ๐ฌ 0    ๐ 0                      
            
         
            
        
            
            
            
            
                                                 
                                                
    
    
    
    
            Tue July 22 11:00, we will have another plenary talk of #JSALT2025 - Xavier Serra [UPF Barcelona] will speak about Methodologies for Music Understanding and Generation in the Context of Trustworthy AI. You can also watch it on YT: youtube.com/playlist?lis...
jsalt2025.fit.vut.cz/plenary-lect...
               
            
            
                22.07.2025 09:04 โ ๐ 0    ๐ 0    ๐ฌ 0    ๐ 0                      
            
         
            
        
            
            
            
            
                                                 
                                                
    
    
    
    
            Today, July 18, at 11:00, Herve Bredin [pyannoteAI, France] will give the 5th Plenary talk at the JSALT workshop "Speaker diarization, a love loss story", see jsalt2025.fit.vut.cz/plenary-lect...  for details.
               
            
            
                18.07.2025 09:02 โ ๐ 0    ๐ 0    ๐ฌ 0    ๐ 0                      
            
         
            
        
            
            
            
            
                                                 
                                                
    
    
    
    
            ๐ข Barbara Schuppler (TU Graz) gives the 2nd #JSALT plenary tomorrow, Tue July 1, 11:00 in Room E112: "Cross-layer models for conversational speech recognition in low-resourced scenarios". Join in person or on YouTube: www.youtube.com/playlist?lis... ๐ค๐บ
jsalt2025.fit.vut.cz/plenary-lect...
               
            
            
                01.07.2025 08:33 โ ๐ 0    ๐ 0    ๐ฌ 0    ๐ 0                      
            
         
            
        
            
            
            
            
                                                 
                                                
    
    
    
    
            Honza is giving a lecture at Charles University in Prague (Faculty of Mathematics and Physics, MFF) today. If you want to attend, note  that it takes place in the new buildings of MFF in Troja, not the historical one in Mala Strana.
www.mff.cuni.cz/en/research-...
               
            
            
                28.05.2025 15:43 โ ๐ 0    ๐ 0    ๐ฌ 0    ๐ 0                      
            
         
            
        
            
            
            
            
                                                 
                                                
    
    
    
    
            We have great pleasure to invite you to a talk of an excellent Czech scientist, Professor at #EPFL, Lenka Zdeborovรก. We have never seen a talk treating machine learning as a problem of statistical physics! Tuesday, May 20, 2025 at 13:00 in lecture room E112 and online www.youtube.com/live/FCvPhHm...
               
            
            
                19.05.2025 16:11 โ ๐ 0    ๐ 0    ๐ฌ 0    ๐ 0                      
            
         
            
        
            
            
            
            
                                                 
                                                
    
    
    
    
            Several speech students participated at the FIT Conference of innovations, technology and science - Excel@FIT. Congratulations to Sathvik, Dominik, and Ondrej for winning Excel prizes! 
excel.fit.vutbr.cz/vysledky/
               
            
            
                09.05.2025 08:05 โ ๐ 0    ๐ 0    ๐ฌ 0    ๐ 0                      
            
         
            
        
            
            
            
            
                                                 
                                                
    
    
    
    
            Congratulations to Lin Zhang for her new post-doc position at CLSP at Johns Hopkins University! She is working on anti-spoofing and anonymization with Nicholas Andrews and Matthew Wiesner. She also collaborates closely with Sanjeev Khudanpur, Leibny Paola Garcรญa-Perera, and Kevin Duh.
               
            
            
                09.05.2025 05:26 โ ๐ 0    ๐ 0    ๐ฌ 0    ๐ 0                      
            
         
            
        
            
            
            
            
                                                 
                                                
    
    
    
    
            Over the weekend we to plan  for the upcoming #JSALT25 workshop for the topic "Advancing Expert-Level Reasoning and Understanding in Large Audio Language Models". Two days of intense brain storming ๐ง  and planning powered by extra portions of coffee โ๏ธ.
jsalt2025.fit.vut.cz/summer-works...
               
            
            
                09.05.2025 05:25 โ ๐ 0    ๐ 0    ๐ฌ 0    ๐ 0                      
            
         
            
        
            
            
            
            
                                                 
                                                
    
    
    
    
            Glad to announce another married man in the group - Pradyoth's wedding with Sameeksha took place in Mangalore on Thu 3rd April (before ICASSP) in presence of 3.5k guests! All the best!
               
            
            
                25.04.2025 07:44 โ ๐ 0    ๐ 0    ๐ฌ 0    ๐ 0                      
            
         
            
        
            
        
            
            
            
            
            
    
    
    
    
            The networking activities around ICASSP continued after Meeami: on Monday 7 April, Honza took part in an official visit to IIIT Hyderabad, met its director and spent nice time with Anil Kumar Vuppala and his colleagues and students in Language Technologies Research Center (LTRC).
               
            
            
                25.04.2025 07:43 โ ๐ 0    ๐ 0    ๐ฌ 1    ๐ 0                      
            
         
            
        
            
            
            
            
            
    
    
    
    
            A lot of interesting discussions happened during and after the presentations, and also during the amazing lunch at ITC Peshawar. We thank Meeami for hosting us.
               
            
            
                07.04.2025 06:55 โ ๐ 0    ๐ 0    ๐ฌ 0    ๐ 0                      
            
         
            
        
            
            
            
            
            
    
    
    
    
            Santosh presented the team's work on Aligning foundation models for (1) speech to text translation (2) dialogue state tracking from speech.
               
            
            
                07.04.2025 06:55 โ ๐ 0    ๐ 0    ๐ฌ 1    ๐ 0                      
            
         
            
        
            
            
            
            
            
    
    
    
    
            Alex presented the team's work on (1) Target speaker ASR with Whisper, (2) Robust ASR via internal language model regularisation, (3) Speech foundation models for European languages using open and legally accessible datasets.
               
            
            
                07.04.2025 06:55 โ ๐ 0    ๐ 0    ๐ฌ 1    ๐ 0                      
            
         
            
        
            
            
            
            
                                                 
                                                         
                                                
    
    
    
    
            The Speech@FIT research group continues its industry collaboration at global scale, with Santosh and Alex recently visiting Meeami Technologies in Hyderabad.
               
            
            
                07.04.2025 06:55 โ ๐ 1    ๐ 0    ๐ฌ 1    ๐ 0                      
            
         
            
        
            
            
            
            
            
    
    
    
    
            Leveraging Self-Supervised Learning for Speaker Diarization, by Jiangyu Han et al. ieeexplore.ieee.org/stamp/stamp....
utilizes SSL models to alleviate the problem of data scarcity for neural speaker diarization.
Apr 9: 5:00 pm - 6:30 pm, Lecture, Room: MRG.04, Johan Rohdin
               
            
            
                02.04.2025 13:20 โ ๐ 1    ๐ 0    ๐ฌ 0    ๐ 0                      
            
         
            
        
            
            
            
            
            
    
    
    
    
            Our papers to be presented at ICASSP in Hyderabad!
Target Speaker ASR with Whisper, ieeexplore.ieee.org/document/108...
Introduces a novel approach to training target-speaker ASR systems utilizing frame-level diarization outputs.
Apr 11: 2:00 pm - 3:30 pm, Poster 2E, presented by Alexander Polok
               
            
            
                02.04.2025 13:20 โ ๐ 0    ๐ 1    ๐ฌ 1    ๐ 0                      
            
         
            
        
            
            
            
            
                                                 
                                                
    
    
    
    
            Yesterday, we hosted folks from @butspeech.bsky.social, Phonexia, Phrase, and MAMA AI at the first meeting of the Linguistics, AI, Speech, and Language Technologies project, which is funded by @msmtcr and the EU's Programme Johannes Amos Comenius.
               
            
            
                26.02.2025 16:11 โ ๐ 6    ๐ 1    ๐ฌ 1    ๐ 0                      
            
         
            
        
            
            
            
            
            
    
    
    
    
            ๐ Competition details: www.nexdata.ai/competition/...
This work builds on DiCoW, our diarization-conditioned ASR modelโlearn more in our paper:
๐ arxiv.org/abs/2501.00114
๐ฅ๏ธ Codebase available on GitHub:
๐ github.com/BUTSpeechFIT...
[4/4]
               
            
            
                24.03.2025 20:00 โ ๐ 0    ๐ 1    ๐ฌ 0    ๐ 0                      
            
         
            
        
            
            
            
            
            
    
    
    
    
            ๐ Why should you try it?
โ
 Strong starting point for multilingual conversational ASR research
โ
 Open for experimentation, adaptation, and fine-tuning
โ
 Join us in pushing the boundaries of robust, multilingual speech recognition
๐ Test and improve multilingual conversational ASR
[3/4]
               
            
            
                24.03.2025 20:00 โ ๐ 0    ๐ 0    ๐ฌ 1    ๐ 0                      
            
         
            
        
            
            
            
            
            
    
    
    
    
            ๐ Baseline WER (No Domain Adaptation Yet, Oracle diarization):
๐บ๐ธ English (American): 9.4%
๐ฎ๐ณ English (Indian): 15.1%
๐ต๐ญ English (Filipino): 11.3%
๐ฉ๐ช German: 19.7%
๐ Now supports transcription of multiple speakers speaking different languages! ๐๐ฃ๏ธ
[2/4]
               
            
            
                24.03.2025 20:00 โ ๐ 0    ๐ 1    ๐ฌ 1    ๐ 0                      
            
         
            
        
            
            
            
            
                                                 
                                                
    
    
    
    
            ๐ฃ๏ธ Are you participating in the Interspeech 2025 Workshop on Multilingual Conversational Speech Language Models organised by NexdataใๆงDatatangๆ ชๅผไผ็คพๅ
ฌๅผใ?
Weโve released our baseline model for the communityโready for you to explore and build upon!
๐ Try it here: pccnect.fit.vutbr.cz/gradio-demo/
[1/4]
               
            
            
                24.03.2025 20:00 โ ๐ 0    ๐ 1    ๐ฌ 1    ๐ 0                      
            
         
            
        
            
            
            
            
            
    
    
    
    
            ๐ Baseline WER (No Domain Adaptation Yet, Oracle diarization):
๐บ๐ธ English (American): 9.4%
๐ฎ๐ณ English (Indian): 15.1%
๐ต๐ญ English (Filipino): 11.3%
๐ฉ๐ช German: 19.7%
๐ Now supports transcription of multiple speakers speaking different languages! ๐๐ฃ๏ธ
[2/4]
               
            
            
                24.03.2025 19:57 โ ๐ 1    ๐ 1    ๐ฌ 0    ๐ 0                      
            
         
            
        
            
            
            
            
                                                 
                                            Committee
                                                         
                                            Beer bill
                                                
    
    
    
    
            Congratulations to Dr. Karel ! The defense as well as the "one" in the evening were serious and successful.
               
            
            
                14.03.2025 13:39 โ ๐ 0    ๐ 0    ๐ฌ 0    ๐ 0                      
            
         
            
        
            
            
            
            
                                                 
                                                
    
    
    
    
            Speech group members, past members and affiliates gathered for some skiing (probably the last this season) in Stuhleck.  We were more but it was impossible to get everyone in one photo - some skiers were simply too fast!
               
            
            
                14.03.2025 13:33 โ ๐ 0    ๐ 0    ๐ฌ 0    ๐ 0                      
            
         
            
        
            
            
            
            
            
    
    
    
    
            Exciting news! ๐Karel Benesโs PhD defense on "Language models supporting imperfect handwriting and speech recognition systems" is next Monday, March 10, 2025, at 10:00 in room G108 at FIT. Come or connect to support Karel! ๐
teams.microsoft.com/l/meetup-joi...
               
            
            
                10.03.2025 07:21 โ ๐ 0    ๐ 0    ๐ฌ 0    ๐ 0                      
            
         
            
        
            
            
            
            
            
    
    
    
    
            ๐ From 2019โ2023, we worked on NEUREM3, exploring neural representations in multi-modal & multilingual modeling. ๐๏ธProud of our achievements in speaker recognition, diarization & target speaker extraction at BUT! Our final report is now public. ๐
www.fit.vut.cz/research/gro...
               
            
            
                15.02.2025 15:43 โ ๐ 0    ๐ 0    ๐ฌ 0    ๐ 0                      
            
         
            
        
            
            
            
            
                                                 
                                                
    
    
    
    
            Excited to join AirbusDefence and 7 EU partners in the EDF project ARCHER! ARCHER will organize several rounds of human language technologies evaluations for defence domain scenarios. Our team will focus on building strong ASR and OCR baselines ๐ช!
defence-industry-space.ec.europa.eu/document/dow...
               
            
            
                08.02.2025 19:32 โ ๐ 1    ๐ 0    ๐ฌ 0    ๐ 0