Rogier van Dalen's Avatar

Rogier van Dalen

@rogiercvd.bsky.social

Researcher in machine learning (speech recognition / private federated learning) in Cambridge

41 Followers  |  83 Following  |  2 Posts  |  Joined: 18.11.2024  |  1.5344

Latest posts by rogiercvd.bsky.social on Bluesky

Preview
BLOG | Samsung Research Globally Normalizing the Transducer for Streaming Speech Recognition

There is now a blog post explaining how to fix the mathematics of streaming speech recognisers. research.samsung.com/blog/Globall...

16.04.2025 14:57 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
Globally Normalizing the Transducer for Streaming Speech Recognition The Transducer (e.g. RNN-Transducer or Conformer-Transducer) generates an output label sequence as it traverses the input sequence. It is straightforward to use in streaming mode, where it generates p...

Your streaming speech recognizer is probably mathematically flawed, degrading its accuracy. Ask me to explain how to fix this next week in the Thursday morning poster session at #ICASSP, or look at ieeexplore.ieee.org/abstract/doc...

01.04.2025 15:47 β€” πŸ‘ 4    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0
Preview
a person is using a bosch drill to drill the f5 and f6 keys ALT: a person is using a bosch drill to drill the f5 and f6 keys

#ICASSP2025

19.12.2024 11:08 β€” πŸ‘ 3    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

@rogiercvd is following 20 prominent accounts