Our latest open-source speech-to-text model just claimed 1st place among streaming models and 5th place overall on the OpenASR leaderboard.
While all other models need the whole audio, ours delivers top-tier accuracy on streaming content.
Open, fast, and ready for production!
27.06.2025 10:31
Have you enjoyed talking to Moshi and dreamt of making your own speech-to-speech chat experience? It's now possible with the moshi-finetune codebase! Plug in your own dataset and change the voice/tone/personality of Moshi. An example after finetuning w/ only 20 hours of the DailyTalk dataset.
01.04.2025 15:47
Just back from holidays, so a bit late to announce MoshiVis, which extends Moshi's multimodal capabilities to take in images.
Only 200M weights were added to plug in a ViT through cross-attention with gating.
Training relies on a mix of text-only and text+audio synthetic data (~20k hours).
31.03.2025 10:06
Hello!
15.03.2025 07:13
Anti-cynic. Towards a weirder future. Reinforcement Learning, Autonomous Vehicles, transportation systems, the works. Asst. Prof at NYU
https://emerge-lab.github.io
https://www.admonymous.co/eugenevinitsky
Sr. Software Engineer / Combining mathematics with code to solve problems / Owner of too many audiobooks and music recordings
Chief Models Officer @ Stealth Startup; Inria & MVA - Ex: Llama @AIatMeta & Gemini and BYOL @GoogleDeepMind
machine learning researcher @Apple | PhD from @CoML_ENS | speech, ml and cognition.
Mathematician at UCLA. My primary social media account is https://mathstodon.xyz/@tao . I also have a blog at https://terrytao.wordpress.com/ and a home page at https://www.math.ucla.edu/~tao/
Software engineer at probabl, scikit-learn contributor.
Also at:
https://sigmoid.social/@ogrisel
https://github.com/ogrisel
Research Scientist Meta/FAIR, Prof. University of Geneva, co-founder Neural Concept SA. I like reality.
https://fleuret.org
Research Scientist at valeo.ai | Teaching at Polytechnique, ENS | Alumni at Mines Paris, Inria, ENS | AI for Autonomous Driving, Computer Vision, Machine Learning | Robotics amateur
Paris, France | abursuc.github.io
Cofounded and lead PyTorch at Meta. Also dabble in robotics at NYU.
AI is delicious when it is accessible and open-source.
http://soumith.ch
Researcher in machine learning
Google Chief Scientist, Gemini Lead. Opinions stated here are my own, not those of Google. Gemini, TensorFlow, MapReduce, Bigtable, Spanner, ML things, ...
Co-founder and CEO, Mistral AI
Professor at NYU; Chief AI Scientist at Meta.
Researcher in AI, Machine Learning, Robotics, etc.
ACM Turing Award Laureate.
http://yann.lecun.com
AI safety at Anthropic, on leave from a faculty job at NYU.
Views not employers'.
I think you should join Giving What We Can.
cims.nyu.edu/~sbowman
Research & code: Research director @inria
► Data, Health, & Computer science
► Python coder, (co)founder of scikit-learn, joblib, & @probabl.bsky.social
► Sometimes does art photography
► Physics PhD
Developer on PyTorch at Meta. Previously Haskeller and GHC developer.