I have a bunch of catch up to do in ocaml, effects/multicore/oxcaml all seem very exciting to play with!
17.07.2025 14:12 β π 2 π 0 π¬ 1 π 0@lmazare.bsky.social
Functional programming enjoyer.
I have a bunch of catch up to do in ocaml, effects/multicore/oxcaml all seem very exciting to play with!
17.07.2025 14:12 β π 2 π 0 π¬ 1 π 0Thanks for the kind words @anil.recoil.org , just to mention that these models are a team effort from a bunch of folks here at Kyutai! The MLX bugs are most likely mine though :) hopefully I'll get some free cycles soon and make an ocaml implementation that will have less of these!
17.07.2025 14:11 β π 2 π 0 π¬ 1 π 0π Say hello to unmute.sh β a modular voice AI system built on our in-house low latency text-to-speech and speech-to-text engines. It works in English π¬π§ and French π«π· and you can customize the voice and personality.
ποΈTry it live and tell us what you think!
I'll present a dive into Moshi π’ and our translation model Hibiki π«π·β»οΈπ¬π§ as part of the next @convai-rg.bsky.social reading group π¨βπ«π.
π
13th of March π°οΈ 11am ET, 4pm in Paris.
I'll discuss Mimi ποΈ and multi-stream audio modeling π.
Join on Zoom, replay on YT.
β¬ β¬ π§ π§ π¨ π¨ π© π© π© β¬
β¬ π§ π§ π¨ π¨ π© π© π© β¬ β¬
Ah, we already have inference codebases in python/rust/swift but maybe we need an ocaml one too for folks that interact with the French ocaml community π
12.02.2025 20:06 β π 5 π 0 π¬ 0 π 0Afraid of missing out on French pop culture references because you don't speak the language? Fear no more and try our Hibiki speech-to-speech translation modelβ no more FOMO! π«π·β¨ #Translation #AI
12.02.2025 13:39 β π 7 π 1 π¬ 1 π 0Even Kavinsky π§πͺ© can't break Hibiki! Just like Moshi, Hibiki is robust to extreme background conditions π₯π.
11.02.2025 16:11 β π 8 π 4 π¬ 0 π 1We just released Hibiki π’, a real time speech-to-speech translation π«π· -> π¬π§. It preserves the voice of the user, and the smaller variant can run on iPhone as showed by Neil in this video.
Find the code on github github.com/kyutai-labs/... and the weights on HF and give it a spin!
Very impressive to hear this Japanese π―π΅ version of moshi π’. I don't speak the language so I cannot understand what it's trying to tell me but at least it sounds great π
github.com/nu-dialogue/...
Getting our latest LLM to run on the edge was pretty fun, it had been a while since I last used swift and it's still a pretty enjoyable language!
14.01.2025 16:42 β π 1 π 0 π¬ 0 π 0That's certainly the plan, ideally this will make it possible to run a moshi locally on a mobile phone but there is still a lot to be done before we get there!
13.01.2025 22:33 β π 1 π 0 π¬ 0 π 0Super proud of our first publicly released text model, helium-1 preview, a 2B model trained on 6 languages. It should be a great fit for on-device applications. Already available in candle/transformers, can't wait to see what the community builds with it! #OpenSource #AI #FTW!
13.01.2025 17:53 β π 7 π 0 π¬ 1 π 0It's certainly open-source but also pretty much a work in progress :) github.com/LaurentMazar... There is no AMD support for now but as long as the programming model is similar to cuda or metal, adding a new backend should be easy (or at least that's the goal).
28.11.2024 16:25 β π 2 π 0 π¬ 0 π 0Ah, sounds like a nice part to have! Note that the bindings are now developed by JS here github.com/janestreet/t... Also lately I've worked on a small compiler for deep learning models, inspired by triton/tinygrad. It's written in rust, but ocaml would be a great choice for writing such a compiler.
28.11.2024 15:31 β π 4 π 0 π¬ 2 π 0The benchmark code with all the matmul variants is on github: github.com/LaurentMazar...
24.11.2024 12:20 β π 0 π 0 π¬ 0 π 0Last week we've received a new M4pro mac mini so I've benched it with various matmul variants and the results are pretty impressive for a tiny form factor. Even with a naive approach it reaches ~5.2TFlops in f32 (so probably more than 10TFlops in bf16), and that's just using the GPU, no NPU for now.
24.11.2024 12:20 β π 3 π 0 π¬ 1 π 0