Alexandre Défossez (@honualx) 's Twitter Profile
Alexandre Défossez

@honualx

Chief exploration officer @kyutai_labs, with strong interests in stochastic optimization, audio generative models, and AI for science.

ID: 1104779445007716354

linkhttp://ai.honu.io calendar_today10-03-2019 16:22:00

659 Tweet

4,4K Followers

494 Following

Alexandre Défossez (@honualx) 's Twitter Profile Photo

We just released the Helium-1 model , a 2B multi-lingual LLM which Edouard Grave and Laurent Mazare have been crafting for us! Best model so far under 2.17B params on multi-lingual benchmarks 🇬🇧🇮🇹🇪🇸🇵🇹🇫🇷🇩🇪 On HF, under CC-BY licence: huggingface.co/kyutai/helium-…

We just released the Helium-1 model , a 2B multi-lingual LLM which <a href="/EXGRV/">Edouard Grave</a> and <a href="/lmazare/">Laurent Mazare</a> have been crafting for us! Best model so far under 2.17B params on multi-lingual benchmarks 🇬🇧🇮🇹🇪🇸🇵🇹🇫🇷🇩🇪
On HF, under CC-BY licence: huggingface.co/kyutai/helium-…
kyutai (@kyutai_labs) 's Twitter Profile Photo

Helium 2B running locally on an iPhone 16 Pro at 28 tok/s, faster than you can read, all thanks to mlx-swift with q4 quantization!

Neil Zeghidour (@neilzegh) 's Twitter Profile Photo

Today we release Hibiki, real-time speech translation that runs on your phone. Adaptive flow without fancy policy, simple temperature sampling of a multistream audio-text LM. Very proud of Tom Labiausse 's work as an intern.

Alexandre Défossez (@honualx) 's Twitter Profile Photo

We just released Hibiki, a 🎙️-to-🔊 simultaneous translation model 🇫🇷🇬🇧 We leverage a large synthetic corpus synthesized from the text translation model MADLAD, and our own TTS + simple lag rule. Model is decoder only, runs at scale, even on device 📲 github.com/kyutai-labs/hi…

Laurent Mazare (@lmazare) 's Twitter Profile Photo

Afraid of missing out on French pop culture references because you don't speak the language? Fear no more and try our Hibiki speech-to-speech translation model— no more FOMO! 🇫🇷✨ #Translation #AI

Jean-Rémi King (@jeanremiking) 's Twitter Profile Photo

Two new studies from our team we're particularly happy about Study 1: Brain-to-Text Decoding: ai.meta.com/research/publi… Study 2: From Thought to Action: ai.meta.com/research/publi… Blog: ai.meta.com/blog/brain-ai-…

Alexandre Défossez (@honualx) 's Twitter Profile Photo

I’ll be talking about our speech-to-speech framework that underpin Moshi and our recent live translation model Hibiki, this Wednesday 12th at 2:40pm at the MBZUAI workshop!

Alexandre Défossez (@honualx) 's Twitter Profile Photo

I'll present a dive into Moshi 🟢 and our translation model Hibiki 🇫🇷♻️🇬🇧 in the next Convai_rg reading group👨‍🏫📗. 📅 13/03 🕰️ 11am ET, 4pm in Paris. I'll discuss Mimi 🗜️ and multistream audio modeling 🔊. Join on Zoom, replay on YT. ⬛⬛🟧🟧🟨🟨🟩🟩🟩⬛ ⬛🟧🟧🟨🟨🟩🟩🟩⬛⬛

Alexandre Défossez (@honualx) 's Twitter Profile Photo

Just back from holidays, a bit late to announce MoshiVis, extending Moshi's capabilities to take in images 📷. Only 200M trainable weights to plug a vision encoder through cross attention🖼️🔀🎤 Training relies on a mix of text only and text+audio synthetic data (~20k hours) 💽

kyutai (@kyutai_labs) 's Twitter Profile Photo

🚀 Thrilled to announce Helium 1, our new 2B-parameter LLM, now available alongside dactory, an open-source pipeline to reproduce its training dataset covering all 24 EU official languages. Helium sets new standards within its size class on European languages!

🚀 Thrilled to announce Helium 1, our new 2B-parameter LLM, now available alongside dactory, an open-source pipeline to reproduce its training dataset covering all 24 EU official languages. Helium sets new standards within its size class on European languages!
kyutai (@kyutai_labs) 's Twitter Profile Photo

Talk to unmute.sh 🔊, the most modular voice AI around. Empower any text LLM with voice, instantly, by wrapping it with our new speech-to-text and text-to-speech. Any personality, any voice. Interruptible, smart turn-taking. We’ll open-source everything within the

kyutai (@kyutai_labs) 's Twitter Profile Photo

Using Unmute with a custom voice and prompt to create a very intense ice cream seller, inspired by Justin Kuritzkes' sketch🍦