Alexandre Défossez (@honualx) Twitter Tweets • TwiCopy

Gate.io

5 hours ago

🔥The 9th Round of Easy Loan, Earn $40 Reward is in progress❗️ ⏰ Promotion Period: January 15th - Feburary 15th, 2025 👉 Register now and check more details at gate.io/campaigns/358

thumb_up_off_alt34

chat_bubble_outline39

repeat6

shareShare

We just released the Helium-1 model , a 2B multi-lingual LLM which Edouard Grave and Laurent Mazare have been crafting for us! Best model so far under 2.17B params on multi-lingual benchmarks 🇬🇧🇮🇹🇪🇸🇵🇹🇫🇷🇩🇪 On HF, under CC-BY licence: huggingface.co/kyutai/helium-…

We just released the Helium-1 model , a 2B multi-lingual LLM which <a href="/EXGRV/">Edouard Grave</a> and <a href="/lmazare/">Laurent Mazare</a> have been crafting for us! Best model so far under 2.17B params on multi-lingual benchmarks 🇬🇧🇮🇹🇪🇸🇵🇹🇫🇷🇩🇪
On HF, under CC-BY licence: huggingface.co/kyutai/helium-…

thumb_up_off_alt42

chat_bubble_outline0

repeat4

shareShare

kyutai

@kyutai_labs

6 months ago

Helium 2B running locally on an iPhone 16 Pro at 28 tok/s, faster than you can read, all thanks to mlx-swift with q4 quantization!

thumb_up_off_alt131

chat_bubble_outline7

repeat20

shareShare

Neil Zeghidour

@neilzegh

6 months ago

Today we release Hibiki, real-time speech translation that runs on your phone. Adaptive flow without fancy policy, simple temperature sampling of a multistream audio-text LM. Very proud of Tom Labiausse 's work as an intern.

thumb_up_off_alt398

chat_bubble_outline11

repeat53

shareShare

Alexandre Défossez

@honualx

6 months ago

We just released Hibiki, a 🎙️-to-🔊 simultaneous translation model 🇫🇷🇬🇧 We leverage a large synthetic corpus synthesized from the text translation model MADLAD, and our own TTS + simple lag rule. Model is decoder only, runs at scale, even on device 📲 github.com/kyutai-labs/hi…

thumb_up_off_alt56

chat_bubble_outline3

repeat7

shareShare

Laurent Mazare

@lmazare

6 months ago

Afraid of missing out on French pop culture references because you don't speak the language? Fear no more and try our Hibiki speech-to-speech translation model— no more FOMO! 🇫🇷✨ #Translation #AI

thumb_up_off_alt27

chat_bubble_outline1

repeat2

shareShare

Jean-Rémi King

@jeanremiking

6 months ago

Two new studies from our team we're particularly happy about Study 1: Brain-to-Text Decoding: ai.meta.com/research/publi… Study 2: From Thought to Action: ai.meta.com/research/publi… Blog: ai.meta.com/blog/brain-ai-…

thumb_up_off_alt293

chat_bubble_outline5

repeat88

shareShare

kyutai

@kyutai_labs

6 months ago

Thanks to Xavier Niel for stopping by at the #AIActionSummit to try Hibiki. No need to struggle with English anymore 😅

thumb_up_off_alt214

chat_bubble_outline9

repeat27

shareShare

Alexandre Défossez

@honualx

6 months ago

I’ll be talking about our speech-to-speech framework that underpin Moshi and our recent live translation model Hibiki, this Wednesday 12th at 2:40pm at the MBZUAI workshop!

thumb_up_off_alt18

chat_bubble_outline1

repeat1

shareShare

Jean-Rémi King

@jeanremiking

6 months ago

Very happy to have participated in this *beautiful* documentary from Florent Muller on the frontiers of humans and machine together with Yann LeCun Joelle Pineau Tom M Mitchell Alexandre Défossez and many more france.tv/documentaires/…

thumb_up_off_alt31

chat_bubble_outline2

repeat10

shareShare

Neil Zeghidour

@neilzegh

5 months ago

Great night hitting the club with Alexandre Défossez and Hibiki!

thumb_up_off_alt29

chat_bubble_outline1

repeat5

shareShare

Alexandre Défossez

@honualx

5 months ago

I'll present a dive into Moshi 🟢 and our translation model Hibiki 🇫🇷♻️🇬🇧 in the next Convai_rg reading group👨‍🏫📗. 📅 13/03 🕰️ 11am ET, 4pm in Paris. I'll discuss Mimi 🗜️ and multistream audio modeling 🔊. Join on Zoom, replay on YT. ⬛⬛🟧🟧🟨🟨🟩🟩🟩⬛ ⬛🟧🟧🟨🟨🟩🟩🟩⬛⬛

thumb_up_off_alt36

chat_bubble_outline0

repeat4

shareShare

Alexandre Défossez

@honualx

4 months ago

I haven't tested it yet, but those results looks really impressive, so nice that it is open source and medium sized.

thumb_up_off_alt10

chat_bubble_outline0

repeat0

shareShare

kyutai

@kyutai_labs

4 months ago

What are we waiting for? 🤔

thumb_up_off_alt246

chat_bubble_outline10

repeat28

shareShare

Alexandre Défossez

@honualx

4 months ago

Just back from holidays, a bit late to announce MoshiVis, extending Moshi's capabilities to take in images 📷. Only 200M trainable weights to plug a vision encoder through cross attention🖼️🔀🎤 Training relies on a mix of text only and text+audio synthetic data (~20k hours) 💽

thumb_up_off_alt31

chat_bubble_outline0

repeat2

shareShare

Alexandre Défossez

@honualx

4 months ago

We just open sourced a fine tuning codebase for Moshi!

thumb_up_off_alt76

chat_bubble_outline2

repeat11

shareShare

kyutai

@kyutai_labs

3 months ago

🚀 Thrilled to announce Helium 1, our new 2B-parameter LLM, now available alongside dactory, an open-source pipeline to reproduce its training dataset covering all 24 EU official languages. Helium sets new standards within its size class on European languages!

thumb_up_off_alt325

chat_bubble_outline11

repeat57

shareShare

kyutai

@kyutai_labs

2 months ago

Talk to unmute.sh 🔊, the most modular voice AI around. Empower any text LLM with voice, instantly, by wrapping it with our new speech-to-text and text-to-speech. Any personality, any voice. Interruptible, smart turn-taking. We’ll open-source everything within the

thumb_up_off_alt1,1K

chat_bubble_outline83

repeat212

shareShare

kyutai

@kyutai_labs

2 months ago

Using Unmute with a custom voice and prompt to create a very intense ice cream seller, inspired by Justin Kuritzkes' sketch🍦

thumb_up_off_alt175

chat_bubble_outline7

repeat8

shareShare