Patrick Pérez (@ptrkprz) 's Twitter Profile
Patrick Pérez

@ptrkprz

AI & CV scientist, CEO at @kyutai_labs

ID: 1734208975267082240

linkhttps://ptrckprz.github.io calendar_today11-12-2023 13:51:44

38 Tweet

675 Followers

62 Following

Ian Hogarth (@soundboy) 's Twitter Profile Photo

1/ Today the UK's AI Safety Institute is open sourcing our safety evaluations platform. We call it "Inspect": gov.uk/government/new…

F. Güney (@ftm_guney) 's Twitter Profile Photo

we’ve got multiple PhD and postdoc positions funded by my #ERCstg project ENSURE. if you’re interested in computer vision and self-driving, please consider applying. graduate students: apply ASAP! details at gsse.ku.edu.tr postdocs: send me an email with your CV and

valeo.ai (@valeoai) 's Twitter Profile Photo

📢We introduce the ScaLR models (code+checkpoints) for LiDAR perception distilled from vision foundation models tl;dr: don’t neglect the choice of teacher, student, and pretraining datasets -> their impact is probably more important than the distillation method #CVPR2024 🧵 [1/8]

📢We introduce the ScaLR models (code+checkpoints) for LiDAR perception distilled from vision foundation models
tl;dr: don’t neglect the choice of teacher, student, and pretraining datasets -> their impact is probably more important than the distillation method #CVPR2024
🧵
[1/8]
Amir Zamir (@zamir_ar) 's Twitter Profile Photo

We are releasing 4M-21 with a permissive license, including its source code and trained models. It's a pretty effective multimodal model that solves 10s of tasks & modalities. See the demo code, sample results, and the tokenizers of diverse modalities on the website. IMO, the

Patrick Pérez (@ptrkprz) 's Twitter Profile Photo

It feels so good to have shared at last what we have been up to in the past 6 months. We worked hard on this unique voice AI, carefully training it on a mix of text and speech, making it multi-stream and real-time, and putting it in an online demo for everyone to experience it.

Patrick Pérez (@ptrkprz) 's Twitter Profile Photo

Thanks Thomas Wolf Moshi experimental voice AI is indeed a crazy adventure / a radical innovation / a new technology / a surprising experience / a research prototype / a shared resource / a starting point…. not a productized conversational bot.

Patrick Pérez (@ptrkprz) 's Twitter Profile Photo

The attentive listener will notice that even when speaking over Alex, Moshi still listens (taking into account the "in space" instruction for the second poem)

Andrej Karpathy (@karpathy) 's Twitter Profile Photo

Moshi is a very nice/fun conversational AI audio 🔊 model release from kyutai . Are you slowly losing faith in the objective reality and existence of Advanced Voice Mode? Talk to Moshi instead :) You can talk to it on their website: moshi.chat Or even locally

Alexandre Défossez (@honualx) 's Twitter Profile Photo

I’ll be presenting a deep dive into how Moshi works at the next NLP Meetup in Paris, this Wednesday the 9th at 7pm. Register if you want to attend ! 🧩🔎🟢 meetup.com/fr-FR/paris-nl…