Michael Noukhovitch, gonna be @ICLR 2025 (@mnoukhov) Twitter Tweets • TwiCopy

Michael Noukhovitch, gonna be @ICLR 2025

@mnoukhov

+ Follow

PhD Student in AI at @Mila_Quebec making LMs better, previously @FacebookAI @ServiceNowRSRCH and software eng @UWaterloo

also @mnoukhov.bsky.social

ID: 4656943452

linkhttp://mnoukhov.github.io calendar_today26-12-2015 19:37:07

163 Tweet

930 Followers

292 Following

Costa Huang

@vwxyzjn

a year ago

Michael Noukhovitch This is a simple gist demonstrating how async RL for LLM works: gist.github.com/vwxyzjn/473b8b…

thumb_up_off_alt11

chat_bubble_outline0

repeat1

shareShare

🚨 New paper alert! [NeurIPS 2024 spotlight] 🚨 Trajectory Flow Matching with Applications to Clinical Time Series Modeling ⏳📈 With: Yuan Pu , Yuki Kawamura 川村祐貴 , Andrew Loza, Yoshua Bengio, Dennis Shung, Alex Tong 💻: github.com/nZhangx/Trajec… 📄: arxiv.org/abs/2410.21154 🧵👇

🚨 New paper alert! [NeurIPS 2024 spotlight] 🚨
Trajectory Flow Matching with Applications to Clinical Time Series Modeling ⏳📈
With: <a href="/yuanpu__/">Yuan Pu</a> , <a href="/YukiKawamura_/">Yuki Kawamura 川村祐貴</a> , Andrew Loza, <a href="/Yoshua_Bengio/">Yoshua Bengio</a>, <a href="/dlshung/">Dennis Shung</a>, <a href="/AlexanderTong7/">Alex Tong</a>

💻: github.com/nZhangx/Trajec…
📄: arxiv.org/abs/2410.21154
🧵👇

thumb_up_off_alt233

chat_bubble_outline3

repeat52

shareShare

Michael Noukhovitch, gonna be @ICLR 2025

@mnoukhov

10 months ago

Big news for the little guy (me and my 48Gb gpus)

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

Costa Huang

@vwxyzjn

10 months ago

🚀 Happy to share Tülu 3! We trained the model with actual RL: the model only receives rewards if its generations are verified to be correct (e.g., correct math solution). ❤️ Check out our beautiful RL curves. Code is also available: ~single file PPO that scales to 70B.

thumb_up_off_alt486

chat_bubble_outline13

repeat79

shareShare

Sara Vera Marjanović

@saraveramarjano

5 months ago

Models like DeepSeek-R1 🐋 mark a fundamental shift in how LLMs approach complex problems. In our preprint on R1 Thoughtology, we study R1’s reasoning chains across a variety of tasks; investigating its capabilities, limitations, and behaviour. 🔗: mcgill-nlp.github.io/thoughtology/

thumb_up_off_alt227

chat_bubble_outline3

repeat62

shareShare

Kevin Farhat

@notkevinfarhat

2 months ago

The bottleneck in AI isn't just compute - it's access to diverse, high-quality data, much of which is locked away due to privacy, legal, or competitive concerns. What if there was a way to train better models collaboratively, without actually sharing your data? Introducing

thumb_up_off_alt254

chat_bubble_outline8

repeat46

shareShare

Samuel Lavoie

@lavoiems

2 months ago

The code and model weights for this paper are finally open! Despite being a little late for releasing them, I hope you will find them useful! Code: github.com/facebookresear… Models: - (ViT-G): huggingface.co/lavoies/llip-v… - (ViT-B): huggingface.co/lavoies/llip-v…

thumb_up_off_alt31

chat_bubble_outline0

repeat9

shareShare

Michael Noukhovitch, gonna be @ICLR 2025

@mnoukhov

2 months ago

Open sourcing code and models from industry is the invisible labour that actually creates progress in the field. It's a massive effort and requires authors to overcomes so many logistical and administrative hurdles. This stuff deserves more recognition!

thumb_up_off_alt17

chat_bubble_outline0

repeat1

shareShare

Samuel Lavoie

@lavoiems

2 months ago

🧵 Everyone is chasing new diffusion models—but what about the representations they model from? We introduce Discrete Latent Codes (DLCs): - Discrete representation for diffusion models - Uncond. gen. SOTA FID (1.59 on ImageNet) - Compositional generation - Integrates with LLM 🧱

thumb_up_off_alt284

chat_bubble_outline3

repeat43

shareShare

Samuel Lavoie

@lavoiems

2 months ago

LLMs can speak in DLC! We fine-tune a language model to sample DLC tokens from text, giving us a pipeline: Text → DLC → Image This also enables generation beyond ImageNet.

thumb_up_off_alt8

chat_bubble_outline1

repeat1

shareShare

Michael Noukhovitch, gonna be @ICLR 2025

@mnoukhov

2 months ago

We're releasing a cool paper! DLCs are image tokens that enable better diffusion modelling. For now, we show this is the right representation. But in the future, this can allow LLMs to "speak in images"🤯to enable visual reasoning and more powerful text-image generalization. ⬇️

thumb_up_off_alt20

chat_bubble_outline0

repeat3

shareShare

Michael Noukhovitch, gonna be @ICLR 2025

Costa Huang

Xi (Nicole) Zhang

Michael Noukhovitch, gonna be @ICLR 2025

Costa Huang

Sara Vera Marjanović

Kevin Farhat

Samuel Lavoie

Michael Noukhovitch, gonna be @ICLR 2025

Samuel Lavoie

Samuel Lavoie

Michael Noukhovitch, gonna be @ICLR 2025