Michael Noukhovitch, gonna be @ICLR 2025 (@mnoukhov) 's Twitter Profile
Michael Noukhovitch, gonna be @ICLR 2025

@mnoukhov

PhD Student in AI at @Mila_Quebec making LMs better, previously @FacebookAI @ServiceNowRSRCH and software eng @UWaterloo

also @mnoukhov.bsky.social

ID: 4656943452

linkhttp://mnoukhov.github.io calendar_today26-12-2015 19:37:07

163 Tweet

930 Followers

292 Following

Xi (Nicole) Zhang (@nzhang211) 's Twitter Profile Photo

🚨 New paper alert! [NeurIPS 2024 spotlight] 🚨 Trajectory Flow Matching with Applications to Clinical Time Series Modeling ⏳📈 With: Yuan Pu , Yuki Kawamura 川村祐貴 , Andrew Loza, Yoshua Bengio, Dennis Shung, Alex Tong 💻: github.com/nZhangx/Trajec… 📄: arxiv.org/abs/2410.21154 🧵👇

🚨 New paper alert! [NeurIPS 2024 spotlight] 🚨
Trajectory Flow Matching with Applications to Clinical Time Series Modeling ⏳📈
With: <a href="/yuanpu__/">Yuan Pu</a> , <a href="/YukiKawamura_/">Yuki Kawamura 川村祐貴</a> , Andrew Loza, <a href="/Yoshua_Bengio/">Yoshua Bengio</a>, <a href="/dlshung/">Dennis Shung</a>, <a href="/AlexanderTong7/">Alex Tong</a>

💻: github.com/nZhangx/Trajec…
📄: arxiv.org/abs/2410.21154
🧵👇
Costa Huang (@vwxyzjn) 's Twitter Profile Photo

🚀 Happy to share Tülu 3! We trained the model with actual RL: the model only receives rewards if its generations are verified to be correct (e.g., correct math solution). ❤️ Check out our beautiful RL curves. Code is also available: ~single file PPO that scales to 70B.

🚀 Happy to share Tülu 3!  We trained the model with actual RL: the model only receives rewards if its generations are verified to be correct (e.g., correct math solution). 

❤️ Check out our beautiful RL curves. Code is also available: ~single file PPO that scales to 70B.
Sara Vera Marjanović (@saraveramarjano) 's Twitter Profile Photo

Models like DeepSeek-R1 🐋 mark a fundamental shift in how LLMs approach complex problems. In our preprint on R1 Thoughtology, we study R1’s reasoning chains across a variety of tasks; investigating its capabilities, limitations, and behaviour. 🔗: mcgill-nlp.github.io/thoughtology/

Models like DeepSeek-R1 🐋 mark a fundamental shift in how LLMs approach complex problems. In our preprint on R1 Thoughtology, we study R1’s reasoning chains across a variety of tasks; investigating its capabilities, limitations, and behaviour.
🔗: mcgill-nlp.github.io/thoughtology/
Samuel Lavoie (@lavoiems) 's Twitter Profile Photo

The code and model weights for this paper are finally open! Despite being a little late for releasing them, I hope you will find them useful! Code: github.com/facebookresear… Models: - (ViT-G): huggingface.co/lavoies/llip-v… - (ViT-B): huggingface.co/lavoies/llip-v…

Michael Noukhovitch, gonna be @ICLR 2025 (@mnoukhov) 's Twitter Profile Photo

Open sourcing code and models from industry is the invisible labour that actually creates progress in the field. It's a massive effort and requires authors to overcomes so many logistical and administrative hurdles. This stuff deserves more recognition!

Samuel Lavoie (@lavoiems) 's Twitter Profile Photo

🧵 Everyone is chasing new diffusion models—but what about the representations they model from? We introduce Discrete Latent Codes (DLCs): - Discrete representation for diffusion models - Uncond. gen. SOTA FID (1.59 on ImageNet) - Compositional generation - Integrates with LLM 🧱

🧵 Everyone is chasing new diffusion models—but what about the representations they model from?
We introduce Discrete Latent Codes (DLCs):
- Discrete representation for diffusion models
- Uncond. gen. SOTA FID (1.59 on ImageNet)
- Compositional generation
- Integrates with LLM
🧱
Samuel Lavoie (@lavoiems) 's Twitter Profile Photo

LLMs can speak in DLC! We fine-tune a language model to sample DLC tokens from text, giving us a pipeline: Text → DLC → Image This also enables generation beyond ImageNet.

LLMs can speak in DLC!

We fine-tune a language model to sample DLC tokens from text, giving us a pipeline:
Text → DLC → Image
This also enables generation beyond ImageNet.
Michael Noukhovitch, gonna be @ICLR 2025 (@mnoukhov) 's Twitter Profile Photo

We're releasing a cool paper! DLCs are image tokens that enable better diffusion modelling. For now, we show this is the right representation. But in the future, this can allow LLMs to "speak in images"🤯to enable visual reasoning and more powerful text-image generalization. ⬇️