Dimitri von Rütte (@dvruette) 's Twitter Profile
Dimitri von Rütte

@dvruette

PhD @ETH_en, prev. Machine Learning @DeepJudgeAI

ID: 1617468796104581122

calendar_today23-01-2023 10:26:34

939 Tweet

1,1K Followers

258 Following

Dimitri von Rütte (@dvruette) 's Twitter Profile Photo

Happy to share that GIDD was accepted at ICML 2025! 🥳 See our thread to learn how we added self-correction capabilities to discrete diffusion models with a simple change in the noise schedule 👇

Dimitri von Rütte (@dvruette) 's Twitter Profile Photo

The switch from a hard to a soft reward going from o1 to o3 really shows when giving it math tasks. If it cannot solve the task, o3 will just make up some bullshit that tricks the verifier but is actually wrong. This never happened with o1.

Dimitri von Rütte (@dvruette) 's Twitter Profile Photo

Huge step forward for non-AR models! Breaking away from the fixed sequence length and has been a big challenge for discrete diffusion/flow models due to the position-independence assumption. Only a question of time until AR LLMs become obsolete? 👀

Simone Scardapane (@s_scardapane) 's Twitter Profile Photo

*Generalized Interpolating Discrete Diffusion* by Dimitri von Rütte Antonio Orvieto & al. A class of discrete diffusion models combining standard masking with uniform noise to allow the model to potentially "correct" previously wrong tokens. arxiv.org/abs/2503.04482

*Generalized Interpolating Discrete Diffusion*
by <a href="/dvruette/">Dimitri von Rütte</a> <a href="/orvieto_antonio/">Antonio Orvieto</a> &amp; al.

A class of discrete diffusion models combining standard masking with uniform noise to allow the model to potentially "correct" previously wrong tokens.

arxiv.org/abs/2503.04482