
Danilo J. Rezende
@danilojrezende
Head of AI Research @ EIT | ex-Director @ DeepMind | Building models to accelerate fundamental sciences and medicine. Opinions my own.
ID: 797433864
https://danilorezende.com/ 02-09-2012 03:44:53
3.3K Tweets
35.35K Followers
1.1K Following

It’s done because it’s much easier to 1) collect, 2) evaluate, and 3) beat and make progress on. We’re going to see every task that is served neatly packaged on a platter like this improved (including those that need PhD-grade expertise). But jobs (even intern-level) that need

This is interesting as a first large diffusion-based LLM. Most of the LLMs you've been seeing are ~clones as far as the core modeling approach goes. They're all trained "autoregressively", i.e. predicting tokens from left to right. Diffusion is different - it doesn't go left to
