Valentin De Bortoli (@valentindebort1) Twitter Tweets • TwiCopy

James Thornton

6 months ago

Small plug, not really advertised but we similarly showed how to perform temperature based control and composition of separately trained diffusion models via SMC and the Feynman Kac model formalism, with score distillation of the energy at AISTATS last year - Diversity control

thumb_up_off_alt145

chat_bubble_outline3

repeat23

shareShare

Adam Foster

@adamefoster

6 months ago

I am very happy to share Orbformer, a foundation model for wavefunctions using deep QMC that offers a route to tackle strongly correlated quantum states! arxiv.org/abs/2506.19960

thumb_up_off_alt89

chat_bubble_outline7

repeat29

shareShare

Quanquan Gu

@quanquangu

6 months ago

It’s not that people think calculus or math is useless in AI. They’re just tired of theory folks who never touch code, never scale a model, and still argue they’re solving problems in AI:) If theory becomes detached from practice, the world will treat it like noise and that’s on

thumb_up_off_alt237

chat_bubble_outline7

repeat22

shareShare

Peyman Milanfar

@docmilanfar

5 months ago

a stark demonstration that a Caltech PhD and partnership at a top-tier venture capital firm are no remedy for profound ignorance

thumb_up_off_alt1,1K

chat_bubble_outline25

repeat100

shareShare

Subham Sahoo

@ssahoo_

5 months ago

Attending ICML ✈️Tues-Fri to present "The Diffusion Duality" 🗓️Wed, July 16 @ 4:30pm 📍East Exhibition Hall A-B (E-3003) DM if you want to chat about diffusion LMs, or my current work on Duality or Esoteric LMs! x.com/ssahoo_/status…

thumb_up_off_alt157

chat_bubble_outline1

repeat17

shareShare

Arthur Gretton

@arthurgretton

5 months ago

Accelerated Diffusion Models via Speculative Sampling, at #icml25 ! at 16:30 Tuesday July 15 poster E-3012 arxiv.org/abs/2501.05370 Valentin De Bortoli Alexandre Galashov Arnaud Doucet

Accelerated Diffusion Models via Speculative Sampling, at #icml25 !

at 16:30 Tuesday July 15 poster E-3012

arxiv.org/abs/2501.05370

<a href="/ValentinDeBort1/">Valentin De Bortoli</a> <a href="/agalashov/">Alexandre Galashov</a> <a href="/ArnaudDoucet1/">Arnaud Doucet</a>

thumb_up_off_alt304

chat_bubble_outline2

repeat46

shareShare

Gautam Kamath

@thegautamkamath

5 months ago

There are many great researchers out there. But the ones that really stand out to me are the ones who are also kind, even when they don't need to be.

thumb_up_off_alt468

chat_bubble_outline7

repeat32

shareShare

James Thornton

@jamestthorn

5 months ago

I won’t be at ICML but check out accepted papers with folks from my former team at MLR - Projective Composition of Diffusion Models led by Arwen Bradley and Preetum Nakkiran icml.cc/virtual/2025/p…

thumb_up_off_alt27

chat_bubble_outline1

repeat3

shareShare

Molei Tao

@moleitaomath

5 months ago

If still around #ICML2025, plz consider checking out my collaborator Qing Qu 's Oral in the MemFM Workshop, 11am Sat West Meeting Room 223-224, on A Closer Look at Model Collapse (in diffusion model): From a Generalization-to-Memorization Perspective

thumb_up_off_alt23

chat_bubble_outline2

repeat4

shareShare

Rota

@pli_cachete

5 months ago

Terence Tao on the supposed Gold from OpenAI at IMO

thumb_up_off_alt5,5K

chat_bubble_outline86

repeat454

shareShare

Deepak Pathak

@pathak2206

5 months ago

Thrilled to finally release this study! 🚀 We view (discrete) diffusion models as implicitly doing data augmentation over autoregressive. Through this lens, we find that diffusion outperforms AR in data-constrained settings, but it requires larger models and way more epochs to

thumb_up_off_alt300

chat_bubble_outline7

repeat40

shareShare

Molei Tao

@moleitaomath

5 months ago

Interested in some foundation aspects? Waiting or unhappy about NeurIPS reviews? Plz consider NeurIPS workshop DynaFront: Dynamics at the Frontiers of Optimization, Sampling, and Games sites.google.com/view/dynafront… Yuejie Chi Andrea Montanari Taiji Suzuki Tatjana Chavdarova ++ Sponsor appreciated!

thumb_up_off_alt102

chat_bubble_outline2

repeat22

shareShare

Simo Ryu

@cloneofsimo

5 months ago

ReLU MLP with width / depth going to infinity. Note how different parameterization makes pathlogical scaling behavior (yellow / blue on activations / gradients of the weight). muP solves this.

thumb_up_off_alt299

chat_bubble_outline8

repeat23

shareShare

Simo Ryu

@cloneofsimo

5 months ago

Ive just learned so much from this playground today. For example, here is what happens if you do gradient descent on relu MLP, even with muP setup. This shows how model is optimizing on last layers for most part, once it gets it, early layers gets updated(meaningful signal

thumb_up_off_alt273

chat_bubble_outline5

repeat17

shareShare

Simo Ryu

@cloneofsimo

5 months ago

Datarater on CIFAR-10 dataset, on single-step meta gradient purely implemented on pytorch fair to say, it looks like Datarater works! High score images are much more "feature crisp", where some low score images often looks confusing

thumb_up_off_alt184

chat_bubble_outline8

repeat14

shareShare

Rob Cornish

@rob_cornish

5 months ago

I'm looking for talented and ambitious PhD students to join me at Nanyang Technological University Singapore to work on safe and robust AI systems! Full scholarships covering tuition and a stipend are available, and are open to local and international students alike.

thumb_up_off_alt77

chat_bubble_outline5

repeat17

shareShare

Ruiqi Gao

@ruiqigao

4 months ago

It feels so different when you can explore, interact and play with the world you generated, in real time! Check out #genie3. We are having so much fun. It unlocks minutes-long consistency, simply by generating next frame auto-regressively.

thumb_up_off_alt163

chat_bubble_outline11

repeat4

shareShare

Chris Lu

@_chris_lu_

4 months ago

To all my academic friends who gave me crap for joining OpenAI: We just open-sourced some banger models. Have fun with them!

thumb_up_off_alt2,2K

chat_bubble_outline49

repeat60

shareShare

Simo Ryu

@cloneofsimo

4 months ago

What is with these guys, they are printing out papers! They do this very interesting editing-forward-process that performs corruption more than absorption, to 'mimick' false sampling process of langugage model but they dont describe the sampling process afterwords so Im

thumb_up_off_alt169

chat_bubble_outline1

repeat9

shareShare

Aleksander Holynski

@holynski_

4 months ago

Something fun we discovered: you can use #Genie3 to step into and explore your favorite paintings. Here's a short visit to Edward Hopper's "Nighthawks".

thumb_up_off_alt7,7K

chat_bubble_outline410

repeat851

shareShare