Kirill Neklyudov (@k_neklyudov)'s Twitter Profile
Kirill Neklyudov

@k_neklyudov

Assistant Professor @UMontreal; Core Academic Member @Mila_Quebec

ID: 763831084924755968

Link: http://necludov.github.io · Joined: 11-08-2016 20:14:56

171 Tweets

1.1K Followers

376 Following

Patrick Kidger (@patrickkidger)'s Twitter Profile Photo

🔥 Time for my first bioML blog post! This one is for all the folks getting into ML-for-protein-design. ✨ "Just know stuff, proteinML edition" kidger.site/thoughts/just-… This is intended as a curriculum-with-context, as a starting point for the field. 1/2

Austin Cheng (@auhcheng)'s Twitter Profile Photo

Excited to share Quetzal, a simple but scalable model for building 3D molecules atom-by-atom. 🐉 Named after Quetzalcoatl, the Aztec god of creation. We equip a standard causal transformer with a per-atom diffusion MLP to model the continuous 3D position of the next atom. [1/3]
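The atom-by-atom loop can be sketched in miniature: a discrete head picks the next atom's type, then a small denoising loop refines its continuous 3D position from noise. This is a toy illustration of the interface, not Quetzal itself — `type_logits` and `denoise_step` are hypothetical stand-ins for the causal transformer head and the per-atom diffusion MLP.

```python
import numpy as np

rng = np.random.default_rng(0)

ATOM_TYPES = ["C", "N", "O", "H", "STOP"]  # hypothetical vocabulary

def type_logits(atoms):
    # Stand-in for the causal transformer's next-atom-type head;
    # the real model conditions on all previously placed atoms.
    return rng.normal(size=len(ATOM_TYPES))

def denoise_step(x_t, t, atoms):
    # Stand-in for the per-atom diffusion MLP: one reverse-diffusion
    # update of the candidate position x_t at noise level t. Here we
    # just shrink toward the centroid of the existing atoms.
    target = np.mean([p for _, p in atoms], axis=0) if atoms else np.zeros(3)
    return x_t + 0.3 * (target - x_t)

def generate(max_atoms=8, n_denoise=20):
    atoms = []  # list of (type, xyz)
    for _ in range(max_atoms):
        logits = type_logits(atoms)
        probs = np.exp(logits - logits.max())
        probs /= probs.sum()
        a = ATOM_TYPES[rng.choice(len(ATOM_TYPES), p=probs)]
        if a == "STOP":
            break
        x = rng.normal(size=3)            # position starts as pure noise
        for k in range(n_denoise):        # run the diffusion "MLP"
            x = denoise_step(x, 1 - k / n_denoise, atoms)
        atoms.append((a, x))
    return atoms

mol = generate()
```

The point of the design is that discreteness (atom type) and continuity (3D position) get separate heads while sharing one causal backbone.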

Daniel Severo (@_dsevero)'s Twitter Profile Photo

New work: a scalable way to learn distributions over permutations/rankings. The method can trade off compute and expressivity by varying the number of NFEs (i.e., unmasking more than one token at a time), and subsumes well-known families of models (e.g., the Mallows model). arxiv.org/abs/2505.24664

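The compute/expressivity trade-off via the number of NFEs can be sketched with a toy any-order unmasking sampler: each "forward pass" unmasks k positions of the permutation at once, so fewer passes means more entries predicted jointly per step. This is an illustration of the mechanism only — here a uniform draw stands in for the learned conditional.

```python
import numpy as np

rng = np.random.default_rng(0)

def sample_permutation(n, k):
    """Toy unmasking sampler for permutations (a sketch of the idea, not
    the paper's model). Each network call ("NFE") unmasks k positions at
    once; a real model would predict those entries jointly."""
    perm = [None] * n
    remaining = list(range(n))   # values not yet placed
    masked = list(range(n))      # positions not yet filled
    nfes = 0
    while masked:
        nfes += 1                # one forward pass per step
        batch = rng.choice(len(masked), size=min(k, len(masked)),
                           replace=False)
        positions = [masked[i] for i in batch]
        for pos in positions:
            # stand-in for the model's conditional over remaining values
            j = rng.integers(len(remaining))
            perm[pos] = remaining.pop(j)
            masked.remove(pos)
    return perm, nfes

perm, nfes = sample_permutation(10, k=3)  # 4 forward passes: ceil(10/3)
```

Setting k = 1 recovers fully sequential (most expressive, most NFEs) generation; k = n collapses to a single joint pass.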
Lorenz Richter @ICLR'25 (@lorenz_richter)'s Twitter Profile Photo

We derive policy gradients for reinforcement learning with random time horizons in arxiv.org/pdf/2506.00962. Although this is arguably a typical setting in applications, it has been largely overlooked in the literature. Our adjusted formulas offer significant numerical improvements.

Floor Eijkelboom (@feijkelboom)'s Twitter Profile Photo

Generative models excel at images and text, but tabular data remains a challenge. 🤔 We introduce 🐈 TabbyFlow 🐈, a variational flow matching approach with general exponential families for mixed-type tables. Work with Andrés Guzmán-Cordero & Jan-Willem van de Meent, accepted to #ICML2025 🎉 👇 1/n

Kirill Neklyudov (@k_neklyudov)'s Twitter Profile Photo

The supervision signal in AI4Science is so crisp that we can solve very complicated problems almost without any data or RL! In this project, we train a model to solve the Schrödinger equation for different molecular conformations using Density Functional Theory (DFT). In the

Ricky T. Q. Chen (@rickytqchen)'s Twitter Profile Photo

Padding in our non-AR sequence models? Yuck. 🙅 👉 Instead of unmasking, our new work *Edit Flows* performs iterative refinement via position-relative inserts and deletes, operations naturally suited to variable-length sequence generation. Easily better than using mask tokens.
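The insert/delete primitives naturally change sequence length, which masking cannot. A minimal sketch of what applying such edits looks like — this only illustrates the edit operations themselves, not the Edit Flows model, which learns *which* edits to propose at each refinement step:

```python
def apply_edits(seq, edits):
    """Apply position-relative insert/delete operations to a token list.
    Each edit is ("ins", i, tok) -> insert tok before index i,
    or ("del", i) -> delete the token at index i."""
    out = list(seq)
    # Apply right-to-left so earlier indices stay valid after each edit.
    for op in sorted(edits, key=lambda e: e[1], reverse=True):
        if op[0] == "ins":
            out.insert(op[1], op[2])
        elif op[0] == "del":
            del out[op[1]]
    return out

# "hlloX" -> "hello": insert 'e' at index 1, delete the trailing 'X'
print(apply_edits(list("hlloX"), [("ins", 1, "e"), ("del", 4)]))
# → ['h', 'e', 'l', 'l', 'o']
```

Note how one refinement pass can simultaneously grow and shrink the sequence, with no padding or mask tokens anywhere.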

Acceleration Consortium (AC) (@acceleration_c)'s Twitter Profile Photo

We're spotlighting #WomenInSTEM and their inspiring journeys! Meet Marta Skreta, a Computer Science PhD student at the University of Toronto. Video created by University of Toronto Biomedical Engineering students Meghan + Ana-Maria Oproescu, with support from Helen Tran and the AC's EDI Initiate Grant. 🎥 youtube.com/watch?v=h2uRpm…

Alex Tong (@alexandertong7)'s Twitter Profile Photo

Check out FKCs! A principled, flexible approach to diffusion sampling. I was surprised how well it scaled to high dimensions given its reliance on importance reweighting. Thanks to great collaborators at Mila - Institut québécois d'IA, the Vector Institute, Imperial College London, and Google DeepMind. Thread 👇🧵

Microsoft Research (@msftresearch)'s Twitter Profile Photo

Microsoft researchers achieved a breakthrough in the accuracy of DFT, a method for predicting the properties of molecules and materials, by using deep learning. This work can lead to better batteries, green fertilizers, precision drug discovery, and more. msft.it/6011SQwKX

Rianne van den Berg (@vdbergrianne)'s Twitter Profile Photo

🚀 After two+ years of intense research, we’re thrilled to introduce Skala — a scalable deep learning density functional that hits chemical accuracy on atomization energies and matches hybrid-level accuracy on main group chemistry — all at the cost of semi-local DFT. ⚛️🔥🧪🧬

Rob Brekelmans (@brekelmaniac)'s Twitter Profile Photo

Given q_t, r_t as diffusion model(s), an SDE with drift β ∇ log q_t + α ∇ log r_t doesn't sample the sequence of geometric-average/product/tempered marginals! To correct this, we derive an SMC scheme via a PDE perspective. Resampling weights are 'free': they depend only on the (exact) scores!
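The correction-by-reweighting idea can be checked in one dimension at a single fixed time t, where the geometric average of two unit-variance Gaussians is available in closed form. This is a toy self-normalized importance sampling + resampling check, not the paper's SMC scheme; the means m_q, m_r and the weights α, β are made-up values for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Two 1-D unit-variance Gaussians standing in for the two models'
# marginals q_t, r_t at one fixed time t.
m_q, m_r = -2.0, 2.0
beta = alpha = 0.5            # geometric-average weights (beta + alpha = 1)

# For unit-variance Gaussians, the normalized geometric average
# q^beta * r^alpha is again Gaussian, with mean beta*m_q + alpha*m_r.
target_mean = beta * m_q + alpha * m_r          # = 0.0 here

# Self-normalized importance sampling: propose from q, reweight by
# (r/q)^alpha. The log-weights depend only on the two exact log-densities.
x = rng.normal(m_q, 1.0, size=200_000)
log_w = alpha * (-(x - m_r) ** 2 / 2) - alpha * (-(x - m_q) ** 2 / 2)
w = np.exp(log_w - log_w.max())
w /= w.sum()

# Multinomial resampling, as in an SMC step.
idx = rng.choice(len(x), size=len(x), p=w)
est_mean = x[idx].mean()      # should land near target_mean
```

Naively averaging the two scores and sampling without these weights would instead concentrate near the proposal q, which is the mismatch the tweet is pointing at.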

Max Zhdanov (@maxxxzdn)'s Twitter Profile Photo

🤹 New blog post! I write about our recent work on using hierarchical trees to enable sparse attention over irregular data (point clouds, meshes): the Erwin Transformer. blog: maxxxzdn.github.io/blog/erwin/ paper: arxiv.org/abs/2502.17019 Compressed version in the thread below:

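The core trick of tree-based sparse attention over irregular data can be sketched as: recursively split the point cloud into balanced leaves, then run full attention only within each leaf, dropping the cost from O(n²) to roughly O(n · leaf_size). A minimal numpy sketch of that idea — Erwin's actual ball tree and cross-level attention are more involved, and the untrained Q = K = V here is purely illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)

def build_leaves(idx, pts, leaf_size):
    """Recursively median-split points along their widest axis into
    balanced leaves (a ball-tree-style sketch)."""
    if len(idx) <= leaf_size:
        return [idx]
    axis = int(np.argmax(np.ptp(pts[idx], axis=0)))   # widest coordinate
    order = idx[np.argsort(pts[idx, axis])]
    mid = len(order) // 2
    return (build_leaves(order[:mid], pts, leaf_size)
            + build_leaves(order[mid:], pts, leaf_size))

def leaf_attention(x, leaves):
    """Softmax self-attention restricted to each leaf: O(n * leaf_size)
    instead of O(n^2). x: (n, d) features."""
    out = np.zeros_like(x)
    for leaf in leaves:
        q = k = v = x[leaf]                 # untrained stand-in: Q = K = V
        scores = q @ k.T / np.sqrt(x.shape[1])
        a = np.exp(scores - scores.max(axis=1, keepdims=True))
        a /= a.sum(axis=1, keepdims=True)
        out[leaf] = a @ v
    return out

pts = rng.normal(size=(64, 3))
leaves = build_leaves(np.arange(64), pts, leaf_size=8)
y = leaf_attention(pts, leaves)
```

Because the splits follow the data's own geometry, nearby points tend to share a leaf, so locality is preserved without any regular grid.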
Kirill Neklyudov (@k_neklyudov)'s Twitter Profile Photo

This work is exemplary! James and his coauthors took a direction that most researchers wouldn't call shiny. They gave the idea a full shot and came out with a beautiful study. They pushed the boundaries of our understanding of energy-based models miles further!