Ben Hoover (@ben_hoov)'s Twitter Profile
Ben Hoover

@ben_hoov

AI Visualization & (re)Interpretability Researcher @IBMResearch @GeorgiaTech

ID: 960964042197217280

Website: http://bhoov.com · Joined: 06-02-2018 19:50:56

194 Tweets

765 Followers

320 Following

Dmitry Krotov (@dimakrotov)

I am super excited to announce the call for papers for the New Frontiers in Associative Memories workshop at ICLR 2025. New architectures and algorithms, memory-augmented LLMs, energy-based models, Hopfield networks, associative memory and diffusion, and many other exciting…

Julia Kempe (@kempelab)

Submit to our New Frontiers in Associative Memories workshop at ICLR 2026. New architectures & algorithms, memory-augmented LLMs, energy-based models, Hopfield networks, associative memory & diffusion… nfam.vizhub.ai openreview.net/group?id=ICLR.… Organizing with Dmitry Krotov et al.

John Hopfield (@hopfieldjohn)

I very much enjoyed reading the papers from the first iteration of this workshop in 2023. If you are working on associative memory, consider submitting your work and participating in this event.

Dmitry Krotov (@dimakrotov)

Now that ICML papers are submitted and we are in the midst of discussions on whether scaling is enough or new architectural/algorithmic ideas are needed, what better time to submit your best work to our workshop on New Frontiers in Associative Memory at ICLR 2026?

Alec Helbling (@alec_helbling)

Gradient descent alone tends to get stuck in local minima. Momentum frames optimization as a ball with mass moving down a hill. By adding inertia, the ball resists settling in small basins, making it more likely to reach a deeper (possibly global) minimum.
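
A minimal sketch of that picture in code, assuming a toy 1-D loss with one shallow and one deep basin; the loss function, learning rate, and momentum coefficient below are illustrative choices, not from the tweet:

```python
def loss_grad(x):
    # Toy non-convex loss f(x) = x^4 - 3x^2 + x, which has a shallow local
    # basin near x ~ 1.12 and a deeper one near x ~ -1.30.
    return 4 * x**3 - 6 * x + 1

def momentum_descent(x0, lr=0.01, beta=0.9, steps=500):
    x, v = x0, 0.0
    for _ in range(steps):
        v = beta * v - lr * loss_grad(x)  # inertia: old velocity persists
        x = x + v                         # the "ball" can coast through shallow dips
    return x

print(momentum_descent(x0=1.5))  # with enough inertia it can roll past the local basin
```

With beta = 0 the update reduces to plain gradient descent, which settles in whichever basin it starts above; the velocity term is what carries the iterate through small dips.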

Alec Helbling (@alec_helbling)

Introducing ConceptAttention, an approach to interpreting diffusion transformer models! Write a prompt, choose some concepts, generate an image, and get high-quality heatmaps of text concepts. Our method outperforms existing approaches like raw cross-attention maps. Link to demo 👇

Alec Helbling (@alec_helbling)

Create heatmaps that localize text concepts in generated videos. We discovered that our approach, ConceptAttention, can be directly extended from image generation to video generation models! It's amazing how simple techniques often generalize way better than more complex ones.

Alec Helbling (@alec_helbling)

Diffusion models leverage a variety of samplers. Deterministic methods like DDIM produce orderly paths. In contrast, stochastic samplers like DDPM produce chaotic trajectories. Despite their differences, both methods draw valid samples from the underlying distribution.
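
For concreteness, here is a hedged sketch of one reverse step under each sampler, using a standard linear beta schedule and a placeholder noise-prediction network; `eps_model` stands in for a trained model, and nothing here is specific to the animation in the tweet:

```python
import numpy as np

rng = np.random.default_rng(0)
T = 1000
betas = np.linspace(1e-4, 0.02, T)   # common linear noise schedule
alphas = 1.0 - betas
alpha_bars = np.cumprod(alphas)

def eps_model(x, t):
    # Placeholder for a trained noise-prediction network eps_theta(x_t, t).
    return np.zeros_like(x)

def ddim_step(x, t):
    # Deterministic update (eta = 0): no noise is injected, so repeated
    # steps trace an orderly path. Requires t >= 1.
    eps = eps_model(x, t)
    x0 = (x - np.sqrt(1 - alpha_bars[t]) * eps) / np.sqrt(alpha_bars[t])
    return np.sqrt(alpha_bars[t - 1]) * x0 + np.sqrt(1 - alpha_bars[t - 1]) * eps

def ddpm_step(x, t):
    # Stochastic update: fresh Gaussian noise each step makes the
    # trajectory jittery, yet the marginals still match the target.
    eps = eps_model(x, t)
    mean = (x - betas[t] / np.sqrt(1 - alpha_bars[t]) * eps) / np.sqrt(alphas[t])
    return mean + np.sqrt(betas[t]) * rng.standard_normal(x.shape)

x = rng.standard_normal(2)
for t in range(T - 1, 0, -1):
    x = ddim_step(x, t)   # swap in ddpm_step to see the chaotic variant
```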

Alec Helbling (@alec_helbling)

One of the simplest algorithms for sampling from a probability distribution is Random Walk Metropolis-Hastings. It proposes new samples by taking Gaussian-distributed steps, accepting or rejecting them to maintain the target distribution. I call this pdf the "fidget spinner".
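
A compact sketch of the algorithm as described, with a standard 2-D Gaussian standing in for the target (the "fidget spinner" density itself isn't reproduced here):

```python
import numpy as np

def random_walk_metropolis(log_p, x0, step=0.5, n=10_000, seed=0):
    """Random Walk Metropolis-Hastings with a Gaussian proposal."""
    rng = np.random.default_rng(seed)
    x = np.asarray(x0, dtype=float)
    samples = []
    for _ in range(n):
        proposal = x + step * rng.standard_normal(x.shape)  # Gaussian step
        # The proposal is symmetric, so the acceptance ratio is p(x') / p(x).
        if np.log(rng.uniform()) < log_p(proposal) - log_p(x):
            x = proposal  # accept; otherwise keep the current point
        samples.append(x.copy())
    return np.array(samples)

# Illustrative target: a standard 2-D Gaussian.
log_p = lambda x: -0.5 * np.sum(x**2)
draws = random_walk_metropolis(log_p, x0=np.zeros(2))
print(draws.mean(axis=0), draws.std(axis=0))  # roughly 0 mean, unit std
```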

Alec Helbling (@alec_helbling)

Hamiltonian Monte Carlo (HMC) frames sampling from a probability distribution as simulating the dynamics of a physical system. Samples are expressed as particles whose trajectories are updated following Hamilton's equations based on the structure of the target distribution.
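
A sketch of a single HMC transition with leapfrog integration of Hamilton's equations, again using a standard Gaussian as an illustrative target; the step size `eps` and path length `L` are arbitrary choices here:

```python
import numpy as np

def hmc_step(x, log_p, log_p_grad, eps=0.1, L=20, rng=np.random.default_rng()):
    """One HMC transition: momentum resampling, leapfrog, Metropolis check."""
    p = rng.standard_normal(x.shape)  # sample momentum (unit mass matrix)
    x_new, p_new = x.copy(), p.copy()
    # Leapfrog integration of Hamilton's equations.
    p_new += 0.5 * eps * log_p_grad(x_new)
    for _ in range(L - 1):
        x_new += eps * p_new
        p_new += eps * log_p_grad(x_new)
    x_new += eps * p_new
    p_new += 0.5 * eps * log_p_grad(x_new)
    # Accept/reject to correct integration error; H = -log p(x) + |p|^2 / 2.
    d_h = (log_p(x_new) - 0.5 * p_new @ p_new) - (log_p(x) - 0.5 * p @ p)
    return x_new if np.log(rng.uniform()) < d_h else x

# Illustrative target: standard Gaussian, so grad log p(x) = -x.
x = np.zeros(2)
for _ in range(1000):
    x = hmc_step(x, log_p=lambda v: -0.5 * v @ v, log_p_grad=lambda v: -v)
```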

Alec Helbling (@alec_helbling)

Random walks aim to explore a distribution through random perturbations, which often wander into low-density regions and waste computation. Hamiltonian MC sails through a distribution much more rapidly by incorporating the distribution's structure into its proposals.

Alec Helbling (@alec_helbling)

Our work ConceptAttention was accepted to ICML 2025 as a Spotlight Poster ("top" 2.6% of submissions)! ConceptAttention creates rich saliency maps of text concepts present in generated images and videos. It requires no additional training, only repurposing existing parameters.

Alec Helbling (@alec_helbling)

Flow Matching aims to learn a "flow" that transforms a simple source distribution (e.g. Gaussian) to an arbitrarily complex target distribution. This video shows the evolution of the marginal probability path as a source distribution is transformed to a target distribution.
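
A sketch of how the regression targets for such a flow are commonly built, assuming linear interpolation paths (the conditional flow matching recipe); the toy target distribution below is illustrative and the neural velocity field itself is omitted:

```python
import numpy as np

# Linear paths: x_t = (1 - t) * x0 + t * x1, with x0 drawn from the source
# (Gaussian) and x1 from the target. The velocity to regress is x1 - x0.

def sample_target(n, rng):
    # Illustrative target: a mixture of two 1-D Gaussians.
    centers = rng.choice([-2.0, 2.0], size=(n, 1))
    return centers + 0.3 * rng.standard_normal((n, 1))

def flow_matching_batch(n=256, rng=np.random.default_rng(0)):
    x0 = rng.standard_normal((n, 1))  # source samples
    x1 = sample_target(n, rng)        # target samples
    t = rng.uniform(size=(n, 1))      # random times in [0, 1]
    xt = (1 - t) * x0 + t * x1        # point on the marginal probability path
    ut = x1 - x0                      # conditional velocity target
    return xt, t, ut                  # train v_theta(xt, t) to match ut

xt, t, ut = flow_matching_batch()
```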

Alec Helbling (@alec_helbling)

I've been putting together an interactive tool called DiffusionLab for explaining the geometric intuition behind diffusion and flow-based generative models. Sampling actually runs in the browser using TensorFlow.js! It is still in the very early stages.

Alec Helbling (@alec_helbling)

I made a tool called Diffusion Explorer that lets you train and visualize simple 2D diffusion and flow models live in the browser. You can draw your own distributions and watch the generated samples converge during training. Try it live 👇

Anthony Peng (@realanthonypeng)

Guardrail models like 🛑 Llama Guard do more than filter: we repurpose them to track how safety risk evolves 📉 through a response. This gives rise to the STAR ⭐ score, a fine-grained signal for finetuning LLMs more safely 🤖🔒

Curious how it works? More in the thread 👇
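
The thread's actual recipe isn't reproduced here, but purely as an illustration of "tracking how risk evolves through a response", a prefix-scoring sketch might look like the following; `risk_of_prefix` is a made-up stand-in for a guardrail classifier, not Llama Guard's real API:

```python
def star_style_scores(response: str, risk_of_prefix) -> list[float]:
    """Score growing prefixes of a response to see where risk rises or falls.

    `risk_of_prefix` is a hypothetical callable mapping text to a risk
    probability in [0, 1], standing in for a guardrail model.
    """
    tokens = response.split()
    prefixes = [" ".join(tokens[: i + 1]) for i in range(len(tokens))]
    scores = [risk_of_prefix(p) for p in prefixes]
    # Per-step deltas give a fine-grained signal along the response.
    return [scores[0]] + [b - a for a, b in zip(scores, scores[1:])]

# Dummy scorer for demonstration only.
print(star_style_scores("step one step two", lambda s: min(1.0, len(s) / 20)))
```
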
Anthony Peng (@realanthonypeng)

🚨 Sharing our new #ACL2025NLP main paper!
🎥 Deploying video VLMs at scale? Inference compute is your bottleneck.

We study how to optimally allocate inference FLOPs across LLM size, frame count, and visual tokens.
💡 Large-scale training sweeps (~100k A100 hrs)
📊 Parametric
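
As a rough illustration of why this allocation matters, the common estimate of about 2 x parameters x tokens FLOPs per forward pass shows how LLM size, frame count, and visual tokens trade off; all numbers below are illustrative, and this is not the paper's cost model:

```python
def inference_tflops(params_b: float, frames: int, tokens_per_frame: int,
                     text_tokens: int = 100) -> float:
    """Very rough forward-pass cost: ~2 * parameters * total tokens."""
    total_tokens = frames * tokens_per_frame + text_tokens
    return 2 * params_b * 1e9 * total_tokens / 1e12

# Comparable budgets, different allocations: bigger LLM vs. more frames.
print(inference_tflops(params_b=7, frames=8, tokens_per_frame=256))   # ~30.1 TFLOPs
print(inference_tflops(params_b=1, frames=64, tokens_per_frame=256))  # ~33.0 TFLOPs
```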