Virginia Smith (@gingsmith)'s Twitter Profile
Virginia Smith

@gingsmith

ML Professor @ CMU
🦋 gingsmith

ID: 451045487

Website: https://www.cs.cmu.edu/~smithv/
Joined: 31-12-2011 00:00:49

31 Tweets

660 Followers

21 Following

ML@CMU (@mlcmublog):

blog.ml.cmu.edu/2025/01/08/opt… How can we train LLMs to solve complex challenges beyond just data scaling? In a new blog post, Amrith Setlur, Yuxiao Qu, Matthew Yang, Lunjun Zhang, Virginia Smith, and Aviral Kumar demonstrate that Meta RL can help LLMs better optimize test-time compute.

Amrith Setlur (@setlur_amrith):

How to effectively unlearn finetuning data?
❌ Approx. methods leak sensitive data
✅ Exact unlearning (e.g., retraining) is secure 🔒 but inefficient

🚨 New paper: *efficient* & *exact* unlearning (led by Kevin)
🗝️ Idea: model merging at scale
arxiv.org/pdf/2504.04626
🧵⤵️
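
For intuition, here is a minimal sketch of one way "exact unlearning via model merging" can be realized, in the spirit of SISA-style sharded training: fine-tune independent copies of the base model on disjoint data shards, merge them by parameter averaging, and on a deletion request retrain only the affected shard before re-merging. The function names, trainer stub, and averaging merge rule are illustrative assumptions, not the recipe from the linked paper.

```python
# Illustrative sketch only; the shard layout, trainer stub, and
# parameter-averaging merge are assumptions for exposition, not the
# method of arxiv.org/pdf/2504.04626.
import copy
import torch

def finetune_on_shard(base_model, shard):
    """Fine-tune a fresh copy of the base model on one data shard (stub)."""
    model = copy.deepcopy(base_model)
    # ... a standard fine-tuning loop over `shard` would go here ...
    return model

def merge(models):
    """Merge shard models by parameter averaging (one simple merge rule)."""
    merged = copy.deepcopy(models[0])
    with torch.no_grad():
        for p_out, *ps in zip(merged.parameters(),
                              *[m.parameters() for m in models]):
            p_out.copy_(torch.stack(list(ps)).mean(dim=0))
    return merged

def delete_and_remerge(base_model, shards, shard_models, shard_idx):
    """Exact unlearning: the deleted examples live in exactly one shard, so
    only that shard model is retrained from the base weights, then all shard
    models are re-merged. No other shard model ever saw the deleted data."""
    shard_models[shard_idx] = finetune_on_shard(base_model, shards[shard_idx])
    return merge(shard_models)
```

The appeal over full retraining is cost: a deletion touches one shard's fine-tuning run plus a cheap merge, rather than a pass over the entire fine-tuning set.
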
Andrew Gordon Wilson (@andrewgwils):

The ICML 2025 workshops list is online! icml.cc/virtual/2025/e…. Many exciting topics, spanning multi-agent systems, world models, test-time adaptation, actionable interpretability, and much more.

ML@CMU (@mlcmublog):

blog.ml.cmu.edu/2025/04/18/llm… 📈⚠️ Is your LLM unlearning benchmark measuring what you think it is? In a new blog post authored by Pratiksha Thaker, Shengyuan Hu, Neil Kale, Yash Maurya, Steven Wu, and Virginia Smith, we discuss why empirical benchmarks are necessary but not sufficient.

Aashiq Muhamed (@aashiqmuhamed):

Thrilled to share our new work on improving LLM unlearning! 🚀
Gradient-based unlearning methods struggle with high cost, instability & lack of precision.
We introduce Dynamic SAE Guardrails (DSG): an activation-based approach using SAEs for targeted, efficient knowledge removal.
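
For a rough picture of what an activation-based SAE guardrail can look like, here is a minimal sketch: encode a hidden state with a sparse autoencoder, and when latents associated with the target knowledge fire above a threshold, zero them before decoding back into the residual stream. The threshold gate and hard zeroing are assumptions for exposition; DSG's actual gating logic is in the paper.

```python
# Minimal sketch of an SAE-based guardrail; the gating rule here is an
# assumption, not DSG's exact mechanism.
import torch

def dynamic_sae_guardrail(hidden, W_enc, b_enc, W_dec, b_dec,
                          flagged, threshold):
    """Encode a hidden state with a sparse autoencoder; if the latents tied
    to the target knowledge fire above `threshold`, zero them out before
    decoding back into the residual stream."""
    z = torch.relu(hidden @ W_enc + b_enc)               # sparse latent codes
    firing = z[..., flagged].sum(dim=-1, keepdim=True) > threshold
    z[..., flagged] = torch.where(firing,
                                  torch.zeros_like(z[..., flagged]),
                                  z[..., flagged])
    return z @ W_dec + b_dec                             # reconstructed hidden
```

The "dynamic" part is the gate: inputs that never activate the flagged latents pass through essentially unchanged, so unrelated capabilities are left intact.
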
ICML Conference (@icmlconf):

Invited talks are announced: icml.cc/virtual/2025/e… Jon Kleinberg, Pamela Samuelson, Frauke Kreuter, Anca Dragan, and Andreas Krause.

ML@CMU (@mlcmublog):

blog.ml.cmu.edu/2025/05/22/unl… Are your LLMs truly forgetting unwanted data? In this new blog post authored by Shengyuan Hu, Yiwei Fu, Steven Wu, and Virginia Smith, we discuss how benign relearning can jog an unlearned LLM's memory and recover knowledge that was supposed to be forgotten.
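
As a sketch of how one might test for this effect, the hypothetical probe below fine-tunes an unlearned model on benign, topically related text (containing none of the forget data) and measures whether supposedly forgotten answers resurface. `finetune` and `answer` are caller-supplied stand-ins, not an API from the blog post.

```python
# Hypothetical benign-relearning probe; `finetune` and `answer` are
# stand-in callables supplied by the experimenter.

def relearning_probe(unlearned_model, benign_related_texts, forgotten_qa,
                     finetune, answer):
    """Return accuracy on the forgotten QA pairs before and after benign
    fine-tuning; a large jump suggests the knowledge was suppressed rather
    than removed."""
    def accuracy(model):
        hits = sum(answer(model, q) == a for q, a in forgotten_qa)
        return hits / len(forgotten_qa)

    before = accuracy(unlearned_model)
    relearned = finetune(unlearned_model, benign_related_texts)  # no forget data
    after = accuracy(relearned)
    return before, after
```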

Ameet Talwalkar (@atalwalkar):

I’m excited to share new work from Datadog AI Research! We just released Toto, a new SOTA (by a wide margin!) time series foundation model, and BOOM, the largest benchmark of observability metrics. Both are available under the Apache 2.0 license. 🧵

Matthew Yang (@matthewyryang):

🚨 NEW PAPER: What if LLMs could tackle harder problems - not by explicitly training on longer traces, but by learning how to think longer? Our recipe e3 teaches models to explore in-context, enabling LLMs to unlock longer reasoning chains without ever seeing them in training.

ICML Conference (@icmlconf):

ICML offers an optional poster printing service: icml.myprintdesk.net. Orders can be picked up at the Vancouver Convention Centre in West MR 104 during the following hours: Monday-Friday, 7:30 am - 5:00 pm; Saturday, 8:00 am - 1:00 pm.

Pratiksha Thaker (@prthaker_):

I'm very excited to share some new work: arxiv.org/abs/2506.06488. This work started out in conversations with Thorn, where we realized that shadow-model MIAs (membership inference attacks) couldn't be used to audit models for harmful content involving children. See 🧵 for why, and our progress on solving this...
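
For context, the sketch below shows the textbook shadow-model MIA recipe, which makes the obstacle concrete: step 1 requires shadow training sets drawn from the same distribution as the sensitive records, which an auditor cannot lawfully hold in this setting. The `train` and `loss` helpers are caller-supplied stand-ins, and the midpoint threshold is a simplification of the learned attack classifiers used in practice.

```python
# Sketch of a classic shadow-model membership-inference attack (MIA).
# `train` and `loss` are stand-in callables; the threshold rule is a
# simplification for exposition.

def shadow_mia(candidate, target_model, shadow_datasets, train, loss):
    """Guess whether `candidate` was in the target model's training data."""
    # 1. Train shadow models with and without candidate-like records.
    #    This step is exactly what an auditor cannot do here: it requires
    #    possessing data like the sensitive records themselves.
    in_models = [train(data + [candidate]) for data in shadow_datasets]
    out_models = [train(data) for data in shadow_datasets]
    # 2. Compare the target's loss on the candidate against the "member"
    #    and "non-member" score populations.
    in_avg = sum(loss(m, candidate) for m in in_models) / len(in_models)
    out_avg = sum(loss(m, candidate) for m in out_models) / len(out_models)
    return loss(target_model, candidate) < (in_avg + out_avg) / 2  # "member"?
```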