
Sam Park
@smsampark
Postdoc at Stanford CS. Previously: MIT PhD, Cornell BS
ID: 1607751613
https://sungminpark.com
Joined: 20-07-2013 07:54:39
53 Tweets
403 Followers
476 Following


Contributed talks at ATTRIB 2023 (Rm 271-273): starting with Teddi Worledge on Corroborative vs Contributive data attribution for language models!


I gave a keynote this week at the fantastic ATTRIB Workshop #NeurIPS2023: "What does scale give us: Why we are building a ladder to the moon". Some of you asked for my slides, sharing below: docs.google.com/presentation/d… Thanks to the organizers for a great workshop!

How do we attribute an image generated by a diffusion model back to the training data? w/ Kristian Georgiev, Josh Vendrow, Hadi Salman, Sam Park: we show that it's useful to look at each step of the diffusion process:

We tend to choose LM training data via intuitive notions of text quality... but LMs are often *un*intuitive. Is there a better way? w/ Logan Engstrom, Axel Feldmann: we select better data by modeling how models learn from data. Our method, DsDm, can greatly improve
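The core idea — scoring candidate training data by its modeled effect on target-task loss rather than by surface "quality" — can be sketched as follows. This is an illustrative toy, not the actual DsDm implementation; the function names and the `estimated_effect` scores are hypothetical stand-ins for what a learned datamodel would provide.

```python
# Hypothetical sketch of model-aware data selection (illustrative names only):
# rank candidate examples by a datamodel's estimate of how much including
# each one changes held-out target-task loss, and keep the most helpful k.

def select_training_data(candidates, estimated_effect, k):
    """Pick the k candidates whose estimated effect on target-task
    loss is most negative (i.e. most loss-reducing)."""
    ranked = sorted(candidates, key=lambda ex: estimated_effect[ex])
    return ranked[:k]

# Toy scores standing in for datamodel estimates (negative = helpful).
effects = {"doc_a": -0.30, "doc_b": 0.05, "doc_c": -0.10, "doc_d": 0.20}
selected = select_training_data(list(effects), effects, k=2)
print(selected)  # the two most loss-reducing documents
```

The point of the sketch is the ranking criterion: selection is driven by a model of learning, not by heuristics about the text itself.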



In work w/ Andrew Ilyas Jennifer Allen Hannah Li Aleksander Madry we give experimental evidence that users strategize on recommender systems! We find that users react to their (beliefs about) *algorithms* (not just content!) to shape future recs. Paper: arxiv.org/abs/2405.05596 1/8


In ML, we train on biased (huge) datasets → models encode spurious correlations and fail on minority groups. Can we scalably remove "bad" data? w/ Saachi Jain, Kimia Hamidieh, Kristian Georgiev, Andrew Ilyas, Marzyeh, we propose D3M, a method for exactly this: gradientscience.org/d3m/
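The removal step can be sketched in the spirit of the tweet: estimate each training example's contribution to the loss on the worst-off group, then drop the examples estimated to hurt that group most. This is a hypothetical illustration, not the actual D3M method; `group_harm` stands in for real attribution scores.

```python
# Hypothetical sketch of attribution-driven data debiasing (names illustrative):
# drop the training examples with the largest estimated contribution to
# worst-group loss, keeping the rest of the dataset intact.

def drop_most_harmful(train_ids, group_harm, num_remove):
    """Remove the num_remove examples with the largest estimated
    contribution to worst-group loss."""
    worst_first = sorted(train_ids, key=lambda i: group_harm[i], reverse=True)
    dropped = set(worst_first[:num_remove])
    return [i for i in train_ids if i not in dropped]

harm = {"a": 0.02, "b": 0.90, "c": -0.10, "d": 0.40}  # toy attribution scores
kept = drop_most_harmful(list(harm), harm, num_remove=2)
print(kept)  # remaining training set after removing the two most harmful
```

Because the criterion targets worst-group loss directly, this differs from generic "low quality" filtering: an example can be fluent text and still be harmful to a minority group.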


At #ICML2024? Our tutorial "Data Attribution at Scale" will be tomorrow at 9:30 AM CEST in Hall A1! I will not be able to make it (but will arrive later that day), but my awesome students Andrew Ilyas, Sam Park, Logan Engstrom will carry the torch :)

Starting now in Hall A1! With accompanying notes (WIP) at ml-data-tutorial.org (compiled w/ Sam Park, Logan Engstrom, Kristian Georgiev, Aleksander Madry)

Attending #ICML2024? Check out our work on decomposing predictions and editing model behavior via targeted interventions to model internals! Poster: #2513, Hall C 4-9, 1:30p (Tue) Paper: arxiv.org/abs/2404.11534 w/ Harshay Shah Andrew Ilyas


Stop by our poster on model-aware dataset selection at ICML! Location/time: 1:30pm Hall C 4-9 #1010 (Tuesday) Paper: arxiv.org/abs/2401.12926 with: Axel Feldmann Aleksander Madry

Thanks to all who attended our tutorial "Data Attribution at Scale" at ICML (w/ Sam Park Logan Engstrom Kristian Georgiev Aleksander Madry)! We're really excited to see the response to this emerging topic. Slides, notes, ICML video: ml-data-tutorial.org Public recording soon!


The ATTRIB workshop is back @ NeurIPS 2024! We welcome papers connecting model behavior to data, algorithms, parameters, scale, or anything else. Submit by Sep 18! More info: attrib-workshop.cc Co-organizers: Tolga Bolukbasi Logan Engstrom Sadhika Malladi Elisa Nguyen Sam Park



Machine unlearning ("removing" training data from a trained ML model) is a hard, important problem. Datamodel Matching (DMM): a new unlearning paradigm with strong empirical performance! w/ Kristian Georgiev Roy Rinberg Sam Park Shivam Garg Aleksander Madry Seth Neel (1/4)
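The unlearning target in this paradigm can be sketched with a linear datamodel: if a model output is approximated as a sum of per-training-example contributions, then the output of a model retrained without the forget set can be estimated by summing contributions over the retained examples only, and the original model fine-tuned toward that prediction. A minimal toy, with hypothetical names and toy weights (not the actual DMM code):

```python
# Hypothetical sketch of the Datamodel Matching idea (illustrative only):
# a linear datamodel predicts a model output as a sum of per-example
# contributions, so "retrain without the forget set" has a cheap proxy:
# sum contributions over the retained examples.

def predicted_margin(datamodel_weights, included_examples):
    """Linear datamodel: output ≈ sum of weights of included examples."""
    return sum(datamodel_weights[i] for i in included_examples)

weights = {"ex1": 0.4, "ex2": -0.1, "ex3": 0.25}  # toy per-example weights
forget = {"ex2"}
retained = [i for i in weights if i not in forget]

full = predicted_margin(weights, weights)      # trained on everything
target = predicted_margin(weights, retained)   # predicted retrain-from-scratch output
print(round(full, 2), round(target, 2))
```

The `target` values play the role of the matching objective: fine-tuning the original model toward them approximates retraining without the forget set, without paying the cost of actual retraining.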


At #ICLR, check out Perplexity Correlations: a statistical framework to select the best pretraining data with no LLM training! I can't make the trip, but Tatsunori Hashimoto will present the poster for us! Continue reading for the latest empirical validations of PPL Correlations:
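The statistical idea — no LLM training required — can be sketched as: across many existing public models, correlate each candidate domain's perplexity with downstream benchmark score, and prefer domains where lower perplexity tracks higher accuracy. This is a toy illustration of that correlation criterion, not the actual Perplexity Correlations framework; the model scores and domain perplexities below are invented.

```python
import math

def pearson(xs, ys):
    """Pearson correlation between two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

# Toy data: three public models' benchmark accuracies, and each model's
# perplexity on two candidate pretraining domains (values invented).
bench = [0.40, 0.55, 0.70]
ppl = {
    "web":  [120.0, 90.0, 60.0],  # lower perplexity tracks higher accuracy
    "spam": [50.0, 55.0, 61.0],   # lower perplexity tracks lower accuracy
}

# Most negative correlation = most promising pretraining domain.
ranked = sorted(ppl, key=lambda d: pearson(ppl[d], bench))
print(ranked[0])  # → web
```

The appeal is that everything needed — public models, their benchmark scores, and perplexities on candidate data — already exists, so data selection becomes a statistical estimation problem rather than a training problem.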
