Harshay Shah (@harshays_)'s Twitter Profile
Harshay Shah

@harshays_

ML PhD student

ID: 591163833

Link: http://harshay.me · Joined: 26-05-2012 17:03:22

21 Tweets

533 Followers

691 Following

Harshay Shah (@harshays_)

Neural nets can generalize well on test data, but often lack robustness to distributional shifts & adversarial attacks.

Our #NeurIPS2020 paper on simplicity bias sheds light on this phenomenon. 

Poster: session #4, town A2, spot C0, 12pm ET today!
Paper: bit.ly/39RXDel
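
A minimal sketch of the simplicity-bias phenomenon the tweet describes, on a made-up two-feature dataset (the data, the MLP, and the training setup below are illustrative assumptions, not the paper's setup): the network tends to lean on the linearly separable feature and largely ignore the equally predictive non-linear one.

import torch
import torch.nn as nn

torch.manual_seed(0)
n = 4000
y = torch.randint(0, 2, (n,)).float()

# "Simple" feature: linearly separable given the label.
simple = (2 * y - 1) + 0.1 * torch.randn(n)

# "Complex" feature: equally predictive, but needs a non-linear rule (|x| < 1).
inner = torch.rand(n) * 2 - 1                                   # class 1: inside [-1, 1]
sign = torch.where(torch.rand(n) > 0.5, torch.ones(n), -torch.ones(n))
outer = sign * (torch.rand(n) + 1.0)                            # class 0: outside [-1, 1]
complex_feat = torch.where(y.bool(), inner, outer)

X = torch.stack([simple, complex_feat], dim=1)

model = nn.Sequential(nn.Linear(2, 32), nn.ReLU(), nn.Linear(32, 1))
opt = torch.optim.Adam(model.parameters(), lr=1e-2)
loss_fn = nn.BCEWithLogitsLoss()
for _ in range(300):
    opt.zero_grad()
    loss_fn(model(X).squeeze(1), y).backward()
    opt.step()

def acc(X_):
    return ((model(X_).squeeze(1) > 0).float() == y).float().mean().item()

# Scramble one feature at a time to see which one the model actually uses.
perm = torch.randperm(n)
X_no_simple, X_no_complex = X.clone(), X.clone()
X_no_simple[:, 0] = X[perm, 0]
X_no_complex[:, 1] = X[perm, 1]
print(f"clean accuracy:            {acc(X):.2f}")
print(f"simple feature scrambled:  {acc(X_no_simple):.2f}")   # tends to fall toward chance
print(f"complex feature scrambled: {acc(X_no_complex):.2f}")  # tends to stay high
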
Harshay Shah (@harshays_)

Do input gradients highlight discriminative and task-relevant features? 

Our #NeurIPS2021 paper takes a three-pronged approach to evaluate the fidelity of input gradient attributions.

Poster: session 3, spot C0 
Paper: bit.ly/3EzdvyH
with Prateek Jain and @pnetrapalli
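
For context, the input-gradient attributions under evaluation are computed roughly as below; this is just the generic vanilla-gradient saliency recipe with a placeholder (untrained) model, not the paper's evaluation protocol.

import torch
from torchvision.models import resnet18

model = resnet18(weights=None).eval()               # placeholder model, untrained
x = torch.randn(1, 3, 224, 224, requires_grad=True)

logits = model(x)
logits[0, logits.argmax()].backward()               # gradient of the top-class score w.r.t. the input

saliency = x.grad.abs().max(dim=1).values           # per-pixel attribution map, shape (1, 224, 224)
print(saliency.shape)
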
Aleksander Madry (@aleks_madry)

You’re deploying an ML system, choosing between two models trained w/ diff algs. Same training data, same acc... how do you differentiate their behavior?

ModelDiff (gradientscience.org/modeldiff) lets you compare *any* two learning algs!
w/ Harshay Shah, Sam Park, Andrew Ilyas (1/8)
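
ModelDiff itself compares learning algorithms through their data attributions; the snippet below is only a naive behavioral diff (find test points where two equally accurate models disagree) to convey the kind of question being asked. model_a, model_b, and loader are placeholders.

import torch

@torch.no_grad()
def disagreement_indices(model_a, model_b, loader):
    """Indices of examples where the two models' predicted classes differ."""
    idx, offset = [], 0
    for x, _ in loader:
        pred_a = model_a(x).argmax(dim=1)
        pred_b = model_b(x).argmax(dim=1)
        differ = (pred_a != pred_b).nonzero(as_tuple=True)[0]
        idx.extend((differ + offset).tolist())
        offset += x.shape[0]
    return idx
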
Andrew Ilyas (@andrew_ilyas)

TRAK, our latest work on data attribution (trak.csail.mit.edu), speeds up datamodels up to 1000x!

➡️ our earlier work ModelDiff (w/ Harshay Shah, Sam Park, Aleksander Madry) can now compare any two learning algorithms in larger-scale settings.

Try it out: github.com/MadryLab/model…

Harshay Shah (@harshays_)

If you are at #ICML2023 today, check out our work on ModelDiff, a model-agnostic framework for pinpointing differences between any two (supervised) learning algorithms!

Poster: #407 at 2pm (Wednesday) 
Paper: icml.cc/virtual/2023/p…
w/ Sam Park, Andrew Ilyas, Aleksander Madry
Harshay Shah (@harshays_)

New work with Andrew Ilyas and Aleksander Madry on tracing predictions back to individual components (conv filters, attn heads) in the model!

Paper: arxiv.org/abs/2404.11534
Thread: 👇
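
A crude way to see what "tracing predictions back to components" means: ablate a single conv filter and measure the change in the predicted-class logit. This is only an illustrative ablation with a placeholder untrained ResNet, not the component-attribution method from the paper.

import torch
from torchvision.models import resnet18

model = resnet18(weights=None).eval()       # placeholder model
x = torch.randn(1, 3, 224, 224)

with torch.no_grad():
    base = model(x)[0]

    conv = model.layer1[0].conv1            # pick one conv layer...
    saved = conv.weight[3].clone()          # ...and one output filter as the "component"
    conv.weight[3].zero_()                  # ablate it
    ablated = model(x)[0]
    conv.weight[3].copy_(saved)             # restore the weights

effect = (base - ablated)[base.argmax()]    # change in the predicted-class logit
print(effect.item())
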

Aleksander Madry (@aleks_madry)

How is an LLM actually using the info given to it in its context? Is it misinterpreting anything or making things up?

Introducing ContextCite: a simple method for attributing LLM responses back to the context: gradientscience.org/contextcite

w/ Ben Cohen-Wang, Harshay Shah, Kristian Georgiev
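
ContextCite fits a surrogate model over many random context ablations; the leave-one-out loop below is a much cruder stand-in that conveys the idea of scoring each context source by how much the response depends on it. logprob_of_response is a hypothetical function you would implement with your LLM of choice.

from typing import Callable, List

def loo_context_scores(
    sources: List[str],
    query: str,
    response: str,
    logprob_of_response: Callable[[List[str], str, str], float],
) -> List[float]:
    """Score each context source by the drop in response log-probability when it is removed."""
    full = logprob_of_response(sources, query, response)
    scores = []
    for i in range(len(sources)):
        ablated = sources[:i] + sources[i + 1:]       # drop one source
        scores.append(full - logprob_of_response(ablated, query, response))
    return scores                                     # higher = the response relies more on that source
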

MIT CSAIL (@mit_csail)

How do black-box neural networks transform raw data into predictions? Inside these models are thousands of simple "components" working together.

New MIT CSAIL research (bit.ly/473lcfE) introduces a method that helps us understand how these components compose to affect…

MIT CSAIL (@mit_csail)

How can we really know if a chatbot is giving a reliable answer? 🧵

MIT CSAIL’s "ContextCite" tool can ID the parts of external context used to generate any particular statement from a language model, improving trust by helping users easily verify the statement:
Harshay Shah (@harshays_)

MoEs provide two knobs for scaling: model size (total params) + FLOPs-per-token (via active params). What’s the right scaling strategy? And how does it depend on the pretraining budget? Our work introduces sparsity-aware scaling laws for MoE LMs to tackle these questions! 🧵👇
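
The two knobs in the tweet boil down to simple arithmetic for a top-k routed MoE layer (the sizes below are made-up illustrative numbers, and the small router is ignored):

d_model, d_ff = 1024, 4096
n_experts, top_k = 64, 2

expert_params = 2 * d_model * d_ff           # one FFN expert: up-projection + down-projection
total_params = n_experts * expert_params     # the "model size" knob
active_params = top_k * expert_params        # the "FLOPs-per-token" knob

print(f"total expert params:     {total_params / 1e6:.0f}M")
print(f"active params per token: {active_params / 1e6:.0f}M")
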

Ben Cohen-Wang (@bcohenwang)

It can be helpful to pinpoint the in-context information that a language model uses when generating content (is it using provided documents? or its own intermediate thoughts?). We present Attribution with Attention (AT2), a method for doing so efficiently and reliably! (1/8)
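
AT2 learns how to turn attention into reliable attributions; the snippet below is only the crude raw-attention baseline it improves on, with assumed shapes (n_layers, n_heads, seq_len, seq_len) and random weights as a stand-in.

import torch

def naive_attention_scores(attn: torch.Tensor, context_len: int) -> torch.Tensor:
    """Average attention flowing from generated positions back onto context positions."""
    mixed = attn.mean(dim=(0, 1))                   # average over layers and heads
    generated_rows = mixed[context_len:]            # queries = generated tokens
    return generated_rows[:, :context_len].mean(0)  # one score per context token

scores = naive_attention_scores(torch.rand(4, 8, 32, 32), context_len=20)
print(scores.shape)                                 # torch.Size([20])
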
Harshay Shah (@harshays_)

If you’re at #ICLR2025, go watch Vimal Thilak🦉🐒 give an oral presentation at the @SparseLLMs workshop on scaling laws for pretraining MoE LMs! Had a great time co-leading this project with Samira Abnar & Vimal Thilak🦉🐒 at Apple MLR last summer.

When: Sun Apr 27, 9:30a
Where: Hall 4-07