Suraj Srinivas (@suuraj) 's Twitter Profile
Suraj Srinivas

@suuraj

ml researcher at bosch ai / trying to understand why deep learning works

ID: 45312366

Link: https://suraj-srinivas.github.io/ · Joined: 07-06-2009 09:22:42

941 Tweets

1.1K Followers

1.1K Following

jack morris (@jxmnop) 's Twitter Profile Photo

# A new type of information theory

this paper is not super well-known but has changed my opinion of how deep learning works more than almost anything else

it says that we should measure the amount of information available in some representation based on how *extractable* it is,
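
The "extractability" idea can be made concrete with a toy sketch (my illustration, not the paper's construction): measure the information in a representation by the accuracy of a simple probe trained to decode a label from it. Both representations below carry the label, but only one exposes it to a linear probe.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy setup: a binary label y, plus two representations that both
# "contain" y, but differ in how easily a simple probe can extract it.
n = 2000
y = rng.integers(0, 2, size=n)
linear_rep = np.column_stack([y + 0.3 * rng.normal(size=n),
                              rng.normal(size=n)])              # linearly decodable
xor_bit = rng.integers(0, 2, size=n)
hard_rep = np.column_stack([y ^ xor_bit, xor_bit]).astype(float)  # XOR-encoded

def probe_accuracy(X, y, steps=500, lr=0.5):
    """Accuracy of a logistic-regression probe: a proxy for how
    *extractable* the label is from representation X."""
    X = np.column_stack([X, np.ones(len(X))])  # append bias term
    w = np.zeros(X.shape[1])
    for _ in range(steps):
        p = 1.0 / (1.0 + np.exp(-X @ w))
        w -= lr * X.T @ (p - y) / len(y)       # gradient step on log-loss
    return ((X @ w > 0) == y).mean()

print(probe_accuracy(linear_rep, y))  # high: label is easy to extract
print(probe_accuracy(hard_rep, y))    # near chance: label present but not linearly extractable
```

The point of the paper's framing, on this reading: classical mutual information would rate both representations identically, while a computation-bounded notion of information separates them.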
Naomi Saphra hiring a lab 🧈🪰 (@nsaphra) 's Twitter Profile Photo

Take a break from arxiv/LW/AF. Sit in the woods with a random textbook and mull new ideas away from interp community lockstep. Diverge. Don’t compete with a saturated subtopic, maybe you’ll get to take weekends off. Premature overinvestment comes from monoculture.

☉rthonormalist🧭✡️ (@orthonormalist) 's Twitter Profile Photo

DID I CRACK IT? I think I figured out at least a chunk of the math. It's the trade deficit divided by their exports. EU: exports 531.6, imports 333.4, deficit 198.2. 198.2/531.6 is 37%, close to 39%. Israel: exports 22.2, imports 14.8, deficit 7.4. 7.4/22.2 is 33%.
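
The arithmetic can be verified with a short sketch (the deficit-over-exports formula is the tweet's conjecture, not an official definition):

```python
def tariff_ratio(exports, imports):
    """Trade deficit (exports minus imports, per the tweet's framing)
    divided by the partner's exports, as a rounded percentage."""
    return round(100 * (exports - imports) / exports)

print(tariff_ratio(531.6, 333.4))  # EU: 37
print(tariff_ratio(22.2, 14.8))    # Israel: 33
```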

Michal Moshkovitz (@ml_theorist) 's Twitter Profile Photo

In April 2024, we launched the Theory of Interpretable XAI seminar, aiming to build a community—unsure if we’d even have enough speakers. A year later, we’re still growing.

New to the seminar? Join us in building the foundations of XAI together

<a href="/tverven/">Tim van Erven</a> <a href="/Suuraj/">Suraj Srinivas @ ICML</a> 

1/n
Michal Moshkovitz (@ml_theorist) 's Twitter Profile Photo

⏰⏰Theory of Interpretable AI Seminar ⏰⏰

Interested in Feature Attribution Explanations?

In two weeks, on May 6, <a href="/gcskoenig/">Gunnar König</a> will talk about "Disentangling Interactions and Dependencies in Feature Attribution"

<a href="/tverven/">Tim van Erven</a> <a href="/Suuraj/">Suraj Srinivas @ ICML</a>
Michal Moshkovitz (@ml_theorist) 's Twitter Profile Photo

Curious about feature attribution? SHAP & LIME treat features independently—but features interact! Come hear how to "Disentangle Interactions and Dependencies in Feature Attribution" Tuesday (tomorrow!) 4pm CET, 10am ET Suraj Srinivas @ ICML Tim van Erven
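
The independence-vs-interaction point can be illustrated with exact Shapley values on a toy model (a hypothetical sketch, not the speaker's method): for a pure interaction f(x1, x2) = x1 * x2, each feature alone adds nothing over the baseline, yet the attribution must split the interaction's credit between them.

```python
import math
from itertools import combinations

# Toy model with a pure interaction between its two features.
def f(x):
    return x[0] * x[1]

def shapley(f, x, baseline=(0.0, 0.0)):
    """Exact Shapley values of f at x, relative to a baseline input."""
    n = len(x)
    phis = []
    for i in range(n):
        phi = 0.0
        others = [j for j in range(n) if j != i]
        for r in range(n):
            for S in combinations(others, r):
                weight = (math.factorial(r) * math.factorial(n - r - 1)
                          / math.factorial(n))
                with_i = [x[j] if j in S or j == i else baseline[j] for j in range(n)]
                without_i = [x[j] if j in S else baseline[j] for j in range(n)]
                phi += weight * (f(with_i) - f(without_i))
        phis.append(phi)
    return phis

# Each feature's marginal contribution from the baseline is zero,
# yet the attribution assigns 0.5 to each: the interaction is
# split between features rather than disentangled from them.
print(shapley(f, (1.0, 1.0)))  # [0.5, 0.5]
```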

arlo_son (@gson_ai) 's Twitter Profile Photo

#NLProc
AI Co-Scientists 🤖 can generate ideas, but can they spot mistakes? (not yet! 🚫)

In my recent paper, we introduce SPOT, a dataset of STEM manuscripts (math, materials science, chemistry, physics, etc), annotated with real errors.

SOTA models like o3, gemini-2.5-pro
Michal Moshkovitz (@ml_theorist) 's Twitter Profile Photo

⏰⏰ Theory of Interpretable AI Seminar ⏰⏰
Chain-of-Thought: Why does explaining to LLMs using CoT prompting work?

Join us on June 3, when <a href="/bohang_zhang/">Bohang Zhang @ICLR 2024</a> will dive into the mechanisms behind chain-of-thought prompting — and what makes it so effective

<a href="/tverven/">Tim van Erven</a>
<a href="/Suuraj/">Suraj Srinivas @ ICML</a>
Goodfire (@goodfireai) 's Twitter Profile Photo

We created a canvas that plugs into an image model’s brain. You can use it to generate images in real-time by painting with the latent concepts the model has learned. Try out Paint with Ember for yourself 👇

jack morris (@jxmnop) 's Twitter Profile Photo

## The case for more ambition

i wrote about how AI researchers should ask bigger and simpler questions, and publish fewer papers:
Sebastian Bordt (@sbordt) 's Twitter Profile Photo

Have you ever wondered whether a few times of data contamination really lead to benchmark overfitting?🤔 Then our latest paper about the effect of data contamination on LLM evals might be for you!🚀 "How Much Can We Forget about Data Contamination?" (accepted at #ICML2025) shows

Michael Black (@michael_j_black) 's Twitter Profile Photo

Here's how my recent papers & reviews are going:

* To solve a vision problem today, the sensible thing is to leverage a pre-trained VLM or video diffusion model. Such models implicitly represent a tremendous amount about the visual world that we can exploit.
* Figure out how to