Ian Tenney (@iftenney)'s Twitter Profile
Ian Tenney (@iftenney@sigmoid.social)

@iftenney

Staff Research Scientist, People + AI Research @GoogleAI #GoogleResearch. Interpretability, analysis, and visualizations for LLMs. Opinions my own.

ID: 898538970

Website: http://iftenney.github.io | Joined: 22-10-2012 22:16:38

92 Tweets

1.1K Followers

530 Following

Google AI (@googleai)

Recent innovation has given rise to #ML models w/ impressive capabilities, but there’s much to learn about how we attribute model behavior to training data, algorithms, architecture, & more! Have papers or ideas on this? Submit to ATTRIB @ #NeurIPS2023 → attrib-workshop.cc

Lucas Dixon (@iislucas)

PAIR is looking for a Research Scientist interested in making hard ML problems (like understanding language) much smaller... In Paris, and working closely with fun interactive explorable visualizations too. See: goo.gle/3PcvPEs

Jeff Dean (@jeffdean)

We're also releasing a Responsible Generative AI Toolkit that provides resources to apply best practices for responsible use of open models such as the Gemma models, including guidance on setting safety policies, safety tuning, safety classifiers, and model evaluation.

Ian Tenney (@iftenney)

Super excited for the Gemma model release, and with it a new debugging tool we built on 🔥LIT - use gradient-based salience to debug and refine complex LLM prompts! ai.google.dev/responsible/mo…
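
For readers unfamiliar with the technique: gradient-based salience scores each prompt token by how strongly it influences the model's prediction. Below is a minimal gradient-x-input sketch using a generic Hugging Face causal LM, not the LIT implementation itself; the model name and prompt are placeholder assumptions.

```python
# Minimal gradient-x-input salience sketch (not the LIT implementation).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # small stand-in model (assumption); any causal LM works similarly
tok = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)
model.eval()

prompt = "The capital of France is"
ids = tok(prompt, return_tensors="pt").input_ids

# Embed the prompt ourselves so we can take gradients w.r.t. the input embeddings.
embeds = model.get_input_embeddings()(ids).detach().requires_grad_(True)
logits = model(inputs_embeds=embeds).logits

# Backprop from the model's top prediction at the final position.
target = logits[0, -1].argmax()
logits[0, -1, target].backward()

# Gradient-x-input, summed over the embedding dimension: one score per prompt token.
salience = (embeds.grad[0] * embeds[0]).sum(-1).detach()
for token, score in zip(tok.convert_ids_to_tokens(ids[0]), salience.tolist()):
    print(f"{token:>12}  {score:+.4f}")
```

Tokens with large-magnitude scores are the ones the prediction is most sensitive to, which is what makes this useful for spotting which parts of a long prompt actually matter.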

Ian Tenney (@iftenney)

Very, very cool work from Asma Ghandeharioun and colleagues - Patchscopes is a super flexible way to decode knowledge from internal states of an LLM, bridging between interpretability, causality, and control.
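
The core idea, roughly: read a hidden state out of one forward pass and patch it into a second "inspection" prompt, where the model itself decodes what that state encodes. Below is a hypothetical sketch of that patching step (not the authors' code); the model, layer, positions, and inspection prompt are all illustrative assumptions.

```python
# Hypothetical sketch of the Patchscopes idea: patch a hidden state from a
# source prompt into a target prompt and read off what the model decodes.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # small stand-in model (assumption)
tok = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name).eval()

layer = 6      # which layer's hidden state to copy (assumption)
src_pos = -1   # take the hidden state at the last source token
tgt_pos = -1   # patch it into the last target token

source = "Diana, Princess of Wales"
target = "cat -> cat; 135 -> 135; hello -> hello; ?"  # identity-style inspection prompt

# 1) Run the source prompt and grab the hidden state at (layer, src_pos).
src_ids = tok(source, return_tensors="pt").input_ids
with torch.no_grad():
    src_hidden = model(src_ids, output_hidden_states=True).hidden_states[layer][0, src_pos]

# 2) Run the target prompt, overwriting the hidden state at (layer, tgt_pos).
def patch_hook(module, inputs, output):
    hidden = output[0] if isinstance(output, tuple) else output
    hidden[0, tgt_pos] = src_hidden  # in-place patch of the residual stream
    return output

handle = model.transformer.h[layer - 1].register_forward_hook(patch_hook)
tgt_ids = tok(target, return_tensors="pt").input_ids
with torch.no_grad():
    logits = model(tgt_ids).logits[0, -1]
handle.remove()

print("Decoded from patched state:", tok.decode(logits.argmax().item()))
```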

Ian Tenney (@iftenney)

New open-source tool: LLM Comparator can help make sense of side-by-side model evals. In-browser demo at pair-code.github.io/llm-comparator/, and read more in the blog post below.

Ian Tenney (@iftenney)

Try out LLM Comparator to help make sense of LLM evaluations! Alongside the Gemma 2 launch, we've released a Python library to help run rater models, bulletizing, and clustering - so all you need are prompts and model responses. Scripts and notebooks at goo.gle/llm-comparator
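
For context, the comparator works over per-prompt response pairs from two models. The sketch below only illustrates assembling that kind of side-by-side data as JSON; the field names here are hypothetical, and the real input schema, scripts, and notebooks are at the link above.

```python
# Hypothetical sketch only: assembling side-by-side eval data for a
# comparison tool. Field names are illustrative, not the actual schema.
import json

examples = [
    {
        "prompt": "Explain the difference between a list and a tuple in Python.",
        "response_a": "Lists are mutable; tuples are immutable...",
        "response_b": "Both hold sequences, but tuples cannot be modified...",
    },
    {
        "prompt": "Write a haiku about debugging.",
        "response_a": "Stack trace in the night...",
        "response_b": "Print statements bloom...",
    },
]

with open("comparison_input.json", "w") as f:
    json.dump({"model_a": "my-model", "model_b": "baseline", "examples": examples}, f, indent=2)
```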

Ian Tenney (@iftenney)

Check out our new work on scaling training data attribution (TDA) toward LLM pretraining - and some interesting things we found along the way. arxiv.org/abs/2410.17413 and more below from most excellent student researcher Tyler Chang ⬇️
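
For readers new to training data attribution: the basic gradient-similarity recipe scores a training example by how much its loss gradient aligns with the gradient of a query example. The sketch below shows that generic TracIn-style dot product on a small stand-in model; it is not the method or scale from the paper, just the underlying intuition.

```python
# Generic gradient-similarity TDA sketch (TracIn-style); not the paper's method.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # small stand-in model (assumption)
tok = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

def loss_grad(text):
    """Flattened gradient of the LM loss on `text` w.r.t. all model parameters."""
    model.zero_grad()
    ids = tok(text, return_tensors="pt").input_ids
    loss = model(ids, labels=ids).loss
    loss.backward()
    return torch.cat([p.grad.flatten() for p in model.parameters() if p.grad is not None])

query = "The capital of France is Paris."
candidates = [
    "Paris is the capital and largest city of France.",
    "Bananas are rich in potassium.",
]

q = loss_grad(query)
# Higher dot product ~ the training example pushes the model toward the query.
scores = [(torch.dot(loss_grad(c), q).item(), c) for c in candidates]
for score, text in sorted(scores, reverse=True):
    print(f"{score:+.3e}  {text}")
```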

Tyler Chang (@tylerachang)

Presenting our work on training data attribution for pretraining this morning: iclr.cc/virtual/2025/p… -- come stop by in Hall 2/3 #526 if you're here at ICLR!

Ian Tenney (@iftenney)

Submission deadline extended to May 19! Working on or have thoughts on real-world applications of interpretability, and how we can use it in practice? Consider submitting to our workshop at ICML 2025. More at actionable-interpretability.github.io and below👇

Mor Geva (@megamor2)

Going to #icml2025? Don't miss the Actionable Interpretability Workshop (@ActInterp)! We've got an amazing lineup of speakers, panelists, and papers, all focused on leveraging insights from interpretability research to tackle practical, real-world problems ✨

Tal Haklay (@tal_haklay)

🚨 Meet our panelists at the Actionable Interpretability Workshop (@ActInterp) at ICML!

Join us July 19 at 4pm for a panel on making interpretability research actionable, its challenges, and how the community can drive greater impact.

Naomi Saphra, Samuel Marks, Kyle Lo, Fazl Barez
