Sarah Wiegreffe (on faculty job market!) (@sarahwiegreffe) Twitter Tweets • TwiCopy

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

Research in language model explainability & interpretability since 2017. Postdoc @allen_ai @uwnlp. PhD from @mlatgt @gtcomputing. Views my own, not my employer's.

ID: 1882939814

Link: http://sarahwie.github.io
Joined: 19-09-2013 12:05:27

1.1K Tweets

4.4K Followers

1.1K Following

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

3 months ago

Check out our new preprint/project, which has been over a year in the making! This has been a very fun collaboration (and one of the biggest I've personally participated in). We are quite excited about the leaderboard and release, and are open to feedback to help this remain a

Likes: 29 · Replies: 0 · Retweets: 2

Abhilasha Ravichander

@lasha_nlp

3 months ago

Stoked that HALoGEN (non-archival version) won best paper award at the TrustNLP workshop @ #NAACL2025! Our work explores LLM hallucinations and their potential roots in training data. Excited to discuss more --- come find us!

Likes: 78 · Replies: 5 · Retweets: 11

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

3 months ago

We extended the deadline by 10 days! Consider submitting ⬇️

Likes: 15 · Replies: 0 · Retweets: 4

Yonatan Belinkov

@boknilev

2 months ago

Since people have been asking - the #blackboxNLP workshop will return this year, to be held with #emnlp2025. This workshop is all about interpreting and analyzing NLP models (and yes, this includes LLMs). More details soon, follow BlackboxNLP

Likes: 73 · Replies: 1 · Retweets: 11

Hadas Orgad

@orgadhadas

2 months ago

Just 6 days left! ⏰ Submit your work to the Actionable Interpretability Workshop at #ICML2025 by May 19th. Contribute to the future of interpretable and impactful AI! Actionable Interpretability Workshop ICML2025

Likes: 10 · Replies: 1 · Retweets: 5

Tal Haklay

@tal_haklay

2 months ago

We knew many of you wanted to submit to our Actionable Interpretability workshop, but we didn’t expect to crash Overleaf! 😏🍃 Only 5 days left ⏰! Got a paper accepted to ICML that fits our theme? Submit it to our conference track! 👉 Actionable Interpretability Workshop ICML2025

Likes: 16 · Replies: 2 · Retweets: 5

Yonatan Belinkov

@boknilev

2 months ago

BlackboxNLP will be co-located with #EMNLP2025 in Suzhou this November! 📷This edition will feature a new shared task on circuits/causal variable localization in LMs, details: blackboxnlp.github.io/2025/task If you're into mech interp and care about evaluation, please submit!

Likes: 71 · Replies: 1 · Retweets: 20

Hadas Orgad

@orgadhadas

2 months ago

🚨 Announcing the keynote speakers in the Actionable Interpretability Workshop ICML2025 workshop at #icml2025 Join us to hear these leading experts share their take on how interpretability can drive real-world impact in AI. Been Kim Sarah Schwettmann byron wallace Eric Wong

Likes: 51 · Replies: 0 · Retweets: 5

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

2 months ago

We got more submissions to the workshop than we anticipated, and are looking for reviewers willing to review 2-4 papers between May 24 and June 7. If you are interested, please self-nominate! Thank you 🙏 docs.google.com/forms/d/e/1FAI…

Likes: 13 · Replies: 0 · Retweets: 5

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

a month ago

A bit late to announce, but I’m excited to share that I'll be starting as an assistant professor at the University of Maryland UMD Department of Computer Science this August. I'll be recruiting PhD students this upcoming cycle for fall 2026. (And if you're a UMD grad student, sign up for my fall seminar!)

Likes: 564 · Replies: 70 · Retweets: 48

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

a month ago

Go Ai2 Ai2!

Likes: 12 · Replies: 0 · Retweets: 0

Neel Nanda

@neelnanda5

22 days ago

Good news! There will be a mechanistic interpretability workshop at NeurIPS (Dec 6/7, San Diego) If you were disappointed that ICML rejected us, now we'll do an even better one: 4 more months of progress to discuss! Papers likely due late August/early Sept, more info soon

Likes: 288 · Replies: 4 · Retweets: 19

Mor Geva

@megamor2

18 days ago

Going to #icml2025? Don't miss the Actionable Interpretability Workshop (Actionable Interpretability Workshop ICML2025)! We've got an amazing lineup of speakers, panelists, and papers, all focused on leveraging insights from interpretability research to tackle practical, real-world problems ✨

Likes: 43 · Replies: 1 · Retweets: 5

Tal Haklay

@tal_haklay

16 days ago

🚨Meet our panelists at the Actionable Interpretability Workshop Actionable Interpretability Workshop ICML2025 at ICML Conference! Join us July 19 at 4pm for a panel on making interpretability research actionable, its challenges, and how the community can drive greater impact. Naomi Saphra hiring my lab at ICML 🧈🪰 Samuel Marks Kyle Lo Fazl Barez

Likes: 55 · Replies: 0 · Retweets: 12

Hadas Orgad

@orgadhadas

15 days ago

Started packing for #ICML2025? We're already excited for the Actionable Interpretability Workshop ICML2025 workshop! Only 8 days away. Confirmed keynotes: Been Kim, Sarah Schwettmann, Byron Wallace and Eric Wong. Schedule is out. Plan to join us 👉

Likes: 18 · Replies: 1 · Retweets: 6

Samuel Marks

@saprmarks

11 days ago

I'm excited to discuss downstream applications of interpretability at Actionable Interpretability Workshop ICML2025! For a preview of my thoughts on the topic, see my blog post on how I think about picking applications to target x.com/saprmarks/stat…

Likes: 30 · Replies: 0 · Retweets: 4