Anja Reusch (@anja_reu)'s Twitter Profile
Anja Reusch

@anja_reu

Postdoc @ Technion, working on Interpretability in Information Retrieval 🔎 and NLProc 💬

ID: 1546482735958597635

Joined: 11-07-2022 13:14:46

8 Tweets

31 Followers

52 Following

Zorik Gekhman (@zorikgekhman)

🚨 It's often claimed that LLMs know more facts than they show in their outputs, but what does this actually mean, and how can we measure this “hidden knowledge”?
In our new paper, we clearly define this concept and design controlled experiments to test it.
1/🧵

Tal Haklay (@tal_haklay)

🚨 Call for Papers is Out!
The First Workshop on Actionable Interpretability will be held at ICML 2025 in Vancouver!
📅 Submission Deadline: May 9
Follow us >> Actionable Interpretability Workshop ICML2025
🧠 Topics of interest include: 👇

Hadas Orgad (@orgadhadas)

So – why actionable interpretability? Most interpretability work stops at "here’s what the model is doing." But what do we do with that knowledge? Can we change how models are trained, deployed, or aligned based on these insights? That’s the core question we’re tackling.

Actionable Interpretability Workshop ICML2025 (@actinterp)

🚨 We're looking for reviewers for the workshop!
If you're passionate about making interpretability useful and want to help shape the conversation, we'd love your input.
Sign up to review >> 💡🔍

Hadas Orgad (@orgadhadas)

Position papers wanted!
For the First Workshop on Actionable Interpretability, we’re looking for diverse perspectives on the state of the field. Should certain areas of interpretability research be developed further? Are there key metrics we should prioritize? Or do you have >>

Yaniv Nikankin (@ynikankin)

Interested in mechanistic interpretability? We'll be presenting our work on arithmetic mechanisms in LLMs later this week at #ICLR2025.
DM me if you're there and want to chat about AI interpretability.
📆 Friday, April 25th, 10-12:30 (Poster #243)
🔖 iclr.cc/virtual/2025/p…

Hadas Orgad (@orgadhadas)

Deadline extended! ⏳
The Actionable Interpretability Workshop at #ICML2025 has moved its submission deadline to May 19th. More time to submit your work 🔍🧠✨ Don’t miss out!

Hadas Orgad (@orgadhadas)

Congratulations to everyone whose papers were accepted to ICML Conference!
If your work lies at the intersection of interpretability and actionability/practicality/usefulness, consider submitting it to our Actionable Interpretability Workshop ICML2025 conference track!
Deadline is May 19th.

Sarah Wiegreffe (on faculty job market!) (@sarahwiegreffe)

We got more submissions to the workshop than we anticipated, and are looking for reviewers willing to review 2-4 papers between May 24 and June 7. If you are interested, please self-nominate! Thank you 🙏 docs.google.com/forms/d/e/1FAI…

Mor Geva (@megamor2)

Going to #icml2025? Don't miss the Actionable Interpretability Workshop (@ActInterp)! We've got an amazing lineup of speakers, panelists, and papers, all focused on leveraging insights from interpretability research to tackle practical, real-world problems ✨

Actionable Interpretability Workshop ICML2025 (@actinterp)

🚨 The Actionable Interpretability Workshop is happening tomorrow at ICML!
Join us for an exciting lineup of speakers, nearly 70 posters, and a great panel discussion 🙌
Don’t miss it! 🔍⚙️
@icmlconf @ActInterp
