
Martin Tutek
@mtutek
Postdoc @ Technion | previously postdoc @ UKP Lab, TU Darmstadt | PhD @ TakeLab, UniZG | Working on interpretability & safety of LLMs.
ID: 4075234643
http://mttk.github.io 30-10-2015 12:50:57
428 Tweet
436 Followers
798 Following


1/14 🎉 Excited to announce that our paper, "DEPTH: Discourse Education through Pre-Training Hierarchically", has been accepted to #Rep4NLP at #NAACL2025!!! Joint work with Ofek Glick , Chaim Baskin and Yonatan Belinkov


🎉 Our Actionable Interpretability workshop has been accepted to #ICML2025! 🎉 >> Follow Actionable Interpretability Workshop ICML2025 Tal Haklay Anja Reusch Marius Mosbach Sarah Wiegreffe Ian Tenney (@[email protected]) Mor Geva Paper submission deadline: May 9th!


Check out our new work on how information flows in text-to-image models! Turns out, the text encoder isn’t doing what you’d expect — and that has real consequences for model performance and errors. For a deeper dive, see Guy Kaplan’s post. Paper link is in the first comment!





🚨New paper at #ACL2025 Findings! REVS: Unlearning Sensitive Information in LMs via Rank Editing in the Vocabulary Space. LMs memorize and leak sensitive data—emails, SSNs, URLs from their training. We propose a surgical method to unlearn it. 🧵👇w/Yonatan Belinkov Martin Tutek 1/8




Delighted that ✨Mor Geva (Mor Geva) and ✨Anna Ivanova (Anna Ivanova) will complete our speaker lineup and talk about the INTERPLAY of model internals and behavior. Be there and submit by June 30th 📄 shorturl.at/sBomu See you in 🇨🇦 Conference on Language Modeling #nlproc #interpretability

