Kevin Du (@kevdududu) 's Twitter Profile
Kevin Du

@kevdududu

nlp @ eth zurich

ID: 1678778211252174849

calendar_today11-07-2023 14:48:15

25 Tweet

100 Followers

55 Following

Kevin Du (@kevdududu) 's Twitter Profile Photo

not me joining twitter on its deathbed to self-promote :p shoutout to my wonderful fun collaborators Lucas Torroba-Hennigen Niklas Stoehr Alex Warstadt and @ryandcotterell! and yes check out the vid :D youtu.be/SUTMeVdCuXs

Niklas Stoehr (@niklas_stoehr) 's Twitter Profile Photo

Our new mechanistic interpretability work "Activation Scaling for Steering and Interpreting Language Models" was accepted into Findings of EMNLP 2024! 🔴🔵 📄arxiv.org/pdf/2410.04962 Kevin Du, Vésteinn Snæbjarnarson, Bob West, Ryan Cotterell and Aaron Schein thread 👇

Our new mechanistic interpretability work "Activation Scaling for Steering and Interpreting Language Models" was accepted into Findings of EMNLP 2024! 🔴🔵

📄arxiv.org/pdf/2410.04962

<a href="/kevdududu/">Kevin Du</a>, <a href="/vesteinns/">Vésteinn Snæbjarnarson</a>, <a href="/cervisiarius/">Bob West</a>, Ryan Cotterell and <a href="/AaronSchein/">Aaron Schein</a>

thread 👇
Julian Minder (@jkminder) 's Twitter Profile Photo

Can we understand and control how language models balance context and prior knowledge? Our latest paper shows it’s all about a 1D knob! 🎛️ arxiv.org/abs/2411.07404 Co-led with Kevin Du, as well as Niklas Stoehr, Giovanni Monea, Chris Wendler, Bob West & Ryan Cotterell.