Bob West (@cervisiarius) 's Twitter Profile
Bob West

@cervisiarius

Associate Professor at EPFL, Data Science Lab (dlab)

ID: 193115103

linkhttp://dlab.epfl.ch/people/west calendar_today21-09-2010 00:38:50

281 Tweet

1,1K Followers

115 Following

Chris Wendler (@wendlerch) 's Twitter Profile Photo

Very grateful to have received an outstanding paper award for our “Do Llamas Work in English?” paper. This paper would not have been possible without 4️⃣ co-firstauthors. Great job everyone! The energetic young upcoming PhD students Venia and Giovanni and the legendary Prof. West!

Very grateful to have received an outstanding paper award for our “Do Llamas Work in English?” paper. This paper would not have been possible without 4️⃣ co-firstauthors. Great job everyone! The energetic young upcoming PhD students Venia and Giovanni and the legendary Prof. West!
Bob West (@cervisiarius) 's Twitter Profile Photo

Exciting 4-yr PhD position pushing the boundaries of LLMs for engineering at the Helmholtz-Zentrum Hereon / TU Hamburg, supervised by the amazing Roland Aydin & co-supervised by me. Applications due on 1 Sep! tinyurl.com/22aawphh #PhDPosition #ML #Engineering #LLM #Hamburg

Niklas Stoehr (@niklas_stoehr) 's Twitter Profile Photo

Our new mechanistic interpretability work "Activation Scaling for Steering and Interpreting Language Models" was accepted into Findings of EMNLP 2024! 🔴🔵 📄arxiv.org/pdf/2410.04962 Kevin Du, Vésteinn Snæbjarnarson, Bob West, Ryan Cotterell and Aaron Schein thread 👇

Our new mechanistic interpretability work "Activation Scaling for Steering and Interpreting Language Models" was accepted into Findings of EMNLP 2024! 🔴🔵

📄arxiv.org/pdf/2410.04962

<a href="/kevdududu/">Kevin Du</a>, <a href="/vesteinns/">Vésteinn Snæbjarnarson</a>, <a href="/cervisiarius/">Bob West</a>, Ryan Cotterell and <a href="/AaronSchein/">Aaron Schein</a>

thread 👇
Giuseppe (Peppe) Russo (@russogiusep) 's Twitter Profile Photo

Our paper "The AI Review Lottery: Widespread AI-Assisted Peer Reviews Boost Paper Scores and Acceptance Rates" got covered by the The Chronicle of Higher Education. Exciting times for peer-reviewing ICLR 2026 chronicle.com/article/ai-sci…

Akhil Arora (@akhilarora.bsky.social) (@aroraakhilcs) 's Twitter Profile Photo

Disappointed to not be at #EMNLP owing to a dislocated shoulder 😢 Debjit Paul will present our poster on Multilingual Entity Insertion (cf. arxiv.org/pdf/2410.04254). Swing by our poster in session #6 on Wed 13@10:30 EST 🚀 PS: I am hiring PhD students Computer Science at Aarhus University #LLM #GNNs #CSS

Disappointed to not be at #EMNLP owing to a dislocated shoulder 😢
<a href="/DebjitPaul2/">Debjit Paul</a>  will present our poster on Multilingual Entity Insertion (cf. arxiv.org/pdf/2410.04254). Swing by our poster in session #6 on Wed 13@10:30 EST

🚀 PS: I am hiring PhD students <a href="/csaudk/">Computer Science at Aarhus University</a> 
#LLM #GNNs #CSS
Julian Minder (@jkminder) 's Twitter Profile Photo

Can we understand and control how language models balance context and prior knowledge? Our latest paper shows it’s all about a 1D knob! 🎛️ arxiv.org/abs/2411.07404 Co-led with Kevin Du, as well as Niklas Stoehr, Giovanni Monea, Chris Wendler, Bob West & Ryan Cotterell.

Communications of the ACM (@cacmmag) 's Twitter Profile Photo

"The AI Alignment Paradox," by Bob West (EPFL) and Roland Aydin, says that aligning #AIModels with our values makes it easier for adversaries to misalign them. bit.ly/42E8CmY

"The AI Alignment Paradox," by <a href="/cervisiarius/">Bob West</a> (<a href="/EPFL_en/">EPFL</a>) and Roland Aydin, says that aligning #AIModels with our values makes it easier for adversaries to misalign them.  bit.ly/42E8CmY
Bob West (@cervisiarius) 's Twitter Profile Photo

New CACM opinion piece written together with Roland Aydin about what we call the “AI Alignment Paradox”: virtuous AI may be more easily made vicious. Possibly this might be a reason for an intriguing recent result by Owain Evans et al.: x.com/OwainEvans_UK/…

Akhil Arora (@akhilarora.bsky.social) (@aroraakhilcs) 's Twitter Profile Photo

I am recruiting 2 PhD students for Fall'25 Computer Science at Aarhus University to work on bleeding-edge topics in #NLProc #LLMs #AIAgents (e.g. LLM reasoning, knowledge-seeking agents, and more). Details: cs.au.dk/~clan/openings Deadline: May 1, 2025 Please boost! cc: WikiResearch Pioneer Centre for AI SODAS, Copenhagen (Bluesky: @cphsodas.bsky.social)

Veniamin Veselovsky (@vminvsky) 's Twitter Profile Photo

New paper: Language models have “universal” concept representation – but can they capture cultural nuance? 🌏 If someone from Japan asks an LLM what color a pumpkin is, will it correctly say green (as they are in Japan)? Or does cultural nuance require more than just language?

New paper: Language models have “universal” concept representation – but can they capture cultural nuance? 🌏

If someone from Japan asks an LLM what color a pumpkin is, will it correctly say green (as they are in Japan)?

Or does cultural nuance require more than just language?
Chris Wendler (@wendlerch) 's Twitter Profile Photo

How do diffusion models create images and can we control that process? We are excited to release a update to our SDXL Turbo sparse autoencoder paper. New title: One Step is Enough: Sparse Autoencoders for Text-to-Image Diffusion Models Spoiler: We have FLUX SAEs now :)

Clément Dumas (at ICLR) (@butanium_) 's Twitter Profile Photo

This work got accepted to ACL 2025 main! 🎉 In this updated version, we extended our results to several models and showed they can actually generate good definitions of mean concept representations across languages.🧵