Bob West (@cervisiarius) Twitter Tweets • TwiCopy

Chris Wendler

a year ago

Very grateful to have received an outstanding paper award for our “Do Llamas Work in English?” paper. This paper would not have been possible without 4️⃣ co-first authors. Great job everyone! The energetic young upcoming PhD students Venia and Giovanni and the legendary Prof. West!

Bob West

@cervisiarius

a year ago

Exciting 4-yr PhD position pushing the boundaries of LLMs for engineering at the Helmholtz-Zentrum Hereon / TU Hamburg, supervised by the amazing Roland Aydin & co-supervised by me. Applications due on 1 Sep! tinyurl.com/22aawphh #PhDPosition #ML #Engineering #LLM #Hamburg

Tim Davidson @ICLR25

@im_td

a year ago

shout out to my amazing collaborators: Viacheslav Surkov, Veniamin Veselovsky, Giuseppe (Peppe) Russo, Bob West, and Caglar Gulcehre

paper link: arxiv.org/abs/2407.06946

Niklas Stoehr

@niklas_stoehr

a year ago

Our new mechanistic interpretability work "Activation Scaling for Steering and Interpreting Language Models" was accepted into Findings of EMNLP 2024! 🔴🔵 📄arxiv.org/pdf/2410.04962 Kevin Du, Vésteinn Snæbjarnarson, Bob West, Ryan Cotterell and Aaron Schein thread 👇

Giuseppe (Peppe) Russo

@russogiusep

a year ago

Our paper "The AI Review Lottery: Widespread AI-Assisted Peer Reviews Boost Paper Scores and Acceptance Rates" got covered by The Chronicle of Higher Education. Exciting times for peer-reviewing ICLR 2026 chronicle.com/article/ai-sci…

Viacheslav Surkov

@viasurkov

a year ago

Excited to share our latest breakthrough! We trained sparse autoencoders to decompose intermediate results of SDXL Turbo's forward pass. These autoencoders learn highly interpretable features that can be used to manipulate the image generation process. arxiv.org/abs/2410.22366

Akhil Arora (@akhilarora.bsky.social)

@aroraakhilcs

a year ago

Disappointed to not be at #EMNLP owing to a dislocated shoulder 😢
Debjit Paul will present our poster on Multilingual Entity Insertion (cf. arxiv.org/pdf/2410.04254). Swing by our poster in session #6 on Wed 13@10:30 EST

🚀 PS: I am hiring PhD students Computer Science at Aarhus University
#LLM #GNNs #CSS

Julian Minder

@jkminder

a year ago

Can we understand and control how language models balance context and prior knowledge? Our latest paper shows it’s all about a 1D knob! 🎛️ arxiv.org/abs/2411.07404 Co-led with Kevin Du, as well as Niklas Stoehr, Giovanni Monea, Chris Wendler, Bob West & Ryan Cotterell.

Bob West

@cervisiarius

a year ago

I love this paper, one of my recent favorites — not just because of the amazing animated llama in Julian Minder’s tweet :)

Bob West

@cervisiarius

a year ago

New work with Kristina Gligorić, Eric Horvitz, Emre Kıcıman, Arnaud Chiolero, and Ryen White on food choice mimicry at EPFL

Communications of the ACM

@cacmmag

9 months ago

"The AI Alignment Paradox," by Bob West (EPFL) and Roland Aydin, says that aligning #AIModels with our values makes it easier for adversaries to misalign them. bit.ly/42E8CmY

Tzu-Sheng Kuo 郭子生

@tzushengkuo

9 months ago

Excited to announce the #WikiNLP workshop at ACL 2025!

We welcome #NLP contributions to Wikimedia, especially on datasets, and ideas to advance its mission.

w/ Isaac Johnson (Wikimedia Foundation), Akhil Arora @ICML’25 🇨🇦, Lucie-Aimée Kaffee, Tiziano Piccardi, Indira Sen

See: w.wiki/CumQ

Bob West

@cervisiarius

8 months ago

New CACM opinion piece written together with Roland Aydin about what we call the “AI Alignment Paradox”: virtuous AI may be more easily made vicious. Possibly this might be a reason for an intriguing recent result by Owain Evans et al.: x.com/OwainEvans_UK/…

Akhil Arora (@akhilarora.bsky.social)

@aroraakhilcs

8 months ago

I am recruiting 2 PhD students for Fall'25 Computer Science at Aarhus University to work on bleeding-edge topics in #NLProc #LLMs #AIAgents (e.g. LLM reasoning, knowledge-seeking agents, and more). Details: cs.au.dk/~clan/openings Deadline: May 1, 2025 Please boost! cc: WikiResearch Pioneer Centre for AI SODAS, Copenhagen (Bluesky: @cphsodas.bsky.social)

Veniamin Veselovsky

@vminvsky

7 months ago

New paper: Language models have “universal” concept representation – but can they capture cultural nuance? 🌏 If someone from Japan asks an LLM what color a pumpkin is, will it correctly say green (as they are in Japan)? Or does cultural nuance require more than just language?

Chris Wendler

@wendlerch

5 months ago

How do diffusion models create images and can we control that process? We are excited to release an update to our SDXL Turbo sparse autoencoder paper. New title: "One Step is Enough: Sparse Autoencoders for Text-to-Image Diffusion Models". Spoiler: We have FLUX SAEs now :)

Clément Dumas (at ICLR)

@butanium_

4 months ago

This work got accepted to ACL 2025 main! 🎉 In this updated version, we extended our results to several models and showed they can actually generate good definitions of mean concept representations across languages.🧵

Bob West

@cervisiarius

4 months ago

Very exciting work out of my lab at EPFL Computer and Communication Sciences, led by Saibo-Creator — zip2zip, a method for allowing LLMs to work directly in compressed token space.
