
Aengus Lynch
@aengus_lynch1
AI safety researcher.
ID: 701424958178795522
http://aenguslynch.com 21-02-2016 15:15:16
113 Tweet
836 Followers
1,1K Following

Visit us at #NeurIPS2024 11am-2pm today! ⚙️Analysing the Generalisation and Reliability of Steering Vectors 🎓Daniel Tan David Chanin, Aengus Lynch Dimitrios Kanoulas Brooks Paige Adrià Garriga-Alonso Robert Kirk neurips.cc/virtual/2024/p…








Interested in test time / inference scaling laws? Then check out our newest preprint!! 📉 How Do Large Language Monkeys Get Their Power (Laws)? 📉 arxiv.org/abs/2502.17578 w/ Joshua Kazdan Sanmi Koyejo Azalia Mirhoseini John Hughes Jordan Juravsky Sara Price Aengus Lynch






LLMs' sycophancy issues are a predictable result of optimizing for user feedback. Even if clear sycophantic behaviors get fixed, AIs' exploits of our cognitive biases may only become more subtle. Grateful our research on this was featured by Nitasha Tiku & The Washington Post!

