
Arthur Conmy
@arthurconmy
Aspiring 10x reverse engineer @GoogleDeepMind
ID: 1422331230620639233
02-08-2021 22:59:40
455 Tweet
2,2K Followers
1,1K Following









'The key lesson from mechanistic interpretability is that a surprising number of AI behaviors are surprisingly well-described as linear directions in activation space' ~Lewis Smith We'll have more work in this area soon, thanks to Constantin Venhoff and Iván Arcuschin !!