Chris Olah (@ch402)'s Twitter Profile
Chris Olah

@ch402

Reverse engineering neural networks at @AnthropicAI. Previously @distillpub, OpenAI Clarity Team, Google Brain. Personal account.

ID: 153196789

Link: http://colah.github.io
Joined: 07-06-2010 23:08:04

5.5K Tweets

112.112K Followers

178 Following

Michael Nielsen (@michael_nielsen)

My three-sentence summary of Lakatos's "Proofs and Refutations", with apologies to Don Knuth: "Premature definition is the root of much conceptual evil. Good definitions arise out of a back-and-forth interplay between rough definitions and powerful insight-giving arguments.

Sam Bowman (@sleepinyourhat)

🧵✨🙏 With the new Claude Opus 4, we conducted what I think is by far the most thorough pre-launch alignment assessment to date, aimed at understanding its values, goals, and propensities. Preparing it was a wild ride. Here’s some of what we learned. 🙏✨🧵

Anthropic (@anthropicai)

Our interpretability team recently released research that traced the thoughts of a large language model. Now we’re open-sourcing the method. Researchers can generate “attribution graphs” like those in our study, and explore them interactively.

Michael Hanna (@michaelwhanna)

Mateusz and I are excited to announce circuit-tracer, a library that makes circuit-finding simple! Just type in a sentence, and get out a circuit showing (some of) the features your model uses to predict the next token. Try it on neuronpedia: shorturl.at/SUX2A

Barack Obama (@barackobama)

At a time when people are understandably focused on the daily chaos in Washington, these articles describe the rapidly accelerating impact that AI is going to have on jobs, the economy, and how we live. axios.com/2025/05/28/ai-…