
Alex Tamkin
@alextamkin
machine learning, science & society @AnthropicAI | recently: Clio, Anthropic Economic Index, Claude Artifacts | prev: phd @StanfordAILab, @stanfordnlp
ID: 846245360
http://alextamkin.com 25-09-2012 21:18:59
968 Tweets
5.5K Followers
1.1K Following



New SEAL leaderboard in partnership with Center for AI Safety just dropped. Introducing MASK, a consistency-based benchmark designed to measure honesty in language models. Anthropic sweeps.



Excited to share our new research with Saffron Huang on studying Claude's values in real-world conversations! We've created a comprehensive taxonomy of AI-expressed values based on interactions in the wild.


If you're at ICLR this year, come check out our work and chat with Belinda Li @ ICML!

Will be in DC on Tuesday to discuss the Anthropic Economic Index and how AI is impacting the economy, with Rep. Valerie Foushee opening the event. Please RSVP here - see you there!


Looking forward to joining Erik Brynjolfsson and the Stanford Digital Economy Lab at Stanford on Monday to talk about our work on the Anthropic Economic Index!




