Alex Tamkin (@alextamkin)'s Twitter Profile
Alex Tamkin

@alextamkin

machine learning, science & society @AnthropicAI | recently: Clio, Anthropic Economic Index, Claude Artifacts | prev: phd @StanfordAILab, @stanfordnlp

ID: 846245360

Link: http://alextamkin.com | Joined: 25-09-2012 21:18:59

968 Tweets

5.5K Followers

1.1K Following

Jennifer Martinez (@jenmartinez)

Fresh off the presses from Alex Tamkin! v2 of the Anthropic Economic Index is out, with insights from Claude Sonnet 3.7 and its extended thinking feature. Hat tip: Techmeme.

Scale AI (@scale_ai)

New SEAL leaderboard in partnership with Center for AI Safety just dropped. Introducing MASK, a consistency-based benchmark designed to measure honesty in language models. Anthropic sweeps.

Saffron Huang (@saffronhuang)

Really proud and excited to release this work on empirically measuring AI values “in the wild” — understanding, analyzing and taxonomizing what values guide model outputs in real interactions with real users. There is a lot of work on training models to follow particular

Esin Durmus (@esindurmusnlp)

Excited to share our new research with Saffron Huang on studying Claude's values in real-world conversations! We've created a comprehensive taxonomy of AI-expressed values based on interactions in the wild.

Jerry Hong (@jhoer100)

it's been very fun side questing to design figures for the societal impacts papers — clio, economic index, now AI values in the wild! they're doing some stellarrrrrr work

Jack Clark (@jackclarksf)

Will be in DC on Tuesday to discuss the Anthropic Economic Index and how AI is impacting the economy, with Congresswoman Valerie Foushee opening the event. Please RSVP here - see you there!

Alex Imas (@alexolegimas)

🚨New paper (link in reply)🚨 Are we underestimating AI use in self-report surveys? YES, by as much as 30 percentage pts. We find 60% self-reported vs. truth closer to ~90% (!) Why? Social desirability bias, people embarrassed/worried to admit AI use, so they underreport.

sam mcallister (@sammcallister)

Spent the last few weeks working closely with @rickrubin on his latest project: The Way of Code. Incredibly proud of our tiny team for helping bring Rick's vision to life and creating such a beautiful experience with Claude to augment it.

Yijia Shao (@echoshao8899)

🚨 70 million US workers are about to face their biggest workplace transition due to AI agents. But nobody asks them what they want. While AI races to automate everything, we took a different approach: auditing what workers want vs. what AI can do across the US workforce.🧵

Anthropic (@anthropicai)

New Anthropic Research: How people use Claude for emotional support. From millions of anonymized conversations, we studied how adults use AI for emotional and personal needs—from navigating loneliness and relationships to asking existential questions.

Helen Toner (@hlntnr)

Spearphishing PSA—looks like there's a concerted attack on AI safety/governance folks going around. Be wary of calendar links via DM, and *never* give a 2-factor auth code over the phone. I almost got caught by this—got a phone call last week, but figured out it was sus. 🧵