
Vijay Karunamurthy
@vjkaruna
Field CTO @scale_AI . Apple, Google, YouTube.
ID: 4413741
12-04-2007 21:44:28
3,3K Tweet
2,2K Followers
492 Following




Great talking with Summer Yue and Dan Hendrycks about Humanity’s Last Exam, and pushing the frontiers of model evaluation, reasoning and calibration.


🚨 Gemini 2.5 Pro Exp dropped and it's now #1 across SEAL leaderboards: 🥇 Humanity’s Last Exam 🥇 VISTA (multimodal) 🥇 (tie) Tool Use 🥇 (tie) MultiChallenge (multi-turn) 🥉 (tie) Enigma (puzzles) Congrats to Demis Hassabis Sundar Pichai & team! 🔗 scale.com/leaderboard






If a model lies when pressured—it’s not ready for AGI. The new MASK leaderboard is live. Built on the private split of our open-source honesty benchmark (w/ Center for AI Safety), it tests whether models lie under pressure—even when they know better. 📊 Leaderboard:






Incredible getting a tour of the Simons Institute for the Theory of Computing at Berkeley this morning - new candidate for a transmon qubit (in a Cal enclosure!)



Fdr and Exec Director Jamil N. Jaffer spoke on a panel at the World Economic Forum Technology Retreat 2025 titled “AI and the Privatization of Sovereignty.” Mod: Cathy Li of Centre for AI Excellence, WEF Co-Panellists: Eileen Donahoe of Global Digital Policy Incubator; Vijay Karunamurthy of Scale AI;

