
Summer Yue
@summeryue0
VP of Research at Scale AI. Prev: RLHF lead on Bard, researcher at Google DeepMind / Brain (LaMDA, RL/TF-Agents, superhuman chip design). Opinions my own.
ID: 2726658913
12-08-2014 16:33:03
103 Tweet
2,2K Followers
331 Following



On the heels of Humanity's Last Exam, Scale AI & Center for AI Safety have released a new very-hard reasoning eval: EnigmaEval: 1,184 multimodal puzzles so hard they take groups of humans many hours to days to solve. All top models score 0 on the Hard set, and <10% on the Normal set š§µ



If a model lies when pressuredāitās not ready for AGI. The new MASK leaderboard is live. Built on the private split of our open-source honesty benchmark (w/ Center for AI Safety), it tests whether models lie under pressureāeven when they know better. š Leaderboard:




š SEAL and Red Team at @scale_ai present a position paper outlining what weāve learned from red teaming LLMs so farāwhat matters, whatās missing, and how model safety fits into broader system safety and monitoring. š scale.com/research/red_t⦠š scale.com/blog/rethink-rā¦