Xinyang (Young) Geng
@younggeng
Research scientist at Google DeepMind. Opinions are my own.
ID: 2362406610
http://young-geng.xyz/ 26-02-2014 09:17:53
66 Tweet
1,1K Followers
513 Following
Appreciate Aidan McLaughlin looking into the thinking model results. Originally scores looked weak as the response was plucked from the thought content versus output. We are looking into ways of making thinking output less confusing for people running evals. This is why we 🚢, to