
Chuanyang Jin
@chuanyang_jin
PhD student @JohnsHopkins | prev @MITCoCoSci & @MIT_CSAIL & @CILVRatNYU
ID: 1544706063471108098
http://chuanyangjin.com 06-07-2022 15:33:33
76 Tweet
406 Followers
359 Following




📊Summary of updates on the MMToM-QA leaderboard: chuanyangjin.com/mmtom-qa-leade… - Recent LLMs with inference-time scaling (e.g., o3-mini) have significantly improved ToM performance but still fall short of human levels. Notably, they excel in belief questions but score below random on





