Chuanyang Jin (@chuanyang_jin) 's Twitter Profile
Chuanyang Jin

@chuanyang_jin

PhD student @JohnsHopkins | prev @MITCoCoSci & @MIT_CSAIL & @CILVRatNYU

ID: 1544706063471108098

linkhttp://chuanyangjin.com calendar_today06-07-2022 15:33:33

76 Tweet

406 Followers

359 Following

Zhijiang Guo (@zhijiangg) 's Twitter Profile Photo

🚀Exciting to see how recent advancements like OpenAI’s O1/O3 & DeepSeek’s R1 are pushing the boundaries! Check out our latest survey on Complex Reasoning with LLMs. Analyzed over 300 papers to explore the progress. Paper: arxiv.org/pdf/2502.17419 Github: github.com/zzli2022/Aweso…

🚀Exciting to see how recent advancements like OpenAI’s O1/O3 & DeepSeek’s R1 are pushing the boundaries! 
Check out our latest survey on Complex Reasoning with LLMs. Analyzed over 300 papers to explore the progress.
Paper: arxiv.org/pdf/2502.17419
Github: github.com/zzli2022/Aweso…
Tianmin Shu (@tianminshu) 's Twitter Profile Photo

Very excited to introduce AutoToM, our latest effort toward open-ended machine Theory of Mind. Given any context and ToM question, AutoToM automatically formulates a minimally sufficient probabilistic model to produce confident inference of any target mental variable.

Zhining Zhang (@zhining_zhang03) 's Twitter Profile Photo

Check out our latest work on machine Theory of Mind: #AutoToM ! We propose an approach that (1) combines the open-endedness of LLMs with robustness of Bayesian models; (2) leverages the uncertainties to refine the model, achieving better performance while maintaining low compute.

Chuanyang Jin (@chuanyang_jin) 's Twitter Profile Photo

📊Summary of updates on the MMToM-QA leaderboard: chuanyangjin.com/mmtom-qa-leade… - Recent LLMs with inference-time scaling (e.g., o3-mini) have significantly improved ToM performance but still fall short of human levels. Notably, they excel in belief questions but score below random on

Natasha Jaques (@natashajaques) 's Twitter Profile Photo

Human-AI cooperation is an important problem, but many existing papers focus on training agents in the same 5 fixed Overcooked layouts, and use population-based training (PBT) to try to cover the diversity of human partner strategies. Diving into this problem, we find that

Homanga Bharadhwaj (@mangahomanga) 's Twitter Profile Photo

Check out this exciting workshop on continual learning from humans at RSS 2025 in LA! I am happy to be speaking and will share our works on observational learning through visual imitation of humans.

Jason Weston (@jaseweston) 's Twitter Profile Photo

🚨Announcing RAM 2 workshop @ COLM25 - call for papers🚨 - 10 years on, we present the sequel to the classic RAM🐏 (Reasoning, Attention, Memory) workshop that took place in 2015 at the cusp of major change in the area. Now in 2025 we reflect on what's happened and discuss the

🚨Announcing RAM 2 workshop @ COLM25 - call for papers🚨 
- 10 years on, we present the sequel to the classic RAM🐏 (Reasoning, Attention, Memory) workshop that took place in 2015 at the cusp of major change in the area. Now in 2025 we reflect on what's happened and discuss the
Tianmin Shu (@tianminshu) 's Twitter Profile Photo

🚀 Excited to introduce SimWorld: an embodied simulator for infinite photorealistic world generation 🏙️ populated with diverse agents 🤖 If you are at #CVPR2025, come check out the live demo 👇 Jun 14, 12:00-1:00 pm at JHU booth, ExHall B Jun 15, 10:30 am-12:30 pm, #7, ExHall B

Chuanyang Jin (@chuanyang_jin) 's Twitter Profile Photo

Existing robot-manipulation benchmarks stop at object-level tasks, missing the part-level semantics essential for fine-grained control. Very excited to see PartInstruct, which finally fills this gap with a large-scale dataset for training and evaluating precise, long-horizon,

Chuanyang Jin (@chuanyang_jin) 's Twitter Profile Photo

Welcome to join us tomorrow! 🗓️ June 21 | 8:50 AM – 12:30 PM PT 📍 USC (OHE 132) & Zoom (wse.zoom.us/j/95095685281)