Yuandong Tian (@tydsh) 's Twitter Profile
Yuandong Tian

@tydsh

Research Scientist Director in Meta GenAI (previously in FAIR). Doing reasoning. Novelist in spare time. PhD in @CMU_Robotics.

ID: 97939183

linkhttp://yuandong-tian.com calendar_today19-12-2009 17:18:23

962 Tweet

23,23K Followers

852 Following

Yuandong Tian (@tydsh) 's Twitter Profile Photo

Great to give a talk in #CPAL2025 in Stanford yesterday and glad to see so much interest! While I am in big company, I still firmly believe that later there should be a giant leap in both architecture and algorithms in order to reach human-level data efficiency in learning the

Yuandong Tian (@tydsh) 's Twitter Profile Photo

Apr. 26-28 in Singapore for ICLR'25. We have a few papers accepted and I will also give 3 invited workshop talks! See u all.

Yuandong Tian (@tydsh) 's Twitter Profile Photo

📣 This afternoon 3:00-5:30pm SG time, we will present POWER-DL (poster #218), a novel algorithm that addresses reward hacking problem! Thanks Paria Rashidinejad for the great work! #ICLR2025

Yuandong Tian (@tydsh) 's Twitter Profile Photo

📢Our GSM-infinite (accepted in #ICML2025) shows weakness in long-context capabilities in existing SoTA reasoning LLMs: for complicated synthetic reasoning tasks, they are not doing that well. Therefore, whenever there is a novel attention mechanism, there may be a hidden price

Yuandong Tian (@tydsh) 's Twitter Profile Photo

Check our ParamΔ (#ICLR2025) that shows that the weights delta between post-trained and pre-trained checkpoints can be directly applied to related models without any training. E.g. 1️⃣ Δ can be transferred across different LLM version (e.g., llama 3 -> 3.1). 2️⃣ Task-specific Δ

Yuandong Tian (@tydsh) 's Twitter Profile Photo

📢 Our travel planner solver (arxiv.org/abs/2410.16456, published in EMNLP Demo Track'24, and arxiv.org/abs/2411.13904) is now open sourced in github.com/facebookresear… 🚀🚀 In these works, we build LLM-equipped agent that can take user inputs in natural language, in either the

Yuandong Tian (@tydsh) 's Twitter Profile Photo

Great work from my former intern Yi Wu and the team! Fully async RL is great😀 When we reproduced AlphaZero back in 2018, I found that a fully async distributed RL framework is much more efficient than a synced counterpart: after 1 day of training, the async agent simply wins

Yuandong Tian (@tydsh) 's Twitter Profile Photo

📢We show that continuous latent reasoning has a theoretical advantage over discrete token reasoning (arxiv.org/abs/2505.12514): For a graph with n vertices and graph diameter D, a two-layer transformer with D steps of continuous CoTs can solve the directed graph reachability

Yuandong Tian (@tydsh) 's Twitter Profile Photo

Great to see a lot of interest! It takes some time to construct the superpositional encoding correctly, and to make it compatible with popular positional embeddings. So it is not super obvious😁. More interestingly, our experiments show that such superpositional encodings

Agentica Project (@agentica_) 's Twitter Profile Photo

🚀 Introducing DeepSWE 🤖: our fully open-sourced, SOTA software engineering agent trained purely with RL on top of Qwen3-32B. DeepSWE achieves 59% on SWEBench-Verified with test-time scaling (and 42.2% Pass@1), topping the SWEBench leaderboard for open-weight models. 💪DeepSWE

🚀 Introducing DeepSWE 🤖: our fully open-sourced, SOTA software engineering agent trained purely with RL on top of Qwen3-32B. DeepSWE achieves 59% on SWEBench-Verified with test-time scaling (and 42.2% Pass@1), topping the SWEBench leaderboard for open-weight models.

💪DeepSWE