Yuandong Tian (@tydsh) Twitter Tweets • TwiCopy

Yuandong Tian

@tydsh

Research Scientist Director in Meta GenAI (previously in FAIR). Doing reasoning. Novelist in spare time. PhD in @CMU_Robotics.

ID: 97939183

Link: http://yuandong-tian.com · Joined: 19-12-2009 17:18:23

962 Tweets

23,23K Followers

852 Following

Yuandong Tian

@tydsh

9 months ago

Great to give a talk at #CPAL2025 at Stanford yesterday and glad to see so much interest! While I am in a big company, I still firmly believe that later there should be a giant leap in both architecture and algorithms in order to reach human-level data efficiency in learning the

Likes: 24 · Replies: 1 · Reposts: 0

Yuandong Tian

@tydsh

9 months ago

Oh wow. This is the right side of history!

Likes: 12 · Replies: 0 · Reposts: 0

Yuandong Tian

@tydsh

8 months ago

Finally Llama4 is out~ Enjoy!

Likes: 59 · Replies: 2 · Reposts: 0

Yuandong Tian

@tydsh

8 months ago

Apr. 26-28 in Singapore for ICLR'25. We have a few papers accepted and I will also give 3 invited workshop talks! See u all.

Likes: 17 · Replies: 0 · Reposts: 0

Yuandong Tian

@tydsh

8 months ago

📣 This afternoon 3:00-5:30pm SG time, we will present POWER-DL (poster #218), a novel algorithm that addresses the reward hacking problem! Thanks Paria Rashidinejad for the great work! #ICLR2025

Likes: 49 · Replies: 0 · Reposts: 3

Yuandong Tian

@tydsh

8 months ago

Our Token Assorted paper is now accepted in #ICML2025!

Likes: 32 · Replies: 6 · Reposts: 3

Yuandong Tian

@tydsh

8 months ago

After ~1 year, AdvPrompter is finally accepted in #ICML2025!

Likes: 39 · Replies: 7 · Reposts: 5

Yuandong Tian

@tydsh

7 months ago

📢Our GSM-infinite (accepted in #ICML2025) shows weakness in long-context capabilities in existing SoTA reasoning LLMs: for complicated synthetic reasoning tasks, they are not doing that well. Therefore, whenever there is a novel attention mechanism, there may be a hidden price

Likes: 75 · Replies: 10 · Reposts: 13

Yuandong Tian

@tydsh

7 months ago

Check out our ParamΔ (#ICLR2025), which shows that the weight delta between post-trained and pre-trained checkpoints can be directly applied to related models without any training. E.g. 1️⃣ Δ can be transferred across different LLM versions (e.g., llama 3 -> 3.1). 2️⃣ Task-specific Δ

Likes: 94 · Replies: 0 · Reposts: 15

Yuandong Tian

@tydsh

7 months ago

📢 Our travel planner solver (arxiv.org/abs/2410.16456, published in EMNLP Demo Track'24, and arxiv.org/abs/2411.13904) is now open-sourced at github.com/facebookresear… 🚀🚀 In these works, we build an LLM-equipped agent that can take user inputs in natural language, in either the

Likes: 34 · Replies: 1 · Reposts: 7

Yuandong Tian

@tydsh

6 months ago

Great work from my former intern Yi Wu and the team! Fully async RL is great😀 When we reproduced AlphaZero back in 2018, I found that a fully async distributed RL framework is much more efficient than a synced counterpart: after 1 day of training, the async agent simply wins

Likes: 61 · Replies: 3 · Reposts: 7

Yuandong Tian

@tydsh

6 months ago

📢We show that continuous latent reasoning has a theoretical advantage over discrete token reasoning (arxiv.org/abs/2505.12514): For a graph with n vertices and graph diameter D, a two-layer transformer with D steps of continuous CoTs can solve the directed graph reachability

Likes: 1,1K · Replies: 25 · Reposts: 159

Yuandong Tian

@tydsh

6 months ago

Congrats to our former colleague Ledell Wu! AI Agent is super hot now 😀

Likes: 4 · Replies: 0 · Reposts: 0

Yuandong Tian

@tydsh

6 months ago

Great to see a lot of interest! It takes some time to construct the superpositional encoding correctly, and to make it compatible with popular positional embeddings. So it is not super obvious😁. More interestingly, our experiments show that such superpositional encodings

Likes: 54 · Replies: 1 · Reposts: 6

Yuandong Tian

@tydsh

6 months ago

7/14-7/20 @ ICML

Likes: 26 · Replies: 5 · Reposts: 0

Agentica Project

@agentica_

6 months ago

🚀 Introducing DeepSWE 🤖: our fully open-sourced, SOTA software engineering agent trained purely with RL on top of Qwen3-32B. DeepSWE achieves 59% on SWEBench-Verified with test-time scaling (and 42.2% Pass@1), topping the SWEBench leaderboard for open-weight models. 💪DeepSWE

Likes: 345 · Replies: 15 · Reposts: 65