Xinyu Yang (@xinyu2ml) 's Twitter Profile
Xinyu Yang

@xinyu2ml

Ph.D. @CarnegieMellon. Working on principled algorithm & system co-design for scalable and generalizable foundation models. he/they. A fan of TileLang!!!

ID: 1601134489161400321

Website: https://xinyuyang.me/ · Joined: 09-12-2022 08:40:04

146 Tweets

530 Followers

606 Following

Huaxiu Yao✈️ICLR 2025🇸🇬 (@huaxiuyaoml) 's Twitter Profile Photo

❗️Self-evolution is quietly pushing LLM agents off the rails. ⚠️ Even agents that are perfectly aligned at deployment can gradually forget human alignment and shift toward self-serving strategies. Over time, LLM agents stop following values, imitate bad strategies, and even spread misaligned

Shizhe Diao (@shizhediao) 's Twitter Profile Photo

✨ We’re hiring interns at NVIDIA Research! Our team works on efficient agentic systems, new model architectures, multi-modal models and post-training optimization. If interested, please send your CV to [email protected] 🚀 #hiring #internship

Stanford NLP Group (@stanfordnlp) 's Twitter Profile Photo

Hi everyone! This Thursday, we will host the second NLP Seminar of the year! For this week's seminar, we are excited to host Tianyu Gao (@gaotianyu1350) from OpenAI and UC San Diego (UCSD)! If you are interested in attending remotely, here is the Zoom link:

VraserX e/acc (@vraserx) 's Twitter Profile Photo

A 7 million parameter model from Samsung just outperformed DeepSeek-R1, Gemini 2.5 Pro, and o3-mini on reasoning benchmarks like ARC-AGI.

Let that sink in.
It’s 10,000x smaller yet smarter.

The secret is recursion.
Instead of brute-forcing answers like giant LLMs, it drafts a
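
For readers who want a concrete feel for the draft-and-refine recursion being described, here is a deliberately tiny, hypothetical PyTorch sketch; the class, shapes, and step count are made up for illustration and are not the Samsung model's actual architecture or code.

```python
import torch
import torch.nn as nn

class TinyRecursiveReasoner(nn.Module):
    """Illustrative only: one small block applied repeatedly to refine its own draft."""

    def __init__(self, dim: int = 128):
        super().__init__()
        # A single shared refinement block; "recursion" means reusing it every step.
        self.refine = nn.Sequential(
            nn.Linear(2 * dim, dim), nn.GELU(), nn.Linear(dim, dim)
        )

    def forward(self, problem: torch.Tensor, steps: int = 8) -> torch.Tensor:
        answer = torch.zeros_like(problem)          # start from a blank draft
        for _ in range(steps):
            # Each pass looks at the problem plus the current draft and improves it.
            answer = answer + self.refine(torch.cat([problem, answer], dim=-1))
        return answer

x = torch.randn(4, 128)                             # toy "problem" embeddings
print(TinyRecursiveReasoner()(x, steps=8).shape)    # torch.Size([4, 128])
```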
Andrew Campbell (@andrewc_ml) 's Twitter Profile Photo

Very excited to share our preprint: Self-Speculative Masked Diffusions

We speed up sampling of masked diffusion models by ~2x by using speculative sampling and a hybrid non-causal / causal transformer

arxiv.org/abs/2510.03929

w/ Valentin De Bortoli, Jiaxin Shi, Arnaud Doucet
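
The paper's hybrid non-causal/causal scheme is in the arXiv link above. As background only, the generic speculative-sampling accept/reject rule that such speedups build on can be sketched as follows; all names and numbers here are illustrative, not taken from the authors' implementation.

```python
import numpy as np

rng = np.random.default_rng(0)

def speculative_accept(p_draft: np.ndarray, p_target: np.ndarray, drafted: int) -> int:
    """Generic speculative-sampling rule: keep the cheap draft's token when the
    target model agrees, otherwise resample from the residual distribution."""
    if rng.random() < min(1.0, p_target[drafted] / p_draft[drafted]):
        return drafted                                  # draft accepted
    residual = np.maximum(p_target - p_draft, 0.0)      # rejection: correct the bias
    residual /= residual.sum()
    return rng.choice(len(p_target), p=residual)

# Toy example over a 5-token vocabulary at one masked position.
p_draft  = np.array([0.50, 0.20, 0.15, 0.10, 0.05])
p_target = np.array([0.30, 0.30, 0.20, 0.15, 0.05])
drafted = rng.choice(5, p=p_draft)
print(speculative_accept(p_draft, p_target, drafted))
```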
Jiawei Zhao (@jiawzhao) 's Twitter Profile Photo

We’ve always assumed stale and off-policy data hurts RL a lot — but our latest work shows the opposite. 🧠 M2PO (Second-Moment Trust Policy Optimization) reveals that even data stale by 256 model updates can train LLMs as effectively as on-policy RL, unlocking scalable and
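
M2PO's actual objective is defined in the linked work. Purely as a generic illustration of why stale data is normally considered risky, the sketch below computes per-token importance ratios between the current policy and a stale behavior policy, plus their second moment; the function and numbers are made up for the example and are not M2PO itself.

```python
import torch

def importance_ratios(logp_current: torch.Tensor, logp_stale: torch.Tensor) -> torch.Tensor:
    """Per-token ratio pi_current(a|s) / pi_stale(a|s) for data collected by an
    older checkpoint; a wide spread here is what makes stale data risky."""
    return torch.exp(logp_current - logp_stale)

# Toy log-probs: the behavior policy is many updates old, so ratios drift from 1.
logp_stale   = torch.log(torch.tensor([0.40, 0.25, 0.20, 0.15]))
logp_current = torch.log(torch.tensor([0.10, 0.45, 0.30, 0.15]))
r = importance_ratios(logp_current, logp_stale)
print(r)                 # spread around 1.0
print((r ** 2).mean())   # second moment of the ratios (the kind of statistic a
                         # second-moment trust region would constrain)
```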

Wenhao Yu (@wyu_nd) 's Twitter Profile Photo

Code for 𝐏𝐚𝐫𝐚𝐥𝐥𝐞𝐥-𝐑𝟏 is live! 👉 github.com/zhengkid/Paral…
(now 189 stars and climbing 🔥)

It lets LLMs think in parallel — multiple reasoning paths, smarter synthesis, more creative inference!

Miss this paper and you’re missing a leap forward: arxiv.org/abs/2509.07980
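
Parallel-R1 itself is a reinforcement-learning framework (see the GitHub link for the real code). As a much simpler stand-in for the idea of exploring multiple reasoning paths and then synthesizing them, here is a self-consistency-style sketch; sample_reasoning_path is a hypothetical stub, not part of the Parallel-R1 codebase.

```python
import random
from collections import Counter

def sample_reasoning_path(question: str, seed: int) -> str:
    """Hypothetical stand-in for one sampled chain of thought; a real setup
    would call an LLM with temperature > 0 here."""
    random.seed(seed)
    return random.choice(["42", "42", "41"])   # toy distribution of final answers

def parallel_reason(question: str, n_paths: int = 8) -> str:
    # Explore several reasoning paths in parallel, then synthesize by majority vote.
    answers = [sample_reasoning_path(question, seed=i) for i in range(n_paths)]
    return Counter(answers).most_common(1)[0][0]

print(parallel_reason("What is 6 * 7?"))
```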
Jonas Geiping (@jonasgeiping) 's Twitter Profile Photo

What determines how easy it is to quantize an LLM after training? 

Thanks to a number of recent open-source training trajectories, we were able to show much more directly how training hyperparameters modulate quantization errors, for good and bad. More details below:
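
To make "quantization error" concrete, here is a generic round-to-nearest 4-bit quantization of a weight matrix and its relative error; this is a simplified illustration, not the measurement pipeline from the linked study.

```python
import torch

def rtn_quantize(w: torch.Tensor, bits: int = 4) -> torch.Tensor:
    """Per-row symmetric round-to-nearest quantization (a common PTQ baseline)."""
    qmax = 2 ** (bits - 1) - 1
    scale = w.abs().amax(dim=1, keepdim=True) / qmax     # one scale per output row
    return torch.round(w / scale).clamp(-qmax - 1, qmax) * scale

w = torch.randn(256, 256)                    # toy "trained" weight matrix
w_q = rtn_quantize(w, bits=4)
rel_err = (w - w_q).norm() / w.norm()        # relative quantization error
print(f"relative error at 4 bits: {rel_err:.4f}")
```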
Tong Zheng (@zhengtoong) 's Twitter Profile Photo

Parallel Reasoning has entered the AI mainstream. Inspired by works like Gemini 2.5 Pro, APR (Jiayi Pan, Xiuyu Li, Long Lian), and Multiverse (Xinyu Yang), our Parallel-R1 establishes the first reinforcement-learning framework that moves this paradigm beyond synthetic tasks,

Kangwook Lee (@kangwook_lee) 's Twitter Profile Photo

DLLMs seem promising... but parallel generation is not always possible

Diffusion-based LLMs can generate many tokens at different positions at once, while most autoregressive LLMs generate tokens one by one.

This makes diffusion-based LLMs highly attractive when we need fast
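
As a toy illustration of the contrast (not any specific diffusion LLM), the sketch below fills a masked sequence by committing the most confident positions in parallel at each step, versus one token per step autoregressively; the confidence scores are random stand-ins for real model logits.

```python
import numpy as np

rng = np.random.default_rng(0)
MASK, LENGTH, VOCAB = -1, 12, 50

def diffusion_style_decode(tokens_per_step: int = 4) -> int:
    """MaskGIT-style parallel unmasking: several positions are committed per step."""
    seq = np.full(LENGTH, MASK)
    steps = 0
    while (seq == MASK).any():
        conf = rng.random(LENGTH)                       # stand-in for model confidence
        conf[seq != MASK] = -np.inf                     # only score still-masked slots
        pick = np.argsort(conf)[-tokens_per_step:]      # k most confident positions
        seq[pick] = rng.integers(0, VOCAB, size=len(pick))
        steps += 1
    return steps

print("parallel decode steps:", diffusion_style_decode())   # LENGTH / k = 3 steps
print("autoregressive steps: ", LENGTH)                      # one token per step
```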
Yufan Zhuang (@yufan_zhuang) 's Twitter Profile Photo

Can LLMs reason beyond context limits? 🤔 

Introducing Knowledge Flow, a training-free method that helped gpt-oss-120b & Qwen3-235B achieve 100% on the AIME-25, no tools.

How? Like human deliberation, for LLMs.

📝 Blog: yufanzhuang.notion.site/knowledge-flow
💻 Code: github.com/EvanZhuang/kno…
Shanli Xing (@0xsling0) 's Twitter Profile Photo

🤔 Can AI optimize the systems it runs on?

🚀 Introducing FlashInfer-Bench, a workflow that makes AI systems self-improving with agents:

- Standardized signature for LLM serving kernels
- Implement kernels with your preferred language
- Benchmark them against real-world serving
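
The real FlashInfer-Bench signatures live in the project itself and are not reproduced here. Purely as a generic picture of what benchmarking a serving kernel involves, here is a minimal CUDA-event timing harness in PyTorch; the kernel being timed (scaled_dot_product_attention) is just an example workload.

```python
import torch

def bench_kernel(fn, *args, warmup: int = 10, iters: int = 100) -> float:
    """Generic GPU micro-benchmark: warm up, then time with CUDA events (ms/iter)."""
    for _ in range(warmup):
        fn(*args)
    start = torch.cuda.Event(enable_timing=True)
    end = torch.cuda.Event(enable_timing=True)
    torch.cuda.synchronize()
    start.record()
    for _ in range(iters):
        fn(*args)
    end.record()
    torch.cuda.synchronize()
    return start.elapsed_time(end) / iters

if torch.cuda.is_available():
    q = torch.randn(8, 32, 1024, 128, device="cuda", dtype=torch.float16)
    k = torch.randn(8, 32, 1024, 128, device="cuda", dtype=torch.float16)
    v = torch.randn(8, 32, 1024, 128, device="cuda", dtype=torch.float16)
    ms = bench_kernel(torch.nn.functional.scaled_dot_product_attention, q, k, v)
    print(f"attention kernel: {ms:.3f} ms/iter")
```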
Xinyu Yang (@xinyu2ml) 's Twitter Profile Photo

🏆Honored to share that LLM.265 (dl.acm.org/doi/10.1145/37…) received the Best Paper Award at MICRO 2025!   

🥳Huge thanks to the whole team!

😅Accidentally deleted the original tweet—posting it again