
Simon Shaolei Du
@simonshaoleidu
Assistant Professor @uwcse. Postdoc @the_IAS. PhD in machine learning @mldcmu.
ID: 913981622193664000
http://simonshaoleidu.com 30-09-2017 04:19:34
497 Tweet
7,7K Followers
2,2K Following




Excited to share our work led by Yiping Wang RLVR with only ONE training example can boost 37% accuracy on MATH500.

So excited to announce our work was accepted as a Spotlight paper to ICML Conference !!! I'm looking forward to presenting our work there this summer and CogSci Society! Big thank you again to my collaborators Wilka Carvalho Yancheng Liang Simon Shaolei Du Max Kleiman-Weiner Natasha Jaques




PPO vs. DPO? 🤔 Our new paper proves that it depends on whether your models can represent the optimal policy and/or reward. Paper: arxiv.org/abs/2505.19770 Led by Ruizhe Shi Minhak Song

Congratulations to University of Washington #UWAllen Ph.D. grads Ashish Sharma & Sewon Min, Association for Computing Machinery Doctoral Dissertation Award honorees! Sharma won for #AI tools for mental health; Min received honorable mention for efficient, flexible language models. #ThisIsUW news.cs.washington.edu/2025/06/04/all…

Oral ICML Conference !!! Can't wait to share our work and hear the community's thoughts on it, should be a fun talk! Can't thank my collaborators enough: Wilka Carvalho Yancheng Liang Simon Shaolei Du Max Kleiman-Weiner Natasha Jaques



🚨 Code is live! Check out LoRe – a modular, lightweight codebase for personalized reward modeling from user preferences. 📦 Few-shot personalization 📊 Benchmarks: TLDR, PRISM, PersonalLLM 👉 github.com/facebookresear… Huge thanks to AI at Meta for open-sourcing this research 🙌