Rosie Zhao (@rosieyzh)'s Twitter Profile
Rosie Zhao

@rosieyzh

PhD student with @hseas ML Foundations Group. Previously @mcgillu.

ID: 1189002232366284800

Website: https://rosieyzh.github.io/ · Joined: 29-10-2019 02:13:28

36 Tweets

469 Followers

496 Following

Depen Morwani (@depen_morwani)'s Twitter Profile Photo

Excited to share our recent work at #NeurIPS2023 on the nature of Simplicity Bias (SB) in 1-Hidden-Layer Neural Networks (NNs) with Jatin Batra, Prateek Jain, and Praneeth Netrapalli. SB is known to be one of the reasons behind the brittleness of neural networks under distribution shift (1/5)

Alexandre L.-Piché (@alexpiche_)'s Twitter Profile Photo

Introducing ReSearch: an iterative self-reflection algorithm that enhances LLMs' self-restraint abilities:
• Encouraging abstention when uncertain
• Producing accurate, informative content when confident
Result: a significant accuracy boost for Llama2 7B Chat and Mistral 7B! 🚀
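
For intuition, here is a minimal sketch of the abstain-or-answer behavior the tweet describes, using sampling agreement as a stand-in confidence signal. This is not the ReSearch procedure itself (which relies on iterative self-reflection); `generate`, the threshold, and the agreement metric are illustrative assumptions.

```python
from collections import Counter

def answer_with_restraint(generate, prompt, n=5, threshold=0.6):
    """Abstain when the model's own samples disagree too much.

    Hypothetical sketch: agreement among n samples is used as a
    confidence proxy. NOT the ReSearch algorithm, which instead
    refines answers via iterative self-reflection.
    """
    samples = [generate(prompt) for _ in range(n)]
    best, count = Counter(samples).most_common(1)[0]
    if count / n < threshold:   # uncertain -> abstain
        return "I am not confident enough to answer this."
    return best                 # confident -> answer informatively
```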

Sham Kakade (@shamkakade6)'s Twitter Profile Photo

1/n Introducing SOAP (ShampoO with Adam in the Preconditioner's eigenbasis): A deep learning optimization algorithm that applies Adam in Shampoo's eigenbasis. SOAP outperforms both AdamW and Shampoo in language model pretraining.

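The core idea fits in a few lines. Below is a hedged NumPy sketch of one SOAP-style step for a matrix-shaped gradient: maintain Shampoo's two covariance factors, rotate the gradient into their eigenbasis, run Adam there, and rotate the update back. Bias correction and the real method's periodic (rather than per-step) eigenbasis refresh are omitted; the names and state layout are illustrative, not the authors' implementation.

```python
import numpy as np

def soap_step(G, state, lr=3e-3, shampoo_beta=0.95,
              adam_b1=0.9, adam_b2=0.999, eps=1e-8):
    """One SOAP-style step for a matrix-shaped gradient G (sketch only)."""
    # Shampoo-style second-moment factors: EMAs of G G^T and G^T G.
    state["L"] = shampoo_beta * state["L"] + (1 - shampoo_beta) * G @ G.T
    state["R"] = shampoo_beta * state["R"] + (1 - shampoo_beta) * G.T @ G
    QL = np.linalg.eigh(state["L"])[1]   # eigenbasis of the left factor
    QR = np.linalg.eigh(state["R"])[1]   # eigenbasis of the right factor

    # Rotate the gradient into the preconditioner's eigenbasis.
    G_rot = QL.T @ G @ QR

    # Plain Adam moments, maintained in the rotated coordinates.
    state["m"] = adam_b1 * state["m"] + (1 - adam_b1) * G_rot
    state["v"] = adam_b2 * state["v"] + (1 - adam_b2) * G_rot ** 2
    step_rot = state["m"] / (np.sqrt(state["v"]) + eps)

    # Rotate the Adam update back to parameter coordinates.
    return -lr * QL @ step_rot @ QR.T

# Example state for a 4x3 parameter (all buffers start at zero):
# state = {"L": np.zeros((4, 4)), "R": np.zeros((3, 3)),
#          "m": np.zeros((4, 3)), "v": np.zeros((4, 3))}
```
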
Naomi Saphra hiring a lab 🧈🪰 (@nsaphra)'s Twitter Profile Photo

Ever looked at LLM skill emergence and thought 70B parameters was a magic number? Our new paper shows sudden breakthroughs are samples from bimodal performance distributions across seeds. Observed accuracy jumps abruptly while the underlying accuracy DISTRIBUTION changes slowly!

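The claim is easy to see in a toy simulation. In the hypothetical sketch below, each seed's accuracy is drawn from a two-mode mixture whose high-mode weight drifts smoothly with scale, yet any single observed run flips abruptly between roughly 0.1 and 0.9; all numbers are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)

# Per-seed accuracy comes from a bimodal mixture; only the mixing
# weight p_high changes (slowly) with scale. A single run jumps
# abruptly between modes even though the distribution drifts smoothly.
for p_high in np.linspace(0.0, 1.0, 11):
    draws = np.where(rng.random(500) < p_high,
                     rng.normal(0.90, 0.02, 500),   # "emerged" mode
                     rng.normal(0.10, 0.02, 500))   # baseline mode
    print(f"p_high={p_high:.1f}  one observed run={draws[0]:.2f}  "
          f"mean over seeds={draws.mean():.2f}")
```
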
Association for Computing Machinery (@theofficialacm)'s Twitter Profile Photo

Meet the recipients of the 2024 ACM A.M. Turing Award, Andrew G. Barto and Richard S. Sutton! They are recognized for developing the conceptual and algorithmic foundations of reinforcement learning. Please join us in congratulating the two recipients! bit.ly/4hpdsbD

David Alvarez Melis (@elmelis)'s Twitter Profile Photo

🚨 New preprint! TL;DR: Backtracking is not the "holy grail" for smarter LLMs. It’s praised for helping models “fix mistakes” and improve reasoning—but is it really the best use of test-time compute? 🤔

Eran Malach (@eranmalach)'s Twitter Profile Photo

How does RL improve performance on math reasoning? Studying RL from pretrained models is hard, as behavior depends on the choice of base model. 🚨 In our new work, we train models *from scratch* to study the effect of the data mix on the behavior of RL. arxiv.org/abs/2504.07912

Rosie Zhao (@rosieyzh)'s Twitter Profile Photo

Excited to be attending 🇸🇬#ICLR2025! My DMs are open; please reach out to chat about LLM reasoning, optimization, or training dynamics!

Will be presenting a study on diagonal preconditioning optimizers for LLM pretraining (arxiv.org/abs/2407.07972) and SOAP (arxiv.org/abs/2409.11321)
Bingbin Liu (@bingbinl)'s Twitter Profile Photo

Excited to announce MOSS, our ICML workshop focused on discoveries at small scale! We believe there's tremendous potential & creativity in research done with limited resources and would love to hear your ideas. The submission (due May 22nd) can literally be a Jupyter notebook! :)

Nived Rajaraman (@nived_rajaraman)'s Twitter Profile Photo

Announcing the first workshop on Foundations of Post-Training (FoPT) at COLT 2025!

📝 Soliciting abstracts/posters exploring theoretical & practical aspects of post-training and RL with language models!

🗓️ Deadline: May 19, 2025
Antonio Orvieto (@orvieto_antonio)'s Twitter Profile Photo

Adam is similar to many algorithms, but it cannot be effectively replaced by any simpler variant in LMs.
The community is starting to get the recipe right, but what is the secret sauce?

Robert M. Gower 🇺🇦 and I found that it has to do with the beta parameters and variational inference.
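
For reference, the beta parameters in question are the two EMA decay rates in the standard Adam update. A generic textbook sketch, not the paper's variational-inference derivation:

```python
import numpy as np

def adam_step(g, m, v, t, lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update, written out to highlight the two betas.

    beta1 decays the gradient EMA (momentum); beta2 decays the
    squared-gradient EMA (the diagonal preconditioner). Textbook
    form only, not the derivation in the paper the tweet mentions.
    """
    m = beta1 * m + (1 - beta1) * g        # first moment (mean of g)
    v = beta2 * v + (1 - beta2) * g * g    # second moment (mean of g^2)
    m_hat = m / (1 - beta1 ** t)           # bias corrections, t >= 1
    v_hat = v / (1 - beta2 ** t)
    return -lr * m_hat / (np.sqrt(v_hat) + eps), m, v
```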