
Gopeshh Subbaraj
@gopeshh1
PhD Student @Mila_Quebec/UdeM Interested in RL and CL! Prev. developing software @MathWorks. Robotics Grad @WPI. Alum @ReachNITT Views my own!
ID: 601898354
https://www.linkedin.com/in/gopeshhraajsubbaraj/ 07-06-2012 14:04:42
93 Tweet
419 Followers
484 Following


🚨 Very pleased to share our recent work, in which we achieve up to 50x more efficient LLM post-training using off-policy reinforcement learning with replay buffers. Paper: arxiv.org/abs/2503.18929. 🧵See below for a summary of key results by Brian Bartoldson !

Do drop by poster #609 in Hall 2 to hear more about this work. #ICLR2025 Mila - Institut québécois d'IA

🚨 Excited to share our #ICML2025 paper: The Impact of On-Policy Parallelized Data Collection on Deep RL Networks. Big congrats to Walter Mayor-Toro for the amazing work! 🎉 Read the paper here: arxiv.org/abs/2506.03404, and more details in the thread below ⬇️




Interested in LLM training dynamics and scaling laws? Come to our #ACL2025 oral tomorrow! ⏰ Tuesday 2:55pm 📍 Hall C (Language Modeling 1) 🌐 mirandrom.github.io/zsl/ If you're in Vienna and want to chat, let me know! Mila - Institut québécois d'IA