Gopeshh Subbaraj (@gopeshh1) 's Twitter Profile
Gopeshh Subbaraj

@gopeshh1

PhD Student @Mila_Quebec/UdeM Interested in RL and CL! Prev. developing software @MathWorks. Robotics Grad @WPI. Alum @ReachNITT Views my own!

ID: 601898354

Link: https://www.linkedin.com/in/gopeshhraajsubbaraj/ · Joined: 07-06-2012 14:04:42

93 Tweets

419 Followers

484 Following

Sarath Chandar (@apsarathchandar) 's Twitter Profile Photo

In my lab, we have not one but four open postdoc positions! These positions cover developing foundation models for text, proteins, small molecules, genomic data, time series data, and astrophysics data! If you have strong research expertise and a PhD in LLMs and Foundation
Johan S. Obando 👍🏽 (@johanobandoc) 's Twitter Profile Photo

🚨 Very pleased to share our recent work, in which we achieve up to 50x more efficient LLM post-training using off-policy reinforcement learning with replay buffers. Paper: arxiv.org/abs/2503.18929. 🧵See below for a summary of key results by Brian Bartoldson !

Johan S. Obando 👍🏽 (@johanobandoc) 's Twitter Profile Photo

🚨 Excited to share our #ICML2025 paper: The Impact of On-Policy Parallelized Data Collection on Deep RL Networks. Big congrats to Walter Mayor-Toro for the amazing work! 🎉 Read the paper here: arxiv.org/abs/2506.03404, and more details in the thread below ⬇️

Roger Creus Castanyer (@creus_roger) 's Twitter Profile Photo

🚨 Excited to share our new work: "Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning"! 📈 We propose gradient interventions that enable stable, scalable learning, achieving significant performance gains across agents and environments! Details below 👇
Mila - Institut québécois d'IA (@mila_quebec) 's Twitter Profile Photo

Chef robots need to act fast or omelets burn! This Mila blog tackles real-time reinforcement learning challenges and introduces solutions for minimizing both inaction and delay regret. mila.quebec/en/article/rea…
Johan S. Obando 👍🏽 (@johanobandoc) 's Twitter Profile Photo

🚨 Excited to share our #ICML2025 paper: "The Courage to Stop: Overcoming Sunk Cost Fallacy in Deep RL" We train RL agents to know when to quit, cutting wasted effort and improving efficiency with our method LEAST. 📄Paper: arxiv.org/pdf/2506.13672 🧵Check the thread below👇🏾
Andrei Mircea (@mirandrom) 's Twitter Profile Photo

Interested in LLM training dynamics and scaling laws? Come to our #ACL2025 oral tomorrow! ⏰ Tuesday 2:55pm 📍 Hall C (Language Modeling 1) 🌐 mirandrom.github.io/zsl/ If you're in Vienna and want to chat, let me know! Mila - Institut québécois d'IA