Jesse Farebrother (@jessefarebro) 's Twitter Profile
Jesse Farebrother

@jessefarebro

Ph.D. student studying AI & decision making at @Mila_Quebec / @McGillU. Currently at @AIatMeta. Previously @GoogleDeepMind, @Google 🧠.

ID: 44497338

linkhttps://brosa.ca calendar_today04-06-2009 00:12:16

397 Tweet

867 Followers

395 Following

Amii (@amiithinks) 's Twitter Profile Photo

BREAKING: Amii Chief Scientific Advisor, Richard S. Sutton, has been awarded the A.M. Turing Award, the highest honour in computer science, alongside Andrew Barto! Read the official Association for Computing Machinery announcement: hubs.la/Q039nZXM0 #TuringAward #AI #ReinforcementLearning

BREAKING: Amii Chief Scientific Advisor, Richard S. Sutton, has been awarded the A.M. Turing Award, the highest honour in computer science, alongside Andrew Barto! Read the official <a href="/TheOfficialACM/">Association for Computing Machinery</a> announcement: hubs.la/Q039nZXM0

#TuringAward #AI #ReinforcementLearning
Marlos C. Machado (@marloscmachado) 's Twitter Profile Photo

Going through my BSc and MSc studies in Brazil I would hear about Turing Award winners. Those were not real people to me, they were mythological figures so far from me. Now Rich has won it! Thank you, Rich. You have no idea how meaningful this is to me. nytimes.com/2025/03/05/tec…

Jesse Farebrother (@jessefarebro) 's Twitter Profile Photo

As an undergraduate student, taking Richard Sutton’s course at University of Alberta was a defining moment in my academic journey. His work and teachings have shaped the paths of countless researchers, including my own. Congrats, Rich & Andy!

Clare Lyle (@clarelyle) 's Twitter Profile Photo

📣📣 My team at Google DeepMind is hiring a student researcher for summer/fall 2025 in Seattle! If you're a PhD student interested in getting deep RL to (finally) work reliably in interesting domains, apply at the link below and reach out to me via email so I know you aplied👇

📣📣 My team at Google DeepMind is hiring a student researcher for summer/fall 2025 in Seattle! If you're a PhD student interested in getting deep RL to (finally) work reliably in interesting domains, apply at the link below and reach out to me via email so I know you aplied👇
Marc G. Bellemare (@marcgbellemare) 's Twitter Profile Photo

At Reliant we've found RL to be incredibly efficient at improving answer quality to life sciences' hardest questions. Today we're putting out our work on LLM fine-tuning with off-policy RL, matching llama 70B performance with an 8B model - take a look! arxiv.org/abs/2503.14286

At Reliant we've found RL to be incredibly efficient at improving answer quality to life sciences' hardest questions. Today we're putting out our work on LLM fine-tuning with off-policy RL, matching llama 70B performance with an 8B model - take a look!

arxiv.org/abs/2503.14286
Jesse Farebrother (@jessefarebro) 's Twitter Profile Photo

Don’t miss this amazing opportunity to work with Pablo at Google DeepMind—one of the highlights of my PhD. He’s an incredible mentor, and I can’t say enough good things about working with him!

Arnav Jain (@arnavkj95) 's Twitter Profile Photo

📢 Come say hi at our SFM poster at #ICLR2025, Poster Session 5 – #572! We’re presenting a method for Inverse Reinforcement Learning via Successor Feature Matching — a non-adversarial approach that works without action labels. Excited to share and chat!

📢 Come say hi at our SFM poster at #ICLR2025, Poster Session 5 – #572!

We’re presenting a method for Inverse Reinforcement Learning via Successor Feature Matching — a non-adversarial approach that works without action labels.

Excited to share and chat!
Marc G. Bellemare (@marcgbellemare) 's Twitter Profile Photo

Take a look at this amazing piece of work by my student Jesse Farebrother - a new kind of world model based on successor representations that's a lot more robust than prior iterations. Incredible to see all the progress we've made in the last 5 years in RL.

Gokul Swamy (@g_k_swamy) 's Twitter Profile Photo

Say ahoy to 𝚂𝙰𝙸𝙻𝙾𝚁⛵: a new paradigm of *learning to search* from demonstrations, enabling test-time reasoning about how to recover from mistakes w/o any additional human feedback! 𝚂𝙰𝙸𝙻𝙾𝚁 ⛵ out-performs Diffusion Policies trained via behavioral cloning on 5-10x data!

Matteo Pirotta (@teopir) 's Twitter Profile Photo

Exciting PhD position open at FAIR in Paris. We are looking for a candidate to join our team and contribute to advancing the field of AI, especially reinforcement learning. Find more details and apply below. Feel free to reach out to me by email. metacareers.com/jobs/192266079…

Nate Rahn (@n8rahn) 's Twitter Profile Photo

Late update: I’ve moved to the Bay Area for a 6-month research fellowship at Anthropic ! I’d be glad to meet other researchers working on RL for language models, agents, subtle and unverifiable rewards, etc. — DMs open.

Jesse Farebrother (@jessefarebro) 's Twitter Profile Photo

Heading to Vancouver for #ICML2025 to present our work: Temporal Difference Flows. Make sure to check out the oral to learn how we’re now able to scale this exciting world model framework based on the successor representation! Also, feel free to reach out to discuss anything RL!

Heading to Vancouver for #ICML2025 to present our work: Temporal Difference Flows. Make sure to check out the oral to learn how we’re now able to scale this exciting world model framework based on the successor representation! Also, feel free to reach out to discuss anything RL!