Jesse Farebrother (@jessefarebro) Twitter Tweets • TwiCopy

Jesse Farebrother

@jessefarebro

Ph.D. student studying AI & decision making at @Mila_Quebec / @McGillU. Currently at @AIatMeta. Previously @GoogleDeepMind, @Google 🧠.

ID: 44497338

Link: https://brosa.ca | Joined: 04-06-2009 00:12:16

397 Tweets

867 Followers

395 Following

Amii

@amiithinks

9 months ago

BREAKING: Amii Chief Scientific Advisor, Richard S. Sutton, has been awarded the A.M. Turing Award, the highest honour in computer science, alongside Andrew Barto! Read the official Association for Computing Machinery announcement: hubs.la/Q039nZXM0 #TuringAward #AI #ReinforcementLearning

Likes: 234 | Replies: 5 | Retweets: 50

Marlos C. Machado

@marloscmachado

9 months ago

Going through my BSc and MSc studies in Brazil I would hear about Turing Award winners. Those were not real people to me, they were mythological figures so far from me. Now Rich has won it! Thank you, Rich. You have no idea how meaningful this is to me. nytimes.com/2025/03/05/tec…

Likes: 20 | Replies: 0 | Retweets: 4

Jesse Farebrother

@jessefarebro

9 months ago

As an undergraduate student, taking Richard Sutton’s course at University of Alberta was a defining moment in my academic journey. His work and teachings have shaped the paths of countless researchers, including my own. Congrats, Rich & Andy!

Likes: 24 | Replies: 0 | Retweets: 2

Dieter Büchler

@dtrbchlr

9 months ago

Amii Intelligent Systems If you are interested in working with me at *the* RL powerhouse University of Alberta on robot learning on physical robots, please drop me a message. Retweets welcome 🙏

Likes: 28 | Replies: 3 | Retweets: 13

Clare Lyle

@clarelyle

8 months ago

📣📣 My team at Google DeepMind is hiring a student researcher for summer/fall 2025 in Seattle! If you're a PhD student interested in getting deep RL to (finally) work reliably in interesting domains, apply at the link below and reach out to me via email so I know you applied 👇

Likes: 622 | Replies: 7 | Retweets: 74

Marc G. Bellemare

@marcgbellemare

8 months ago

At Reliant we've found RL to be incredibly efficient at improving answer quality to life sciences' hardest questions. Today we're putting out our work on LLM fine-tuning with off-policy RL, matching llama 70B performance with an 8B model - take a look! arxiv.org/abs/2503.14286

Likes: 305 | Replies: 3 | Retweets: 39

Jesse Farebrother

@jessefarebro

8 months ago

Don’t miss this amazing opportunity to work with Pablo at Google DeepMind—one of the highlights of my PhD. He’s an incredible mentor, and I can’t say enough good things about working with him!

Likes: 46 | Replies: 1 | Retweets: 1

Arnav Jain

@arnavkj95

7 months ago

📢 Come say hi at our SFM poster at #ICLR2025, Poster Session 5 – #572! We’re presenting a method for Inverse Reinforcement Learning via Successor Feature Matching — a non-adversarial approach that works without action labels. Excited to share and chat!

Likes: 33 | Replies: 0 | Retweets: 10

Marc G. Bellemare

@marcgbellemare

7 months ago

Take a look at this amazing piece of work by my student Jesse Farebrother - a new kind of world model based on successor representations that's a lot more robust than prior iterations. Incredible to see all the progress we've made in the last 5 years in RL.

Likes: 88 | Replies: 0 | Retweets: 7

Gokul Swamy

@g_k_swamy

6 months ago

Say ahoy to 𝚂𝙰𝙸𝙻𝙾𝚁⛵: a new paradigm of *learning to search* from demonstrations, enabling test-time reasoning about how to recover from mistakes w/o any additional human feedback! 𝚂𝙰𝙸𝙻𝙾𝚁 ⛵ out-performs Diffusion Policies trained via behavioral cloning on 5-10x data!

Likes: 247 | Replies: 10 | Retweets: 64

Matteo Pirotta

@teopir

5 months ago

Exciting PhD position open at FAIR in Paris. We are looking for a candidate to join our team and contribute to advancing the field of AI, especially reinforcement learning. Find more details and apply below. Feel free to reach out to me by email. metacareers.com/jobs/192266079…

Likes: 215 | Replies: 1 | Retweets: 43

Nate Rahn

@n8rahn

5 months ago

Late update: I’ve moved to the Bay Area for a 6-month research fellowship at Anthropic! I’d be glad to meet other researchers working on RL for language models, agents, subtle and unverifiable rewards, etc. DMs open.

Likes: 465 | Replies: 6 | Retweets: 11

Jesse Farebrother

@jessefarebro

5 months ago

Turns out Hugo Larochelle was ahead of his time once again with these cards existing in the wild 😄 thestar.com/opinion/contri…

Likes: 25 | Replies: 0 | Retweets: 1

Jesse Farebrother

@jessefarebro

4 months ago

Heading to Vancouver for #ICML2025 to present our work: Temporal Difference Flows. Make sure to check out the oral to learn how we’re now able to scale this exciting world model framework based on the successor representation! Also, feel free to reach out to discuss anything RL!

Likes: 150 | Replies: 1 | Retweets: 26