
Marc G. Bellemare
@marcgbellemare
CSO & co-founder, Reliant AI. Ex RL research lead at Google Brain, DeepMind. Known for Atari 2600 RL benchmark, Distributional RL (MIT Press 2023).
ID: 289158382
http://marcgbellemare.info 28-04-2011 03:50:48
1.1K Tweets
14.14K Followers
349 Following


It took us 2+ years to figure out exactly how to think about & work with a distributional version of the successor representation - doubly proud of this work by Jesse Farebrother and Harley Wiltzer that both lays down a mathematical foundation and improves on γ-models! Also, A+ visuals.

On the back of our 2017 distributional RL paper Martha White and Ehsan Imani wrote a piece showing that you can do regression better with a classification loss... that seemed wild at the time, but Jesse Farebrother, Rishabh Agarwal and co pushed this further and the results are amazing!


Amazing piece of work by Jesse Farebrother, Rishabh Agarwal, & star co-authors digging into classification losses in RL and their unreasonable effectiveness in a problem space that has mostly been dominated by regression methods. Don't miss this talk at ICML!

Because one level of distributions isn't enough - don't miss tomorrow's ICML spotlight by Harley Wiltzer, Jesse Farebrother, Arthur Gretton, and Mark Rowland: lifting the successor representation to distributions and moving the needle on what you can do with techniques like γ-models.

Distributional successor features: A follow-up to our distributional successor representation by my students Harley Wiltzer and Jesse Farebrother - those manim animations are quite something!


🚀 Extremely excited about our latest work on Distributional RL algorithms for *high-frequency control*, to be presented at #neurips2024! Incredible collaboration with the OT wizard Yash Jhaveri, Marc G. Bellemare, David Meger, Patrick Shafto. Paper: arxiv.org/pdf/2410.11022

we've used Atari games as an RL benchmark for so long, but for a little while it's bugged me that it's a discrete action problem, since the original joysticks were analog... Jesse Farebrother & i fix this by introducing the Continuous ALE (CALE)! read thread for details! 1/9



Take a look at this amazing piece of work by my student Jesse Farebrother - a new kind of world model based on successor representations that's a lot more robust than prior iterations. Incredible to see all the progress we've made in the last 5 years in RL.

Goodbye Toronto! So many serendipitous meetings at Toronto Tech Week, incredible energy. Learned that Isaac Souweine and I like the same parties. Met too many AI founders to count, all making amazing new things. Now back to building!


