
Michael Matthews @ ICLR 2025
@mitrma
PhD student @FLAIR_Ox working on RL in open-ended environments
ID: 429236521
https://www.mtmatthews.com/ 05-12-2011 18:40:04
128 Tweet
778 Followers
318 Following




Congratulations to antoine dedieu Joe Ortiz Kevin Patrick Murphy and the team for setting a new SOTA on Craftax-1M and Craftax-Classic-1M! 🎉


Jakob Foerster Jakob Foerster at University of Oxford arguing that the AI community needs to avoid being goodharted by benchmarks.

⚔️ MiniHack Updates! ⚔️ 1️⃣ MiniHack 1.0.0 is here! Following popular demand, it now supports the new Gymnasium API and is built on NLE 1.1.0. Huge thanks to @Stephen_Oman (maintainer of The NetHack Learning Environment ) for his outstanding contribution! 🙌





I'll be attending ICLR next week to present Kinetix with Michael Matthews. Would love to chat about anything UED / Open-Ended RL / QD related, or interesting research in general :)

I'll be in Singapore next week to present Kinetix as an Oral along with Michael Beukman. Reach out if you'd like to chat! 🇸🇬



Hello World: My team at FAIR / AI at Meta (AI Research Agent) is looking to hire contractors across software engineering and ML. If you are interested and based in the UK, please fill in the following short EoI form: docs.google.com/forms/d/e/1FAI…


A couple bits of news: 1. Happy to share my first (human) NetHack ascension-next step is RL agents :) 2. I wrote a post discussing some The NetHack Learning Environment challenges & how they map to open problems in RL & agentic AI. Still the best RL benchmark imo. mikaelhenaff.substack.com/p/first-nethac…


You work on RL from pixels, and you're tired to wait 10 hours for a DMC run to finish? Or up to 100 hours, if you add video distractors? Well, we got you covered : PixelBrax can run your continuous control experiments from pixels in < 1 hr! Come chat with Trevor McInroe and I at
