WhiRL (@whi_rl) Twitter Tweets • TwiCopy

Gate.io

5 hours ago

🔥The 9th Round of Easy Loan, Earn $40 Reward is in progress❗️ ⏰ Promotion Period: January 15th - Feburary 15th, 2025 👉 Register now and check more details at gate.io/campaigns/358

thumb_up_off_alt34

chat_bubble_outline39

repeat6

shareShare

I am recruiting for a Waymo-funded PhD studentship in my academic lab at Oxford Comp Sci to work on fundamental challenges in imitation learning for autonomous vehicles. Deadline to apply is 1 March. cs.ox.ac.uk/admissions/gra…

thumb_up_off_alt77

chat_bubble_outline1

repeat20

shareShare

Matthew Jackson

@jacksonmattt

a year ago

Meta-learning can discover RL algorithms with novel modes of learning, but how can we make them adapt to any training horizon? Introducing our #ICLR2024 work on discovering *temporally-aware* RL algorithms! Work co-led with Chris Lu, in Foerster Lab for AI Research and WhiRL

thumb_up_off_alt111

chat_bubble_outline1

repeat25

shareShare

WhiRL

@whi_rl

a year ago

Come see this fantastic tutorial today on meta-RL by lab members Risto Vuorio and Jacob Beck at #AAAI24!

thumb_up_off_alt9

chat_bubble_outline0

repeat2

shareShare

WhiRL

@whi_rl

a year ago

Real-world agents often can’t interact with their environment… So what if they imagined it? 🤔💭 In our new work (led by Matthew Jackson) we use diffusion models to interactively generate entire sequences of synthetic experience 🤖

thumb_up_off_alt11

chat_bubble_outline0

repeat2

shareShare

WhiRL

@whi_rl

a year ago

Excited to share that our work Bayesian Exploration Networks (BEN) has been accepted at ICML 🍾! BEN is the first model-free Bayesian RL approach that can learn Bayes-optimal policies 🙀 Congrats to @mattiefoxcs and collaborators! arxiv.org/pdf/2308.13049

thumb_up_off_alt3

chat_bubble_outline0

repeat0

shareShare

Matthew Jackson

@jacksonmattt

a year ago

Exciting updates to Policy-Guided Diffusion! 🎉 PGD was accepted at RL_Conference - see you in Amherst! 📈 For those building on PGD, we just released WandB logs with agent and diffusion model training: api.wandb.ai/links/flair/jo…

thumb_up_off_alt124

chat_bubble_outline2

repeat28

shareShare

Alex Goldie

@alexdgoldie

a year ago

1/ 🤖 Learned optimization offers huge potential to automate machine learning! So why doesn't it work well in RL (and how did we fix it)?! I'm excited to share OPEN, our AutoRL Workshop spotlight paper exploring this question! 🧵

thumb_up_off_alt117

chat_bubble_outline1

repeat27

shareShare

Alex Goldie

@alexdgoldie

a year ago

Attending #ICML2024 was amazing and full of firsts: My first time presenting a poster, first time giving a talk at a conference and first time sitting on a panel! Many thanks to the AutoRL Workshop organisers for preparing a great workshop about AutoRL!

thumb_up_off_alt55

chat_bubble_outline4

repeat6

shareShare

WhiRL

@whi_rl

a year ago

Check out this fantastic showing by Jacob Beck and Alex Goldie at #ICML2024! 🎙️🔥 They dove deep into the future of automated RL, meta-learning, and LLMs 🤖🔮

thumb_up_off_alt10

chat_bubble_outline0

repeat1

shareShare

Jacob Beck

@jakeabeck

7 months ago

🎉🚨 Big news! Our research, Metalic: Meta-Learning In-Context with Protein Language Models, 🧬 won a competition! #NeurIPS2024🤖📚 We advance in-context learning and protein fitness prediction with this paradigm: ✨ Pre-training 🔥 Learning to in-context learn🔥 ✨ Fine-tuning

thumb_up_off_alt7

chat_bubble_outline2

repeat4

shareShare

Shimon Whiteson

@shimon8282

7 months ago

A new version of the paper Counterfactual Multi-Agent Policy Gradients, that I first published in 2017 with Jakob Foerster, Greg Farquhar and others, is now available on arXiv (arxiv.org/abs/1705.08926).

thumb_up_off_alt43

chat_bubble_outline1

repeat8

shareShare

Shangtong Zhang

@shangtongzhang

5 months ago

Excited to share our new survey of in-context reinforcement learning!! arxiv.org/abs/2502.07978 w/ Amir Moeini Jiuqi Wang Jacob Beck Ethan Blaser Shimon Whiteson Rohan Chandra

thumb_up_off_alt223

chat_bubble_outline4

repeat50

shareShare

Jacob Beck

@jakeabeck

4 months ago

Big news—our survey paper “A Tutorial on Meta-Reinforcement Learning” is officially published! Meta-RL = learning how to adapt through interaction. It embraces The Bitter Lesson: don’t hardcode agents—train them to adapt on their own arxiv.org/abs/2301.08028 🧵⬇️

thumb_up_off_alt336

chat_bubble_outline2

repeat79

shareShare

Luisa Zintgraf

@luisa_zintgraf

4 months ago

🎉 Our Meta-RL survey is now published in Foundations and Trends in Machine Learning! A deep dive into how agents can learn to learn 🤖🧠 Huge kudos to Jacob Beck & Risto Vuorio for leading the charge, and to co-authors Evan Liu, Zheng Xiong, Chelsea Finn & Shimon Whiteson!

thumb_up_off_alt52

chat_bubble_outline2

repeat11

shareShare

Shimon Whiteson

@shimon8282

3 months ago

Our survey on meta reinforcement learning has now been published by Foundations and Trends in Machine Learning: nowpublishers.com/article/Detail…

thumb_up_off_alt37

chat_bubble_outline5

repeat2

shareShare

Matthew Jackson

@jacksonmattt

3 months ago

🌹 Today we're releasing Unifloral, our new library for Offline Reinforcement Learning! We make research easy: ⚛️ Single-file 🤏 Minimal ⚡️ End-to-end Jax Best of all, we unify prior methods into one algorithm - a single hyperparameter space for research! ⤵️

thumb_up_off_alt138

chat_bubble_outline5

repeat35

shareShare

WhiRL

@whi_rl

3 months ago

The best of RL research, brought to Offline RL! 🚀 TL;DR 1. CleanRL-style implementations ⚡️ 2. Rainbow-style algorithm unification 🦾 3. Rliable-style evaluation protocol 🔬 Check out our paper + library!

thumb_up_off_alt15

chat_bubble_outline0

repeat1

shareShare