Foerster Lab for AI Research (@flair_ox) Twitter Tweets • TwiCopy

Foerster Lab for AI Research

@flair_ox

+ Follow

ML research group @uniofoxford. Focussed on multi-agent, open-ended, meta and reinforcement learning as well as agent based models. More at foersterlab.com.

ID: 1508569815014727680

linkhttp://www.foersterlab.com calendar_today28-03-2022 22:20:53

58 Tweet

1,1K Followers

55 Following

Foerster Lab for AI Research

@flair_ox

7 months ago

👀 Language models trained to play Among Us with MARL!

thumb_up_off_alt41

chat_bubble_outline1

repeat4

shareShare

My group Foerster Lab for AI Research is recruiting a postdoc and looking for someone who can get started by the end of April. Deadline to apply is in one week (!), 19th of March at noon, so please help spread the word: my.corehr.com/pls/uoxrecruit…

thumb_up_off_alt34

chat_bubble_outline0

repeat14

shareShare

Matthew Jackson

@jacksonmattt

4 months ago

🌹 Today we're releasing Unifloral, our new library for Offline Reinforcement Learning! We make research easy: ⚛️ Single-file 🤏 Minimal ⚡️ End-to-end Jax Best of all, we unify prior methods into one algorithm - a single hyperparameter space for research! ⤵️

thumb_up_off_alt138

chat_bubble_outline5

repeat35

shareShare

Jarek Liesen

@jarekliesen

4 months ago

⁉️ While trying to find the best hyperparameter setting of ORL algorithms using a bandit, we noticed something unexpected: 🤯 After evaluating the episodic returns of more and more policies online, the bandit's performance *decreased*! x.com/JacksonMattT/s…

thumb_up_off_alt17

chat_bubble_outline1

repeat4

shareShare

Uljad Berdica

@uljadb99

4 months ago

🔮Looking forward, we intend Unifloral🌹to be more than a library—it's a scaffolding 🌱 for indexing current & future ORL work!🏵️ We encourage 🥺 you to: 🔄 PR your awesome work using the 🌹 format 🎮 Explore the unified implementation 🧩 Try to find new SOTA algos with it

thumb_up_off_alt20

chat_bubble_outline2

repeat4

shareShare

Foerster Lab for AI Research

@flair_ox

4 months ago

FLAIR is at ICLR 🇸🇬 Find out our schedule for the week 👇

thumb_up_off_alt57

chat_bubble_outline0

repeat13

shareShare

Michael Matthews @ ICLR 2025

@mitrma

4 months ago

We are presenting Kinetix today! Oral - 11:30am Peridot Room 5F Poster - 3pm Hall 3+2B 377

thumb_up_off_alt22

chat_bubble_outline1

repeat3

shareShare

Jakob Foerster

@j_foerst

3 months ago

Hello World: My team at FAIR / AI at Meta (AI Research Agent) is looking to hire contractors across software engineering and ML. If you are interested and based in the UK, please fill in the following short EoI form: docs.google.com/forms/d/e/1FAI…

thumb_up_off_alt111

chat_bubble_outline3

repeat23

shareShare

nathan monette @ ICLR 2025 🇸🇬

@nathanrmonette

3 months ago

Excited to announce my first paper, with Jakob Foerster and Foerster Lab for AI Research, was accepted into @rl_conference 2025! We establish a new UED method called NCC that obtains strong performance based on principles of optimisation theory.

Excited to announce my first paper, with <a href="/j_foerst/">Jakob Foerster</a> and <a href="/FLAIR_Ox/">Foerster Lab for AI Research</a>, was accepted into @rl_conference 2025!

We establish a new UED method called NCC that obtains strong performance based on principles of optimisation theory.

thumb_up_off_alt68

chat_bubble_outline1

repeat10

shareShare

Clarisse Wibault

@clarissewibault

3 months ago

How can we bypass the need for online hyper-parameter tuning in offline RL? Foerster Lab for AI Research is introducing two fully offline algorithms: SOReL, for accurate offline regret approximation, and TOReL, for offline hyper-parameter tuning! arxiv.org/html/2505.2244…

thumb_up_off_alt17

chat_bubble_outline1

repeat7

shareShare

Tim Franzmeyer

@frtimlive

3 months ago

What if LLMs knew when to stop? 🚧 HALT finetuning teaches LLMs to only generate content they’re confident is correct. 🔍 Insight: Post-training must be adjusted to the model’s capabilities. ⚖️ Tunable trade-off: Higher correctness 🔒 vs. More completeness 📝 with AI at Meta 🧵

thumb_up_off_alt62

chat_bubble_outline1

repeat13

shareShare