Foerster Lab for AI Research (@flair_ox)'s Twitter Profile
Foerster Lab for AI Research

@flair_ox

ML research group @uniofoxford. Focussed on multi-agent, open-ended, meta and reinforcement learning as well as agent based models. More at foersterlab.com.

ID: 1508569815014727680

Link: http://www.foersterlab.com
Joined: 28-03-2022 22:20:53

58 Tweets

1.1K Followers

55 Following

Jakob Foerster (@j_foerst)

My group Foerster Lab for AI Research is recruiting a postdoc and looking for someone who can get started by the end of April. Deadline to apply is in one week (!), 19th of March at noon, so please help spread the word: my.corehr.com/pls/uoxrecruit…

Matthew Jackson (@jacksonmattt)

🌹 Today we're releasing Unifloral, our new library for Offline Reinforcement Learning!
We make research easy:
⚛️ Single-file
🤏 Minimal
⚡️ End-to-end Jax
Best of all, we unify prior methods into one algorithm - a single hyperparameter space for research! ⤵️

Jarek Liesen (@jarekliesen)

⁉️ While trying to find the best hyperparameter setting of ORL algorithms using a bandit, we noticed something unexpected: 🤯 After evaluating the episodic returns of more and more policies online, the bandit's performance *decreased*! x.com/JacksonMattT/s…

Uljad Berdica (@uljadb99)

🔮Looking forward, we intend Unifloral🌹to be more than a library—it's a scaffolding 🌱 for indexing current & future ORL work!🏵️ We encourage 🥺 you to: 🔄 PR your awesome work using the 🌹 format 🎮 Explore the unified implementation 🧩 Try to find new SOTA algos with it

Jakob Foerster (@j_foerst)

Hello World: My team at FAIR / AI at Meta (AI Research Agent) is looking to hire contractors across software engineering and ML. If you are interested and based in the UK, please fill in the following short EoI form: docs.google.com/forms/d/e/1FAI…

nathan monette @ ICLR 2025 🇸🇬 (@nathanrmonette)

Excited to announce my first paper, with Jakob Foerster and Foerster Lab for AI Research, was accepted into @rl_conference 2025! We establish a new UED method called NCC that obtains strong performance based on principles of optimisation theory.

Tim Franzmeyer (@frtimlive)

What if LLMs knew when to stop? 🚧
HALT finetuning teaches LLMs to only generate content they’re confident is correct.
🔍 Insight: Post-training must be adjusted to the model’s capabilities.
⚖️ Tunable trade-off: Higher correctness 🔒 vs. More completeness 📝
with AI at Meta 🧵