Adrien Alitaiga (@aalitaiga) 's Twitter Profile
Adrien Alitaiga

@aalitaiga

PhD student at Mila

ID: 436069026

calendar_today13-12-2011 19:09:43

37 Tweet

183 Followers

256 Following

Marc G. Bellemare (@marcgbellemare) 's Twitter Profile Photo

Congrats to my PhD student Adrien Alitaiga for winning Best Paper Award at the Exploration in RL Workshop at ICML19, "Benchmarking Bonus-Based Exploration Methods on the Arcade Learning Environment"! Talk today, 11:30, Hall A. #ICML2019 #ERL19 Aaron Courville Michelle Cholodovskis Machado William Fedus

William Fedus (@liamfedus) 's Twitter Profile Photo

In 'Benchmarking Bonus-Based Exploration Methods in ALE' we find that when standardizing training duration, architecture, model capacity - new methods do not clearly improve over prior baselines Work led by Adrien Alitaiga which received ICML exp. workshop 2019 best paper award!

In 'Benchmarking Bonus-Based Exploration Methods in ALE'  we find that when standardizing training duration, architecture, model capacity - new methods do not clearly improve over prior baselines

Work led by <a href="/aalitaiga/">Adrien Alitaiga</a> which received ICML exp. workshop 2019 best paper award!
Marlos C. Machado (@marloscmachado) 's Twitter Profile Photo

We're pleased to let you know that your submission, On Bonus Based Exploration Methods In The Arcade Learning Environment, has been accepted at #ICLR2020! openreview.net/forum?id=BJewl… This huge endeavor was led by Adrien Alitaiga. W/ William Fedus, Marc G. Bellemare & Aaron Courville. More👇🏼

Dibya Ghosh (@its_dibya) 's Twitter Profile Photo

At #ICML2021 and excited about RL? Come to the RL social this evening 5-7 PM PT! All levels of experience welcome, especially newcomers. We're lucky to have several RL experts confirmed attending -- a great opportunity to chat with them :) Join: icml.cc/virtual/2021/s…

At #ICML2021 and excited about RL? Come to the RL social this evening 5-7 PM PT! 

All levels of experience welcome, especially newcomers. We're lucky to have several RL experts confirmed attending -- a great opportunity to chat with them :)

Join: icml.cc/virtual/2021/s…
Konrad Żołna (@konradzolna) 's Twitter Profile Photo

Gather-Attend-Scatter (GATS), a novel module that combines pretrained foundation models operating at different rates into larger multimodal networks. Paper: arxiv.org/abs/2401.08525

Aviral Kumar (@aviral_kumar2) 's Twitter Profile Photo

Super simple code change to get value-based deep RL scale *much* better w/ big models across the board on Atari games, robotic manipulation w/ transformers, LLM + text games, & even Chess! Just use classification loss (i.e., cross entropy), not MSE!! arxiv.org/abs/2403.03950🧵⬇️

Super simple code change to get value-based deep RL scale *much* better w/ big models across the board on Atari games, robotic manipulation w/ transformers, LLM + text games, &amp; even Chess!

Just use classification loss (i.e., cross entropy), not MSE!!

arxiv.org/abs/2403.03950🧵⬇️
Jesse Farebrother (@jessefarebro) 's Twitter Profile Photo

Framing regression as a classification has been “dark knowledge” for some time. We wanted to shed some light on this phenomenon in deep RL: Framing value-learning as a classification significantly improves performance and scalability in deep RL. But... not all classification

Framing regression as a classification has been “dark knowledge” for some time. We wanted to shed some light on this phenomenon in deep RL:

Framing value-learning as a classification significantly improves performance and scalability in deep RL.

But... not all classification