Marcel Hussing (@marcel_hussing)'s Twitter Profile
Marcel Hussing

@marcel_hussing

PhD student at @Penn @GRASPlab @LifelongML_Penn interested in reliable and replicable reinforcement learning. All tweets are my own. marcelhussing.bsky.social

ID: 837337930611978243

Link: https://marcelhussing.github.io/
Joined: 02-03-2017 16:24:54

471 Tweets

303 Followers

276 Following

Marcel Hussing (@marcel_hussing)

Before, it was all about finding the next architecture. Now, it's all about finding the best tokenization. Next, it will be all about finding ways to collect new data and exploring.

Marcel Hussing (@marcel_hussing)

We presented this work at RL_Conference this year at the Finding The Frame Workshop, and we have now put a version on arXiv. The purpose of benchmarking in RL seems convoluted, and benchmarking choices are often not given much thought. We argue that we should change this!

Marcel Hussing (@marcel_hussing)

🎉 Claas and I have another preprint out: MAD-TD 🤪 After combatting divergence in high-UTD RL, we can now stabilize OOD action-value prediction using model-based synthetic data. This makes resetting networks unnecessary and leads to great performance. Check it out 👇

Marcel Hussing (@marcel_hussing)

After putting it off for a while, I’m diving into transformer research this winter. Joining John Langford’s team at Microsoft Research to work on transformer representations. Don't worry, I'm not leaving RL behind, but I need to learn some new things so we can get to point 3.

Marcel Hussing (@marcel_hussing)

CoLLAs stacking their line-up early. Met Jakob at ICML and he is amazing! Even more excited because the conference is gonna be here at Penn next year. Come visit us!

Marcel Hussing (@marcel_hussing)

My first week at Microsoft Research is ending today. One of the highlights of this week was how excited everyone got when people shared that this work was public now and how supportive everyone was. Definitely check it out: microsoft.github.io/OmniParser/

Marcel Hussing (@marcel_hussing)

Been hearing a lot about #NotebookLM but haven't tried it out myself. To my fellow researchers, do you find it useful and how do you use it? Is it worth getting started and if so, what are some good things to try? #Research #Science #AcademicTwitter

Claas Voelcker (@c_voelcker)

What is the perception of #TMLR in the community at this point? How much is it "worth" compared to conference papers in core AI/ML? I've had very different reactions to this IRL, so I want to get a more general feeling. #mltwitter

Marcel Hussing (@marcel_hussing)

Eugene got me to try it out. I also made a starter pack for machine learning #theory people but I haven't found too many I know. So if you want in just message me there x.com/EugeneVinitsky…

Claas Voelcker (@c_voelcker)

As everyone in #mltwitter, I'm probably going to be more active on ye goode other app in the sky going forward. Same handle, same shit-posting!

Gokul Swamy (@g_k_swamy)

Also on 🦋 Swaminathan.bsky.social! If you're looking for where to get started, the following starter packs of people to follow should be helpful: go.bsky.app/3WPHcHg, go.bsky.app/21nFz12, go.bsky.app/PcwNoSy, go.bsky.app/DfAoaJ1, go.bsky.app/SipA7it.

Marcel Hussing (@marcel_hussing)

Throwing compute at data has been incredibly powerful in other domains but used to be difficult in RL. Not so much anymore! Our MAD-TD paper will be presented as a spotlight 🚨 at #ICLR. As Claas and I are both not active here anymore, come discuss with us on the other platform!

Claas Voelcker (@c_voelcker)

We often use #VAML/#MuZero losses with deterministic models. But if we want stochastic models to measure uncertainty or to leverage current SOTA models such as #transformers and #diffusion, we need to take care! Naively translating the loss functions leads to mistakes!