Florent Delgrange (@f_delgrange) Twitter Tweets • TwiCopy

Florent Delgrange

@f_delgrange

+ Follow

postdoc @aibrussels; working on providing reliable and verifiable AI mechanisms, with a strong focus on Reinforcement Learning

ID: 1647952411279892480

linkhttp://delgrange.me calendar_today17-04-2023 13:17:44

26 Tweet

71 Followers

271 Following

Gate.io

@gate_io

5 hours ago

🔥The 9th Round of Easy Loan, Earn $40 Reward is in progress❗️ ⏰ Promotion Period: January 15th - Feburary 15th, 2025 👉 Register now and check more details at gate.io/campaigns/358

thumb_up_off_alt34

chat_bubble_outline39

repeat6

shareShare

Today we had the pleasure of receiving Florent Delgrange and⁩ ⁦Raphael Avalos⁩ for exciting talks on Wasserstein Auto-Encoded MDPs and it’s application to POMDP with the Wasserstein Believer! WAE-MDP: openreview.net/pdf?id=JLLTtEd… Wasserstein Believer: openreview.net/pdf?id=V5GQVp8…

Today we had the pleasure of receiving <a href="/f_delgrange/">Florent Delgrange</a> and⁩ ⁦<a href="/raphael_avalos/">Raphael Avalos</a>⁩ for exciting talks on Wasserstein Auto-Encoded MDPs and it’s application to POMDP with the Wasserstein Believer!

WAE-MDP: openreview.net/pdf?id=JLLTtEd…

Wasserstein Believer: openreview.net/pdf?id=V5GQVp8…

thumb_up_off_alt10

chat_bubble_outline1

repeat3

shareShare

Raphael Avalos

@raphael_avalos

2 years ago

Damien ERNST Thanks a lot for receiving Florent Delgrange and me to present our research on POMDPs. We had a great time chatting with you and your team Gaspard Lambrechts , Pascal Leroy and the others! 📄arxiv.org/abs/2303.03284

thumb_up_off_alt8

chat_bubble_outline0

repeat4

shareShare

Pablo Samuel Castro

@pcastr

a year ago

Yes! Bisimulation FTW! Some papers to get you started: Bisim relations: sciencedirect.com/science/articl… Bisim metrics: arxiv.org/abs/1207.4114 Using these in deep RL: arxiv.org/abs/2106.08229

thumb_up_off_alt36

chat_bubble_outline2

repeat3

shareShare

Raphael Avalos

@raphael_avalos

a year ago

Arrived at #ICLR2024 with Florent Delgrange to present our work "The Wasserstein Believer: Learning Belief Updates for Partially Observable MDPs through Reliable Latent Space Models".

Arrived at #ICLR2024 with <a href="/f_delgrange/">Florent Delgrange</a> to present our work "The Wasserstein Believer: Learning Belief Updates for Partially Observable MDPs through Reliable Latent Space Models".

thumb_up_off_alt11

chat_bubble_outline3

repeat2

shareShare

Raphael Avalos

@raphael_avalos

a year ago

Poster session now ! We are waiting for you with Florent Delgrange at the poster 158 ! #ICLR2024

thumb_up_off_alt5

chat_bubble_outline0

repeat1

shareShare

Willem Röpke

@willem_ropke

5 months ago

Exciting news! My paper on multi-objective reinforcement learning was accepted at AAMAS 2025! We introduce IPRO (Iterated Pareto Referent Optimisation)—a principled approach to solving multi-objective problems. 🔗 Paper: arxiv.org/abs/2402.07182 💻 Code: github.com/wilrop/ipro

thumb_up_off_alt28

chat_bubble_outline3

repeat6

shareShare

Guy Avni

@guyavni

4 months ago

Sit back, relax, and let me tell you the story of our paper in @aamasconf, by far the longest project I’ve been involved in; starting in the days of yore, just weeks before COVID hit. With F. Delange, C. Schilling, A. Lukina, A. Nowe, and G. Perez. arxiv.org/abs/2402.13785

thumb_up_off_alt2

chat_bubble_outline1

repeat1

shareShare

Raphael Avalos

@raphael_avalos

2 months ago

Last week, I wrapped up my internship cohere, where I had the chance to work with fantastic people on RL for LLMs. It was an amazing 6 months, and I'm excited to share one of the outcomes: ShiQ, a Q-value based RL algorithm for fine-tuning LLMs 🚀 🧵Details in Irem Ergün's post!

thumb_up_off_alt31

chat_bubble_outline0

repeat3

shareShare

Mark Santolucito

@marksantolucito

3 days ago

RL, formal methods, and program synthesis from Florent Delgrange at SYNT CAV . What more could I ask for

RL, formal methods, and program synthesis from Florent Delgrange at SYNT <a href="/confCAV/">CAV</a> . What more could I ask for

thumb_up_off_alt10

chat_bubble_outline1

repeat2

shareShare