Raphael Avalos (@raphael_avalos) Twitter Tweets • TwiCopy

Raphael Avalos

@raphael_avalos

+ Follow

Intern @cohere |
PhD Student @aibrussels | @FWOVlaanderen

ID: 1127834279201914880

linkhttp://avalos.fr calendar_today13-05-2019 07:13:41

53 Tweet

182 Followers

353 Following

Raphael Avalos

@raphael_avalos

2 years ago

Poster session now ! We are waiting for you with Florent Delgrange at the poster 158 ! #ICLR2024

thumb_up_off_alt5

chat_bubble_outline0

repeat1

shareShare

Raphael Avalos

@raphael_avalos

2 years ago

If you are attending #ICLR2024 workshops go checkout this cool work !

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Presenting work on synthetic preference generation at two #ICLR2024 workshops today: DPFM & GenAI4DM GenAI4DM Workshop. Come say hi to find out how to improve your reward model without collecting additional human feedback!

Presenting work on synthetic preference generation at two #ICLR2024 workshops today: DPFM & GenAI4DM <a href="/genai4dm/">GenAI4DM Workshop</a>.

Come say hi to find out how to improve your reward model without collecting additional human feedback!

thumb_up_off_alt20

chat_bubble_outline0

repeat2

shareShare

Willem Röpke

@willem_ropke

a year ago

Okay people, I need some help. We’re working on a project and have been stuck for a while. My final guess for what the issue may be is that gradients are not flowing as we would want them. Does anyone have a intuitive visualisation/debugging tool for gradient flows in jax?

thumb_up_off_alt3

chat_bubble_outline0

repeat1

shareShare

Hector Kohler

@kohler_hector

a year ago

RL_Conference was a blast and I caught up with some of the usual suspects from european RL Claire Vernade @claireve.bsky.app Antonin Raffin Raphael Avalos Gaspard Lambrechts Riccardo Zamboni. See you all at EWRL 2024. Looking forward to next year's edition!! 🥳🧠

thumb_up_off_alt8

chat_bubble_outline0

repeat3

shareShare

Florent Delgrange

@f_delgrange

a year ago

Two weeks ago, I publicly defended my PhD thesis, entitled « Activating Formal Verification of Deep Reinforcement Learning Policies by Model Checking Bisimilar Latent Space Models ». 📚 The full dissertation is available here: tinyurl.com/formarl (1/n)

thumb_up_off_alt5

chat_bubble_outline1

repeat1

shareShare

Raphael Avalos

@raphael_avalos

a year ago

Starting my internship at cohere today to work on LLMs! I'll be in Paris a couple of days a week, so if anyone wants to meet up, let me know!

thumb_up_off_alt28

chat_bubble_outline0

repeat0

shareShare

Raphael Avalos

@raphael_avalos

a year ago

The X account and website for the next edition of the ALA workshop is live! Follow it to get all the updates :)

thumb_up_off_alt3

chat_bubble_outline0

repeat0

shareShare

Raphael Avalos

@raphael_avalos

a year ago

Don't miss the opportunity to submit your (Multi-Agent) RL work to the ALA workshop!

thumb_up_off_alt4

chat_bubble_outline0

repeat2

shareShare

Willem Röpke

@willem_ropke

10 months ago

Exciting news! My paper on multi-objective reinforcement learning was accepted at AAMAS 2025! We introduce IPRO (Iterated Pareto Referent Optimisation)—a principled approach to solving multi-objective problems. 🔗 Paper: arxiv.org/abs/2402.07182 💻 Code: github.com/wilrop/ipro

thumb_up_off_alt28

chat_bubble_outline3

repeat6

shareShare

Raphael Avalos

@raphael_avalos

9 months ago

Excited to share the technical report on Command R7B (7B) and Command A (111B), our flagship model! These models are the result of incredible teamwork at cohere, and it was an honor to be part of it. Report: cohere.com/research/paper…

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

Andrew Zhao

@andrewz45732491

7 months ago

Okay, I was definitely not vague posting

thumb_up_off_alt433

chat_bubble_outline7

repeat29

shareShare

Raphael Avalos

@raphael_avalos

7 months ago

🚀 Excited to share the 3rd outcome of my internship at @CohereAI: a new RL algo for agentic LLMs that combines policy learning and world modeling, letting agents verify actions before executing them. Check out the 🧵 and 📄! Big thanks to my co-authors and Cohere’s RL team 🙏

thumb_up_off_alt19

chat_bubble_outline0

repeat2

shareShare

Raphael Avalos

Raphael Avalos

Raphael Avalos

Alizée Pace

Willem Röpke

Hector Kohler

Florent Delgrange

Raphael Avalos

Raphael Avalos

Raphael Avalos

Willem Röpke

Raphael Avalos

Andrew Zhao

Raphael Avalos