shiemannor (@shiemannor) 's Twitter Profile

Prof@Technion, Researcher@Nvidia, Founder@Jether Energy. Trying to get machine learning to really work.


Joined: 24-12-2018 12:08:55

16 Tweets

275 Followers

12 Following

Guy Tennenholtz (@guytenn) 's Twitter Profile Photo

Check out our most recent work "On Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning", where we talk about an important challenge of using expert data with hidden covariates. arxiv.org/pdf/2110.06539…

Chen Tessler (@chentessler) 's Twitter Profile Photo

1/ Excited to share that our latest work, Conditional Adversarial Latent Models [CALM], has been accepted to ACM SIGGRAPH 2023. 🧵👇 #reinforcementlearning #animation #games #isaacgym #siggraph2023 NVIDIA AI

Chen Tessler (@chentessler) 's Twitter Profile Photo

3/ What new capabilities does this unlock? Linearly interpolating between two motions (in the latent space) produces semantically meaningful transitions.
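As an illustrative sketch only (not the authors' code), linear interpolation between two motion latents could look like the following, assuming the latents are plain NumPy vectors; the names `z_walk` and `z_crouch` are hypothetical:

```python
import numpy as np

def lerp_latents(z_a, z_b, alpha):
    """Linearly interpolate between two latent codes.

    alpha = 0 returns z_a, alpha = 1 returns z_b.
    """
    return (1.0 - alpha) * z_a + alpha * z_b

# Two hypothetical motion latents (e.g., "walk" and "crouch").
rng = np.random.default_rng(0)
z_walk = rng.normal(size=64)
z_crouch = rng.normal(size=64)

# Sweep alpha from 0 to 1 to produce a sequence of intermediate
# latents; decoding each one would yield a blended motion.
blend = [lerp_latents(z_walk, z_crouch, a) for a in np.linspace(0.0, 1.0, 5)]
```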

Chen Tessler (@chentessler) 's Twitter Profile Photo

4/ We can leverage the semantically meaningful latent space for high-level style-conditioned policies. For instance, a high-level policy tasked with moving in a specified direction can be urged to use a specified style via a latent-space similarity reward (here, crouching).
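A minimal sketch of such a similarity reward, under the assumption that it is a cosine similarity between NumPy latent vectors (the exact form used in the paper may differ):

```python
import numpy as np

def style_similarity_reward(z_rollout, z_style):
    """Cosine similarity between the latent induced by the policy's
    current behavior and a target style latent (e.g., 'crouch').

    Added to the task reward (e.g., for moving in a direction), it
    nudges the high-level policy toward the requested style.
    """
    num = float(np.dot(z_rollout, z_style))
    den = float(np.linalg.norm(z_rollout) * np.linalg.norm(z_style))
    return num / den  # in [-1, 1]; identical latents score highest
```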

Chen Tessler (@chentessler) 's Twitter Profile Photo

5/ Combining the ability to control both the style of a motion and the direction in which it is performed, we construct a finite state machine to control the character, specifying both the motion and the direction for each state.
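A toy sketch of such a state machine, with hypothetical state names and motion labels; in a CALM-style controller each command would condition the low-level policy rather than print anything:

```python
# Hypothetical finite state machine: each state picks a motion style
# and a heading, then names its successor state.
fsm = {
    "patrol": {"motion": "walk",   "heading_deg": 0.0,  "next": "sneak"},
    "sneak":  {"motion": "crouch", "heading_deg": 90.0, "next": "strike"},
    "strike": {"motion": "kick",   "heading_deg": 90.0, "next": "patrol"},
}

def step(state_name):
    """Return the (motion, heading) command for the current state
    and the name of the next state."""
    s = fsm[state_name]
    return (s["motion"], s["heading_deg"]), s["next"]

# Drive the character through one full cycle of the machine.
state = "patrol"
for _ in range(3):
    command, state = step(state)
```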

Chen Tessler (@chentessler) 's Twitter Profile Photo

6/ This enables solving unseen tasks in various forms without additional training, overcoming the need for meticulous reward/termination design.

Chen Tessler (@chentessler) 's Twitter Profile Photo

7/ This work was made possible thanks to my amazing co-authors: Yoni Kasten, Yunrong Guo, Shie Mannor, Gal Chechik, and Jason Peng. More videos, a link to the paper, and code (coming real soon!): research.nvidia.com/labs/par/calm/

Gal Dalal (@dalalgal) 's Twitter Profile Photo

We released a multi-agent RL framework for network congestion control with the first public realistic network simulator! github.com/NVlabs/RLCC. Based on the amazing work of Benjamin Fuhrer and Chen Tessler

Aviv Tamar (@avivtamar1) 's Twitter Profile Photo

Want to learn / teach RL? Check out our new book draft: Reinforcement Learning - Foundations (sites.google.com/view/rlfoundat…), with shiemannor and Yishay Mansour. This is a rigorous first course in RL, based on our teaching at TAU CS and Technion ECE.

Aviv Tamar (@avivtamar1) 's Twitter Profile Photo

For teachers, we also have a 40+ page exam booklet on our website. Why this book? There are several other excellent textbooks, including Sutton & Barto, and Bertsekas & Tsitsiklis.

Aviv Tamar (@avivtamar1) 's Twitter Profile Photo

But for teaching RL, we wanted a book that is rigorous (full proofs, analytical examples), covers what we feel is most relevant, and is easy enough for undergrad teaching. The book is a focused one-semester course for advanced undergrad / early grad students, covering key topics in depth.

Aviv Tamar (@avivtamar1) 's Twitter Profile Photo

We hope you find it useful! The book is still a work in progress - we'd be grateful for comments, suggestions, and reports of omissions or errors of any kind, at [email protected]

UriG (@uri_gadot) 's Twitter Profile Photo

Tired of manual #ComfyUI workflow design? While recent methods predict them, our new paper, FlowRL, introduces a Reinforcement Learning framework that learns to generate complex, novel workflows for you! paper [arxiv.org/abs/2505.21478]
