Moritz Reuss (@moritz_reuss)'s Twitter Profile
Moritz Reuss

@moritz_reuss

PhD @KITKarlsruhe working on Robot Learning

ID: 1714301210159661056

Link: https://mbreuss.github.io · Joined: 17-10-2023 15:24:17

19 Tweets

122 Followers

288 Following

Nikos Gkanatsios (@nikos_gkanats)'s Twitter Profile Photo

3D representations are more critical than we thought for manipulation. Our work 3D Diffuser Actor marries them with policy diffusion and achieves a new SOTA on both RLBench and CALVIN! 3d-diffuser-actor.github.io 🦾 with Tsung-Wei Ke and Katerina Fragkiadaki

Oier Mees (@oier_mees)'s Twitter Profile Photo

Tired of labeling your robot data?🤖 Excited to present NILS, a novel approach that leverages foundation models 🧠 to segment videos into tasks and generate semantically meaningful natural language annotations 💬 at varying levels of granularity! Web: robottasklabeling.github.io

Jyo Pari (@jyo_pari)'s Twitter Profile Photo

Turn a single pre-trained model’s layers into MoE “experts” and reuse them? Finetuning a “router” slightly cuts loss—cool proof of concept. Can we combine dynamic compute paths/reuse + coconut-like latent reasoning? jyopari.github.io/posts/reuse
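
The layer-reuse idea in this post can be sketched in a few lines: one frozen "pretrained" layer is shared across every expert slot of an MoE block, and only a tiny router (the gate logits) is trainable. This is a minimal illustrative sketch, not the post's implementation; all names (`shared_layer`, `Router`, `moe_forward`) and the toy frozen weights are assumptions.

```python
import math

DIM = 4
NUM_EXPERTS = 3

# A frozen "pretrained layer": here just a fixed linear map (weights never trained).
W = [[0.5 if i == j else 0.1 for j in range(DIM)] for i in range(DIM)]

def shared_layer(x):
    # The same frozen weights are reused by every expert slot.
    return [sum(W[i][j] * x[j] for j in range(DIM)) for i in range(DIM)]

class Router:
    # The only trainable part: one logit per expert
    # (input-independent here for brevity).
    def __init__(self):
        self.logits = [0.0] * NUM_EXPERTS

    def weights(self):
        # Softmax over expert logits -> mixing weights that sum to 1.
        m = max(self.logits)
        exps = [math.exp(l - m) for l in self.logits]
        s = sum(exps)
        return [e / s for e in exps]

def moe_forward(router, x):
    gates = router.weights()
    # Every expert wraps the SAME reused layer, so outputs coincide here;
    # reusing different depths per slot would make the slots differ.
    outs = [shared_layer(x) for _ in range(NUM_EXPERTS)]
    return [sum(g * o[i] for g, o in zip(gates, outs)) for i in range(DIM)]

router = Router()
y = moe_forward(router, [1.0, 0.0, 0.0, 0.0])
```

Only `router.logits` would receive gradients in a real finetuning setup; the reused layer stays frozen, which is what makes the router-only loss reduction a proof of concept.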

Nico Bohlinger (@nicobohlinger)'s Twitter Profile Photo

⚡️ Do you think training robot locomotion needs large scale simulation? Think again! Our new paper shows how to train an omnidirectional locomotion policy directly on a real quadruped robot in just a few minutes 🚀 Top speeds of 0.85 m/s, two different control approaches, indoor

Moritz Reuss (@moritz_reuss)'s Twitter Profile Photo

Happy to share that I am a recipient of the 2025 Apple Scholars in AI/ML PhD Fellowship Program! Grateful to @apple for this recognition and the support. Thank you to all my collaborators, colleagues and to my supervisor who made this possible! machinelearning.apple.com/updates/apple-…

Jyo Pari (@jyo_pari)'s Twitter Profile Photo

Llama 4 (Meta) shows too much SFT limits RL exploration — something we also found in our recent work! A new and superior pretraining paradigm is around the corner to unleash a new era of reasoning. Check out our paper: arxiv.org/abs/2502.19402 Thread: x.com/pulkitology/st…

Nico Bohlinger (@nicobohlinger)'s Twitter Profile Photo

⚡️ Can one policy control 1000 different robots? 🤖 We explore Embodiment Scaling Laws: Training on more diverse robot embodiments boosts generalization 📈 Our generalist policy, trained on 1000 generated robots, zero-shot transfers to the real Go2 quadruped and H1 humanoid 🚀

Ryan Hoque (@ryan_hoque)'s Twitter Profile Photo

Imitation learning has a data scarcity problem. Introducing EgoDex from Apple, the largest and most diverse dataset of dexterous human manipulation to date — 829 hours of egocentric video + paired 3D hand poses across 194 tasks. Now on arxiv: arxiv.org/abs/2505.11709 (1/4)

Anagh Malik (@anagh_malik)'s Twitter Profile Photo

📢📢📢 Neural Inverse Rendering from Propagating Light 💡 Our CVPR Oral introduces the first method for multiview neural inverse rendering from videos of propagating light, unlocking applications such as relighting light propagation videos, geometry estimation, or light

Jyo Pari (@jyo_pari)'s Twitter Profile Photo

What if an LLM could update its own weights? Meet SEAL🦭: a framework where LLMs generate their own training data (self-edits) to update their weights in response to new inputs. Self-editing is learned via RL, using the updated model’s downstream performance as reward.
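
A toy version of the loop this post describes, purely illustrative and not from the SEAL paper: the "model" is a single scalar weight, a self-edit is a pseudo-target the model proposes for itself, applying the edit is one SGD step on that self-generated data, and the RL step is replaced by a simple hill-climb over candidate edit policies scored by downstream reward. All names and constants here are assumptions for the sketch.

```python
import random

random.seed(0)

TRUE_TARGET = 3.0   # hidden downstream task: predict 3.0
LR = 0.5

def apply_self_edit(weight, pseudo_target):
    # "Update own weights" on self-generated data: one SGD step on MSE
    # toward the self-proposed pseudo_target.
    grad = 2 * (weight - pseudo_target)
    return weight - LR * grad

def downstream_reward(weight):
    # Reward = negative squared error on the held-out downstream task,
    # measured AFTER the self-edit is applied (as in the post).
    return -(weight - TRUE_TARGET) ** 2

weight = 0.0
# Edit policy = an offset added to the current weight when proposing a
# pseudo-target. Stand-in for RL: sample candidate policies, keep the one
# whose post-update downstream reward is best.
best_offset, best_reward = 0.0, downstream_reward(weight)
for _ in range(200):
    offset = random.uniform(-5, 5)
    updated = apply_self_edit(weight, weight + offset)
    r = downstream_reward(updated)
    if r > best_reward:
        best_offset, best_reward = offset, r

final_weight = apply_self_edit(weight, weight + best_offset)
```

The key structural point the sketch preserves is the indirection: the reward never scores the self-edit itself, only the model that results from applying it.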
