Xavier Puig @ ICLR (@xavierpuigf) 's Twitter Profile
Xavier Puig @ ICLR

@xavierpuigf

Research Scientist at FAIR @AIatMeta working on EmbodiedAI | PhD @MIT_CSAIL

ID: 383903725

Link: http://xavierpuigf.com | Joined: 02-10-2011 18:32:10

280 Tweets

1.1K Followers

883 Following

AI at Meta (@aiatmeta) 's Twitter Profile Photo

Additionally, looking towards the future, we're releasing PARTNR: a benchmark for Planning And Reasoning Tasks in humaN-Robot collaboration. Built on Habitat 3.0, it's the largest benchmark of its kind to study and evaluate human-robot collaboration in household activities. By

Manling Li (@manlingli_) 's Twitter Profile Photo

[NeurIPS D&B Oral] Embodied Agent Interface: Benchmarking LLMs for Embodied Agents
A single line of code to evaluate your model!
🌟 Standardize Goal Specifications: LTL
🌟 Standardize Modules and Interfaces: 4 modules, 438 tasks, 1475 goals
🌟 Standardize Fine-grained Metrics: 18
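For context, LTL (linear temporal logic) goal specifications describe properties that must hold over a whole trajectory rather than a single state. Below is a minimal sketch of that idea, with made-up predicates and states; it is not the Embodied Agent Interface's actual API.

```python
# Minimal sketch of LTL-style goal checking over a trace of world states.
# Predicates and states are hypothetical; this is not the benchmark's API.

def eventually(pred, trace):
    """LTL 'F p': pred holds in at least one state of the trace."""
    return any(pred(s) for s in trace)

def always(pred, trace):
    """LTL 'G p': pred holds in every state of the trace."""
    return all(pred(s) for s in trace)

# Toy household task: "eventually the apple is on the table,
# and the stove stays off the whole time".
trace = [
    {"apple_on_table": False, "stove_on": False},
    {"apple_on_table": True,  "stove_on": False},
]

goal_satisfied = (
    eventually(lambda s: s["apple_on_table"], trace)
    and always(lambda s: not s["stove_on"], trace)
)
print(goal_satisfied)  # True
```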

Jiaman Li (@jiaman01) 's Twitter Profile Photo

🤖 Introducing Human-Object Interaction from Human-Level Instructions! First complete system that generates physically plausible, long-horizon human-object interactions with finger motions in contextual environments, driven by human-level instructions. 🔍 Our approach: - LLMs

Tianyu Li EasyPaperSniper (@sniperpaper) 's Twitter Profile Photo

The trained policy can be integrated with a high-level planner for real-world applications. By combining our object manipulation policy with user commands, we demonstrate its effectiveness in real-world scenarios, such as moving large trash carts. (6/8)

Xavier Puig @ ICLR (@xavierpuigf) 's Twitter Profile Photo

🪑 How do you train robots to move furniture? This requires robots to synchronize whole-body movements, making teleoperation or RL approaches challenging. Check out this amazing work by Tianyu Li EasyPaperSniper, using human demonstrations to train robots to move furniture in the real world!

AI at Meta (@aiatmeta) 's Twitter Profile Photo

Meta PARTNR is a benchmark for planning and reasoning in embodied multi-agent tasks. This large-scale human and robot collaboration benchmark was core to our recent demos and also informs our work as scientists and engineers pushing this field of study forward.

Chuanyang Jin (@chuanyang_jin) 's Twitter Profile Photo

How to achieve human-level open-ended machine Theory of Mind? Introducing #AutoToM: a fully automated and open-ended ToM reasoning method combining the flexibility of LLMs with the robustness of Bayesian inverse planning, achieving SOTA results across five benchmarks. 🧵 [1/n]

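For context, Bayesian inverse planning treats an agent's observed actions as evidence about its latent goal and computes a posterior over goals. Here is a toy sketch of that inference step, with made-up goals, actions, and likelihoods; it is not AutoToM's actual implementation.

```python
# Toy Bayesian inverse planning: infer a posterior over an agent's goal from
# observed actions. Goals, actions, and likelihoods are hypothetical.

# Hypothetical goals and a uniform prior over them.
goals = ["get_coffee", "get_water"]
prior = {g: 0.5 for g in goals}

# Observed actions and an assumed likelihood P(action | goal).
observed_actions = ["walk_to_kitchen", "open_coffee_machine"]
likelihood = {
    ("walk_to_kitchen", "get_coffee"): 0.6,
    ("walk_to_kitchen", "get_water"): 0.6,
    ("open_coffee_machine", "get_coffee"): 0.8,
    ("open_coffee_machine", "get_water"): 0.1,
}

# Posterior P(goal | actions) ∝ P(goal) * Π P(action | goal), then normalize.
posterior = {}
for g in goals:
    p = prior[g]
    for a in observed_actions:
        p *= likelihood[(a, g)]
    posterior[g] = p
total = sum(posterior.values())
posterior = {g: p / total for g, p in posterior.items()}
print(posterior)  # "get_coffee" gets most of the probability mass
```
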
Ram Ramrakhya (@ramramrakhya) 's Twitter Profile Photo

🚨 New Preprint 🚨 Embodied agents 🤖 operating in indoor environments must interpret ambiguous and under-specified human instructions. A capable household robot 🤖 should recognize ambiguity and ask relevant clarification questions to infer the user's intent accurately, leading

Xavier Puig @ ICLR (@xavierpuigf) 's Twitter Profile Photo

How do we enable agents to perform tasks even when they are underspecified? In this work, led by Ram Ramrakhya, we train VLA agents via RL to decide when to act in the environment or ask clarifying questions, enabling them to handle ambiguous instructions: ram81.github.io/projects/ask-t…

Xavier Puig @ ICLR (@xavierpuigf) 's Twitter Profile Photo

I will be at ICLR to present PARTNR. Reach out if you want to talk about our work at FAIR or interesting problems in Robotics!

Mandi Zhao (@zhaomandi) 's Twitter Profile Photo

DexMachina lets us perform a functional comparison between different dexterous hands: we evaluate 6 hands on 4 challenging long-horizon tasks, and find that larger, fully actuated hands learn better and faster, and that high DoF is more important than having human-like hand sizes –

Roozbeh Mottaghi (@roozbehmottaghi) 's Twitter Profile Photo

I'll be giving two talks at the #CVPR2025 workshops: 3D LLM/VLA 3d-llm-vla.github.io and POETS poets2024.github.io/poets2025/. 🧵

Xavier Puig @ ICLR (@xavierpuigf) 's Twitter Profile Photo

I will be talking at the #CVPR2025 workshop on Humanoid Agents, tomorrow June 11th at 9:30 am. I will discuss how humanoid agents can help us improve human-robot collaboration. See you there! humanoid-agents.github.io

Tianmin Shu (@tianminshu) 's Twitter Profile Photo

🚀 Excited to introduce SimWorld: an embodied simulator for infinite photorealistic world generation 🏙️ populated with diverse agents 🤖 If you are at #CVPR2025, come check out the live demo 👇
Jun 14, 12:00-1:00 pm at JHU booth, ExHall B
Jun 15, 10:30 am-12:30 pm, #7, ExHall B

Yixuan Wang (@yxwangbot) 's Twitter Profile Photo

🤖 Do VLA models really listen to language instructions? Maybe not 👀 🚀 Introducing our RSS paper: CodeDiffuser -- using VLM-generated code to bridge the gap between **high-level language** and **low-level visuomotor policy** 🎮 Try the live demo: robopil.github.io/code-diffuser/ (1/9)

Xavier Puig @ ICLR (@xavierpuigf) 's Twitter Profile Photo

Check out our workshop on Continual Robot Learning from Humans, at #RSS2025, with amazing speakers covering topics including learning from human visual demonstrations, generative models for continual robot learning, and the role of LLMs in embodied contexts: …-robot-learning-from-humans.github.io