Jiayuan Mao (@maojiayuan)'s Twitter Profile
Jiayuan Mao

@maojiayuan

PhD Student at @MIT_LISLab/@MITCoCoSci

ID: 1054159620

Website: http://jiayuanm.com · Joined: 02-01-2013 04:18:41

21 Tweets

1.1K Followers

163 Following

Zhutian Yang (@zhutianyang_) 's Twitter Profile Photo

Our #CoRL2023 paper shows that by composing the energies of diffusion models, each trained to sample for an individual constraint type such as collision avoidance, spatial relations, or physical stability, we can solve novel combinations of known constraints. diffusion-ccsp.github.io (🧵 1/N)
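As a rough illustration of the compositional idea (not the authors' code): each constraint type contributes an energy term, and the composed distribution is sampled by descending the summed energy. The two energies below are hand-written stand-ins for the learned diffusion energies:

```python
# Minimal sketch of composing per-constraint energies, with Langevin-style
# sampling on the sum. Both energy functions are illustrative stand-ins.
import torch

def collision_free_energy(poses, radius=0.5):
    # Penalize pairs of 2D object centers closer than 2 * radius.
    d = torch.cdist(poses, poses) + torch.eye(len(poses)) * 1e6
    return torch.relu(2 * radius - d).sum()

def left_of_energy(poses, a=0, b=1):
    # Spatial relation: object a should end up left of object b.
    return torch.relu(poses[a, 0] - poses[b, 0] + 0.1)

def compose_and_sample(energies, n_objects=3, steps=200, lr=0.05, noise=0.01):
    poses = torch.randn(n_objects, 2, requires_grad=True)
    for _ in range(steps):
        energy = sum(e(poses) for e in energies)  # composition = summed energy
        grad, = torch.autograd.grad(energy, poses)
        with torch.no_grad():
            poses -= lr * grad + noise * torch.randn_like(poses)  # Langevin step
    return poses.detach()

# Sample object placements satisfying both constraints jointly.
print(compose_and_sample([collision_free_energy, left_of_energy]))
```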

Yilun Du (@du_yilun) 's Twitter Profile Photo

Introducing a way to convert synthesized robot videos to robot execution without using any action labels! flow-diffusion.github.io We also release a codebase (with pretrained models) for text-to-video generation. Train your own models for robot control in only 1 day with 4 GPUs!
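One hedged sketch of how execution can work without action labels: synthesize a video of the task, estimate dense flow between consecutive frames, and read off end-effector motion from the flow around the gripper. `estimate_flow` and the gripper mask below are illustrative stand-ins, not the released codebase's API:

```python
# Toy sketch: recover 2D motion commands from a synthesized video via flow.
import numpy as np

def estimate_flow(frame_a, frame_b):
    # Stand-in dense optical flow; a real system would use RAFT or similar.
    return np.random.randn(*frame_a.shape[:2], 2) * 0.1

def flow_to_action(flow, gripper_mask):
    # Average flow over the gripper region approximates end-effector motion.
    return flow[gripper_mask].mean(axis=0)

video = [np.zeros((64, 64, 3)) for _ in range(5)]   # frames from a video model
mask = np.zeros((64, 64), dtype=bool)
mask[30:34, 30:34] = True                           # assumed gripper region
actions = [flow_to_action(estimate_flow(a, b), mask)
           for a, b in zip(video, video[1:])]
print(actions[0])  # (dx, dy) command for the first step
```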

Joy Hsu (@joycjhsu) 's Twitter Profile Photo

What’s left w/ foundation models? We found that they still can't ground modular concepts across domains. We present Logic-Enhanced FMs:🤝FMs & neuro-symbolic concept learners. We learn abstractions of concepts like “left” across domains & do domain-independent reasoning w/ LLMs.
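A toy sketch of the neuro-symbolic split being described (assumed structure, not the paper's implementation): language is parsed into a small program, and a concept module grounds a symbol like "left" on object features, so the same grounding transfers across domains:

```python
# Minimal executor for a symbolic program over detected objects; the concept
# module for "left" is hand-written here rather than learned.
def left(a, b):
    # Domain-independent grounding of "left": compare x-coordinates.
    return a["x"] < b["x"]

def execute(program, objects):
    # Tiny executor for programs like ("left", 0, 1).
    op, *args = program
    return {"left": left}[op](*(objects[i] for i in args))

objects = [{"x": 0.2}, {"x": 0.7}]       # e.g., from a detector in any domain
print(execute(("left", 0, 1), objects))  # True: object 0 is left of object 1
```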

Nishanth Kumar (@nishanthkumar23) 's Twitter Profile Photo

Ever heard about "Bilevel Planning" or "Task and Motion Planning", but been unsure what those words mean? Ever wanted a gentle intro to these methods so you can just understand what's going on? Our new blog post might help! lis.csail.mit.edu/bilevel-planni…
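For readers who want the one-screen version before the blog post: bilevel planning searches over symbolic action skeletons at the high level and tries to refine each skeleton into continuous motions at the low level, backtracking when refinement fails. A minimal sketch with stand-in helpers:

```python
# Hedged sketch of a bilevel (task and motion) planning loop; all helpers
# are illustrative placeholders.
import random

def high_level_plans():
    # Candidate symbolic skeletons (e.g., from a classical task planner).
    yield ["pick(A)", "place(A, table)"]
    yield ["push(A)", "pick(A)", "place(A, table)"]

def refine(action):
    # Stand-in motion refinement: sampling a feasible trajectory may fail.
    return random.random() > 0.3

def bilevel_plan():
    for skeleton in high_level_plans():       # outer, symbolic level
        if all(refine(a) for a in skeleton):  # inner, continuous level
            return skeleton                   # every step was refinable
    return None                               # exhausted all skeletons

print(bilevel_plan())
```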

Jiayuan Mao (@maojiayuan) 's Twitter Profile Photo

Definitely one of my top 3 favourite papers :) It marries deep learning with a minimal set of universal grammar rules for grounded language learning. It draws inspiration from lexicalist linguistics and cognitive science (bootstrapping from core knowledge).

Jiayuan Mao (@maojiayuan) 's Twitter Profile Photo

Check out our new framework that automatically generates planning domain knowledge with LLMs, learns to ground it, and verifies it through interaction. We believe that learning such verifiable and compositional planning representations from language is important for embodied AI!
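A hedged sketch of the generate-ground-verify loop described above, with the LLM query and the environment reduced to placeholders (these are not the paper's actual interfaces):

```python
# Toy version of: LLM proposes a planning operator, the agent grounds it,
# and interaction checks whether its predicted effects actually hold.
def llm_propose_operator(task):
    # Stand-in for prompting an LLM to write a PDDL-like operator.
    return {"name": "pick", "pre": ["reachable(x)"], "eff": ["holding(x)"]}

def verify_by_interaction(op, env_rollout):
    # Execute in the environment; reject operators whose effects don't hold.
    return all(e in env_rollout(op) for e in op["eff"])

op = llm_propose_operator("put the mug in the sink")
ok = verify_by_interaction(op, lambda o: {"holding(x)"})  # toy environment
print(op["name"], "verified:", ok)
```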

Fangchen Liu (@fangchenliu_) 's Twitter Profile Photo

Can we leverage VLMs for robot manipulation in the open world? Check out our new work MOKA, a simple and effective visual prompting method!

Fangchen Liu (@fangchenliu_) 's Twitter Profile Photo

The key idea is to query GPT-4V to perform multiple choice from a set of keypoints and waypoints. Here is an example: suppose the current task is to sweep the trash bag off the table. We mark the involved objects with a set of points, and overlay the image with grids.

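A minimal sketch of this style of visual prompting (the VLM call is a stub, and the annotation scheme is simplified relative to MOKA): draw labeled candidate points on the image, then ask the model to choose among them:

```python
# Toy keypoint-annotation pipeline for multiple-choice visual prompting.
from PIL import Image, ImageDraw

def mark_candidates(image, points):
    # Overlay labeled candidate keypoints so a VLM can refer to them by name.
    draw = ImageDraw.Draw(image)
    for label, (x, y) in points.items():
        draw.ellipse((x - 4, y - 4, x + 4, y + 4), outline="red")
        draw.text((x + 6, y - 6), label, fill="red")
    return image

def query_vlm(image, question, choices):
    # Stand-in for a GPT-4V-style multiple-choice query.
    return choices[0]

img = Image.new("RGB", (128, 128), "white")
points = {"P1": (40, 60), "P2": (90, 70)}   # candidate keypoints/waypoints
annotated = mark_candidates(img, points)
choice = query_vlm(annotated, "Which point should the gripper push from?",
                   list(points))
print("selected keypoint:", choice)
```
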
Chen Wang (@chenwang_j) 's Twitter Profile Photo

Can we use wearable devices to collect robot data without actual robots? Yes! With a pair of gloves 🧤! Introducing DexCap, a portable hand motion capture system that collects 3D data (point cloud + finger motion) for training robots with dexterous hands. Everything is open-sourced!
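As a rough sketch of what such a capture might record per frame (an assumed schema, not the released DexCap format):

```python
# Hypothetical per-frame record for glove-based mocap data.
from dataclasses import dataclass
import numpy as np

@dataclass
class MocapFrame:
    points: np.ndarray         # (N, 3) point cloud from a body-mounted camera
    finger_joints: np.ndarray  # per-finger joint angles from the glove
    wrist_pose: np.ndarray     # 4x4 wrist pose in the world frame

frame = MocapFrame(points=np.zeros((1024, 3)),
                   finger_joints=np.zeros(20),
                   wrist_pose=np.eye(4))
print(frame.points.shape, frame.finger_joints.shape)
```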

Yilun Du (@du_yilun) 's Twitter Profile Photo

Introducing our @icml_conf paper: Learning Iterative Reasoning through Energy Diffusion! We formulate reasoning as optimizing a sequence of energy landscapes. This enables us to solve harder problems at test time with more complex optimization. Website: energy-based-model.github.io/ired/
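A toy version of the "reasoning as energy optimization" idea, with the learned energy replaced by a hand-written one: descend a sequence of progressively sharper landscapes, spending more optimization steps on harder problems:

```python
# Sketch: iterative reasoning as gradient descent on annealed energies.
import torch

def energy(x, sharpness):
    # Toy landscape whose minimum encodes the "answer" x = 2.0.
    return sharpness * (x - 2.0) ** 2

x = torch.tensor(0.0, requires_grad=True)
for sharpness in [0.1, 0.5, 1.0, 5.0]:   # coarse-to-fine landscape sequence
    for _ in range(50):                  # more steps = more test-time compute
        e = energy(x, sharpness)
        g, = torch.autograd.grad(e, x)
        with torch.no_grad():
            x -= 0.1 * g
print(x.item())  # converges near 2.0
```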

Joy Hsu (@joycjhsu) 's Twitter Profile Photo

What makes a maze look like a maze? Humans can reason about infinitely many instantiations of mazes—made of candy canes, sticks, icing, yarn, etc. But VLMs often struggle to make sense of such visual abstractions. We improve VLMs' ability to interpret these abstract concepts.

Jiayuan Mao (@maojiayuan) 's Twitter Profile Photo

Excited to announce our upcoming workshop: “Planning in the Era of LLMs”! 🌐🤖 Join us as we explore the intersection of planning representations, algorithms, and the power of LLMs. Let’s discuss how combining these insights and solutions can unlock new capabilities in AI.

Joy Hsu (@joycjhsu) 's Twitter Profile Photo

Excited to bring back the 2nd Workshop on Visual Concepts at #CVPR2025, this time with a call for papers! We welcome submissions on the following topics. See our website for more info: sites.google.com/stanford.edu/w… Join us & a fantastic lineup of speakers in Tennessee!

Haonan Chen (@haonanchen_) 's Twitter Profile Photo

Excited to organize the Workshop on Learning Meets Model-Based Methods for Contact-Rich Manipulation @ ICRA 2025! We welcome submissions on a range of topics—check out our website for details: contact-rich.github.io Join us for an incredible lineup of speakers! #ICRA2025

Manling Li (@manlingli_) 's Twitter Profile Photo

Tutorial on "Foundation Models Meet Embodied Agents", with Yunzhu Li Jiayuan Mao Wenlong Huang . We categorize LLMs, VLMs, VLAs and their usages under a MDP formulation. Come and talk with us! Also on Zoom! …models-meet-embodied-agents.github.io Feb 25 8:30-12:30 Location: 118A room,

Tutorial on "Foundation Models Meet Embodied Agents", with <a href="/YunzhuLiYZ/">Yunzhu Li</a> <a href="/maojiayuan/">Jiayuan Mao</a> <a href="/wenlong_huang/">Wenlong Huang</a> . 

We categorize LLMs, VLMs, VLAs and their usages under a MDP formulation.

Come and talk with us! Also on Zoom!

…models-meet-embodied-agents.github.io
Feb 25 8:30-12:30
Location: 118A room,
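A hedged paraphrase of that framing in code: different foundation models slot into different parts of an embodied agent's MDP loop. The interfaces below are illustrative, not the tutorial's definitions:

```python
# Toy agent showing where LLMs, VLMs, and VLAs sit in an MDP-style loop.
from dataclasses import dataclass
from typing import Callable

@dataclass
class EmbodiedAgent:
    perceive: Callable  # VLM: raw observation -> state description
    plan: Callable      # LLM: state description -> subgoal / instruction
    act: Callable       # VLA: observation + instruction -> low-level action

agent = EmbodiedAgent(
    perceive=lambda obs: "mug on table",
    plan=lambda state: "pick up the mug",
    act=lambda obs, instr: [0.1, 0.0, -0.2],  # e.g., an end-effector delta
)
print(agent.act("img", agent.plan(agent.perceive("img"))))
```
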
Shao-Hua Sun (@shaohua0116) 's Twitter Profile Photo

We will organize an #ICML2025 Workshop on Programmatic Representations for Agent Learning, bringing together experts in decision-making and code generation to explore how structured representations can make agent learning more interpretable, generalizable, efficient, and safe!

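As a toy example of what a programmatic representation buys you: a policy written as a short, inspectable program rather than an opaque network (purely illustrative, not from the workshop):

```python
# An interpretable, programmatic policy for a simple navigation agent.
def policy(obs):
    if obs["obstacle_ahead"]:
        return "turn_left"      # rule is readable and easy to verify
    if obs["goal_dx"] > 0:
        return "move_right"
    return "move_forward"

print(policy({"obstacle_ahead": False, "goal_dx": 1.0}))
```
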
Jiayuan Mao (@maojiayuan) 's Twitter Profile Photo

Excited to see more work at the intersection of programmatic representations, program synthesis, and learning! If you’re exploring programmatic representations for agents, consider submitting to our ICML & RLC workshops!