Haoyu Xiong (@haoyu_xiong_) 's Twitter Profile
Haoyu Xiong

@haoyu_xiong_

Currently @Stanford. Incoming PhD @MIT EECS. Opinions my own. #robotlearning

ID: 1159552879453020160

haoyu-x.github.io · Joined 08-08-2019 19:52:04

868 Tweets

2.2K Followers

2.2K Following

Mihir Prabhudesai (@mihirp98) 's Twitter Profile Photo

🚨 The era of infinite internet data is ending, so we ask:

👉 What's the right generative modelling objective when data, not compute, is the bottleneck?

TL;DR:

▶️ Compute-constrained? Train autoregressive models.

▶️ Data-constrained? Train diffusion models.

Get ready for 🤿 1/n
Mihir Prabhudesai (@mihirp98) 's Twitter Profile Photo

Extrapolating this trend to robotics, I believe that if you are doing sim2real you should prefer autoregressive > diffusion (compute bottleneck), but if you are doing real-world training then autoregressive < diffusion (data bottleneck). We don't empirically validate this for
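The decision rule in this thread can be sketched as a toy helper. Everything here (the function name, the epoch-count threshold of 1.0) is a hypothetical illustration of the TL;DR above, not something taken from the paper:

```python
def pick_objective(unique_tokens: float, compute_budget_tokens: float) -> str:
    """Toy version of the thread's TL;DR.

    If your compute budget would make you repeat the dataset many times
    (epochs > 1), data is the bottleneck -> diffusion. If you cannot even
    finish one pass, compute is the bottleneck -> autoregressive.
    The threshold of 1.0 is an arbitrary illustrative choice.
    """
    epochs = compute_budget_tokens / unique_tokens
    return "diffusion" if epochs > 1.0 else "autoregressive"


# Sim2real: the simulator generates near-unlimited data, so compute binds first.
print(pick_objective(unique_tokens=1e12, compute_budget_tokens=1e11))  # autoregressive
# Real-world robot training: demos are scarce, so data binds first.
print(pick_objective(unique_tokens=1e8, compute_budget_tokens=1e11))   # diffusion
```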

Haoyu Xiong (@haoyu_xiong_) 's Twitter Profile Photo

Just reread the tidybot2.github.io docs today. What an incredible tutorial for building a robot system! Honestly, you could set up an entire new robot lab just by following it; Jimmy Wu even gives you the link to the screwdriver he used 😂

Tim Schneider (@timschneider94) 's Twitter Profile Photo

Pushing for #icra but still missing real robot experiments? 😰
Skip the ROS headaches and get your Franka robot running in minutes with franky! 🦾
Super beginner-friendly, Pythonic, and fast to set up.
🔗 github.com/TimSchneider42…
Intelligent Autonomous Systems Group Jan Peters

🧵👇
MetaStoneAI (@themetastoneai) 's Twitter Profile Photo

🚀 Introducing XBai o4: a milestone in our 4th-generation open-source technology based on parallel test-time scaling!
In its medium mode, XBai o4 now fully outperforms OpenAI o3-mini. 📈

🔗 Open-source weights: huggingface.co/MetaStoneTec/X… ✅
GitHub link: github.com/MetaStone-AI/X…
Shawn Shen (@shawn_shen_oix) 's Twitter Profile Photo

Now, with our internet clip search tool, curating a training dataset no longer takes weeks. Within seconds, you get the exact clips inside the video, perfectly labeled. E.g., training a world model needs PoV data; here is how:

Yanjie Ze (@zeyanjie) 's Twitter Profile Photo

Excited to open-source GMR: General Motion Retargeting. Real-time human-to-humanoid retargeting on your laptop. Supports diverse motion formats & robots. Unlock whole-body humanoid teleoperation (e.g., TWIST). video with 🔊

Google DeepMind (@googledeepmind) 's Twitter Profile Photo

What if you could not only watch a generated video, but explore it too? 🌐 Genie 3 is our groundbreaking world model that creates interactive, playable environments from a single text prompt. From photorealistic landscapes to fantasy realms, the possibilities are endless. 🧵

Lili (@lchen915) 's Twitter Profile Photo

Self-Questioning Language Models: LLMs that learn to generate their own questions and answers via asymmetric self-play RL.

There is no external training data – the only input is a single prompt specifying the topic.
Skywork (@skywork_ai) 's Twitter Profile Photo

Matrix-Game 2.0: the FIRST open-source, real-time, long-sequence interactive world model. Last week, DeepMind's Genie 3 shook the AI world with real-time interactive world models. But... it wasn't open-sourced. Today, Matrix-Game 2.0 changed the game. 🚀 25 FPS. Minutes-long

Jiafei Duan (@djiafei) 's Twitter Profile Photo

Reasoning is central to purposeful action. Today we introduce MolmoAct — a fully open Action Reasoning Model (ARM) for robotics. Grounded in large-scale pre-training with action reasoning data, every predicted action is interpretable and user-steerable via visual trace. We are

kaan doğrusöz (@kaandogrusoz) 's Twitter Profile Photo

Demos demos demos! Reminiscing on how we got started when we publicly demoed our first VLA autonomously folding t-shirts at @YCombinator's demo day, with the first version of Isaac that we built in our living room with Evan Wineland. We've learned a lot since then.

Zhanyi S (@s_zhanyi) 's Twitter Profile Photo

How to prevent behavior cloning policies from drifting OOD on long horizon manipulation tasks? Check out Latent Policy Barrier (LPB), a plug-and-play test-time optimization method that keeps BC policies in-distribution with no extra demo or fine-tuning: project-latentpolicybarrier.github.io

Haoyu Xiong (@haoyu_xiong_) 's Twitter Profile Photo

I’m happy to share that I’ve recently moved to Boston to start my PhD at MIT CSAIL! Excited to hack on some cool robots! Let me know if you are around; let’s chat about AI, robotics, food in Boston, or anything else!

Bonnie Li (@bonniesjli) 's Twitter Profile Photo

We can now train AI inside the mind of another AI. 🤯 🌍 Our world model, Genie 3, imagines and generates new worlds on the fly. 🤖 Our embodied agent, Sima, is dropped in and learns to navigate them autonomously. The entire loop, from the environment to the action, is generated

Ken Liu (@kenziyuliu) 's Twitter Profile Photo

New paper! We explore a radical paradigm for AI evals: assessing LLMs on *unsolved* questions.

Instead of contrived exams where progress ≠ value, we eval LLMs on organic, unsolved problems via reference-free LLM validation & community verification. LLMs solved ~10/500 so far:
Binghao Huang (@binghao_huang) 's Twitter Profile Photo

How does high-fidelity tactile simulation help robots nail the last millimeter? We’re releasing VT-Refine, accepted to CoRL: a real-to-sim-to-real visuo-tactile policy using a GPU-parallel tactile sim for our piezoresistive skin FlexiTac. Then fine-tuning a diffusion policy with

Haoyu Xiong (@haoyu_xiong_) 's Twitter Profile Photo

Working with Homanga has been a wonderful experience; my first paper was published with him a few years ago. Don't miss the chance to apply to Homanga's lab if you're interested in robot learning!