Zhuo Xu (@drzhuoxu) Twitter Tweets • TwiCopy

Gate.io

5 hours ago

🔥The 9th Round of Easy Loan, Earn $40 Reward is in progress❗️ ⏰ Promotion Period: January 15th - Feburary 15th, 2025 👉 Register now and check more details at gate.io/campaigns/358

thumb_up_off_alt34

chat_bubble_outline39

repeat6

shareShare

“A picture is worth a thousand words”, can VLMs also read robot actions better in images than in words? We introduce PIVOT to explore this idea and enable a VLM to zero-shot “find a place to sit down and do writing” by navigating a robot to the room with the light on :)

thumb_up_off_alt48

chat_bubble_outline1

repeat6

shareShare

Zhuo Xu

@drzhuoxu

a year ago

Our interesting findings from exploring the sampling based planning in the era of large VLMs — pivot-prompt.github.io

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Jeff Dean

@jeffdean

a year ago

Gemini 1.5 Pro - A highly capable multimodal model with a 10M token context length Today we are releasing the first demonstrations of the capabilities of the Gemini 1.5 series, with the Gemini 1.5 Pro model. One of the key differentiators of this model is its incredibly long

thumb_up_off_alt6,6K

chat_bubble_outline188

repeat1,1K

shareShare

Cheng Chi

@chichengcc

a year ago

Can we collect robot data without any robots? Introducing Universal Manipulation Interface (UMI) An open-source $400 system from Stanford University designed to democratize robot data collection 0 teleop -> autonomously wash dishes (precise), toss (dynamic), and fold clothes (bimanual)

thumb_up_off_alt1,1K

chat_bubble_outline41

repeat352

shareShare

Toru

@toruo_o

a year ago

Achieving bimanual dexterity with RL + Sim2Real! toruowo.github.io/bimanual-twist/ TLDR - We train two robot hands to twist bottle lids using deep RL followed by sim-to-real. A single policy trained with simple simulated bottles can generalize to drastically different real-world objects.

thumb_up_off_alt217

chat_bubble_outline4

repeat57

shareShare

Tony Z. Zhao

@tonyzzhao

a year ago

Introducing 𝐀𝐋𝐎𝐇𝐀 𝐔𝐧𝐥𝐞𝐚𝐬𝐡𝐞𝐝 🌋 - Pushing the boundaries of dexterity with low-cost robots and AI. Google DeepMind Finally got to share some videos after a few months. Robots are fully autonomous filmed in one continuous shot. Enjoy!

thumb_up_off_alt1,1K

chat_bubble_outline56

repeat333

shareShare

Ayzaan Wahid

@ayzwah

a year ago

For the past year we've been working on ALOHA Unleashed 🌋 @GoogleDeepmind - pushing the scale and dexterity of tasks on our ALOHA 2 fleet. Here is a thread with some of the coolest videos! The first task is hanging a shirt on a hanger (autonomous 1x)

thumb_up_off_alt537

chat_bubble_outline31

repeat110

shareShare

Lucas Beyer (bl16)

@giffmana

a year ago

✨PaliGemma report will hit arxiv tonight. We tried hard to make it interesting, and not "here model. sota results. kthxbye." So here's some of the many interesting ablations we did, check the paper tomorrow for more! 🧶

thumb_up_off_alt843

chat_bubble_outline19

repeat109

shareShare

Zipeng Fu

@zipengfu

a year ago

Introduce Mobility VLA - Google's foundation model for navigation - started as my intern project: - Gemini 1.5 Pro for high-level image & text understanding - topological graphs for low-level navigation - supports multimodal instructions co-lead Zhuo Xu, Lewis Chiang, Jie Tan

thumb_up_off_alt175

chat_bubble_outline3

repeat27

shareShare

Zhuo Xu

@drzhuoxu

a year ago

Today's long context, multimodal models are very good at solving long horizon robotics tasks -- such as navigation.

thumb_up_off_alt7

chat_bubble_outline1

repeat1

shareShare

lmarena.ai (formerly lmsys.org)

@lmarena_ai

a year ago

Exciting News from Chatbot Arena! Google DeepMind's new Gemini 1.5 Pro (Experimental 0801) has been tested in Arena for the past week, gathering over 12K community votes. For the first time, Google Gemini has claimed the #1 spot, surpassing GPT-4o/Claude-3.5 with an impressive

Exciting News from Chatbot Arena!

<a href="/GoogleDeepMind/">Google DeepMind</a>'s new Gemini 1.5 Pro (Experimental 0801) has been tested in Arena for the past week, gathering over 12K community votes.

For the first time, Google Gemini has claimed the #1 spot, surpassing GPT-4o/Claude-3.5 with an impressive

thumb_up_off_alt1,1K

chat_bubble_outline83

repeat410

shareShare

Demis Hassabis

@demishassabis

a year ago

Never seen a competitive leaderboard that I didn't like 😀 Congrats to the Gemini team on ranking no.1 🏆 with our latest improved Gemini 1.5 Pro developer preview model, which you can try on AI studio now!

thumb_up_off_alt967

chat_bubble_outline32

repeat103

shareShare

Zhuo Xu

@drzhuoxu

a year ago

Congrats Pannag Sanketi and team! Had lots of fun playing with the robot!

thumb_up_off_alt2

chat_bubble_outline0

repeat1

shareShare

Zhuo Xu

@drzhuoxu

a year ago

Thank you Demis!

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Robotics Papers

@oww

9 months ago

Imagined Potential Games: A Framework for Simulating, Learning and Evaluating Interactive Behaviors. arxiv.org/abs/2411.03669

thumb_up_off_alt1

chat_bubble_outline0

repeat2

shareShare

Jason Ma

@jasonma2020

9 months ago

Excited to finally share Generative Value Learning (GVL), my Google DeepMind project on extracting universal value functions from long-context VLMs via in-context learning! We discovered a simple method to generate zero-shot and few-shot values for 300+ robot tasks and 50+

thumb_up_off_alt598

chat_bubble_outline9

repeat116

shareShare

Tesla Optimus

@tesla_optimus

8 months ago

Got a new hand for Black Friday

thumb_up_off_alt36,36K

chat_bubble_outline3,3K

repeat5,5K

shareShare

Zhuo Xu

@drzhuoxu

8 months ago

Amazing hardware! The teleop is even more amazing!

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Zhuo Xu

@drzhuoxu

7 months ago

Excited to announce the What Bimanuals Can Do (WBCD) competition at ICRA 2025! We carefully designed challenging and commercially valuable tasks, will provide 15 state of the art bimanual robots and $200k total robot/cash awards! Visit the website to learn more and register ASAP!

thumb_up_off_alt6

chat_bubble_outline0

repeat0

shareShare

FrodoBots

@frodobots

7 months ago

Announcing the 2nd Earth Rover Challenge: an "AI vs Gamers" global navigation competition (to be held #ICRA2025 in May in Atlanta) Co-organized with researchers from Deepmind, Meta & academia A thread 🧵 - 1/n

thumb_up_off_alt57

chat_bubble_outline5

repeat12

shareShare

Zhuo Xu

Gate.io

Wenhao Yu

Zhuo Xu

Jeff Dean

Cheng Chi

Toru

Tony Z. Zhao

Ayzaan Wahid

Lucas Beyer (bl16)

Zipeng Fu

Zhuo Xu

lmarena.ai (formerly lmsys.org)

Demis Hassabis

Zhuo Xu

Zhuo Xu

Robotics Papers

Jason Ma

Tesla Optimus

Zhuo Xu

Zhuo Xu

FrodoBots