Huaxiu Yao✈️ICLR 2025🇸🇬 (@huaxiuyaoml) 's Twitter Profile
Huaxiu Yao✈️ICLR 2025🇸🇬

@huaxiuyaoml

Assistant Professor of Computer Science @UNC @unccs @uncsdss | Postdoc @StanfordAILab | Ph.D. @PennState | #foundationmodels, #EmbodiedAI, #Alignment | he/him

ID: 1411871581

Link: http://www.huaxiuyao.io/ | Joined: 08-05-2013 03:36:31

418 Tweets

3.3K Followers

566 Following

Chelsea Finn (@chelseabfinn) 's Twitter Profile Photo

Can we prompt robots, just like we prompt language models? With a hierarchy of VLA models + LLM-generated data, robots can:
- reason through long-horizon tasks
- respond to a variety of prompts
- handle situated corrections
Blog post & paper: pi.website/research/hirob…

Mohit Bansal (@mohitban47) 's Twitter Profile Photo

Thank you AAAI for the honor & the fun ceremonies -- humbled to be inducted as an AAAI Fellow in esteemed company 🙏
PS. I am still around today in Philadelphia if anyone wants to meet up at #AAAI2025 :-)

Thanks once again to everyone (students+postdocs+collaborators,
Prithvijit (@prithvijitch) 's Twitter Profile Photo

Join us at the WorldModelBench workshop at #CVPR2025 where we'll tackle systematic evaluation of World Models! Focus: benchmarks, metrics, downstream tasks, and safety. Submit papers now: worldmodelbench.github.io

No Priors (@nopriorspod) 's Twitter Profile Photo

Are we at a turning point in robotics?

New interview with Chelsea Finn, founder of Physical Intelligence: we talk about pi’s approach to robotics foundation models, generalization, data generation, humanoids, and comparisons to self-driving. Links 👇
Peng (Richard) Xia ✈️ ICLR 2025 (@richardxp888) 's Twitter Profile Photo

🚀 Introducing MDocAgent! 🧐📄

📚 Ever struggled with AI that can’t handle complex documents filled with text, images, tables, and figures?

💡 Enter MDocAgent 🧠🤖—a next-gen multi-modal multi-agent framework that revolutionizes document understanding!

#AI #DocQA #LLM #Agent
Huaxiu Yao✈️ICLR 2025🇸🇬 (@huaxiuyaoml) 's Twitter Profile Photo

🚀 Launched MDocAgent—a multi-agent framework that crushes complex documents (text, images, tables!) with specialized AI agents. Outperforms SOTA by 12.1%! See details in Peng (Richard) Xia's thread.

Huaxiu Yao✈️ICLR 2025🇸🇬 (@huaxiuyaoml) 's Twitter Profile Photo

I’ll be at RIKEN Center for Advanced Intelligence Project 🇯🇵 (Apr 19–21) and #ICLR2025 🇸🇬 (Apr 22–28), presenting our latest ICLR work and organizing the FM in the Wild workshop.

If you’d like to chat about (1) VLM / VLA / AI agents or (2) PhD openings in my lab, DM or email me—let’s connect!

My students
Sergey Levine (@svlevine) 's Twitter Profile Photo

π-0.5 is here, and it can generalize to new homes! Some fun experiments with my colleagues at Physical Intelligence, introducing π-0.5 (“pi oh five”). Our new VLA can put dishes in the sink, clean up spills, and do all this in homes that it was not trained in 🧵👇

LLM360 (@llm360) 's Twitter Profile Photo

The MBZUAI IFM and the LLM360 team's first day at ICLR 2026: come visit our new Institute of Foundation Models! Booth D04 in Hall 2!

We’re looking forward to meeting researchers and engineers and introducing them to MBZUAI.
Chelsea Finn (@chelseabfinn) 's Twitter Profile Photo

I'm giving two talks today/Sunday at #ICLR2025!
- Post-Training Robot Foundation Models (Robot Learning Workshop @ 12:50 pm)
- Robot Foundation Models with Open-Ended Generalization (Foundation Models in the Wild @ 2:30 pm)
Will cover π-0, Demo-SCORE, Hi Robot, & π-0.5.

Qwen (@alibaba_qwen) 's Twitter Profile Photo

Introducing Qwen3! 

We are releasing the open-weight Qwen3 family, our latest large language models, including 2 MoE models and 6 dense models ranging from 0.6B to 235B. Our flagship model, Qwen3-235B-A22B, achieves competitive results in benchmark evaluations of coding, math, general
Fan Nie (@fannie1208) 's Twitter Profile Photo

🚀 Excited to share that “#FactTest: Factuality Testing in Large Language Models with Finite-Sample and Distribution-Free Guarantees” has been accepted to #ICML25! 🎉

📄 Paper: arxiv.org/abs/2411.02603
💻 Code: github.com/fannie1208/Fac…
1/N
Joel Jang (@jang_yoel) 's Twitter Profile Photo

Introducing 𝐃𝐫𝐞𝐚𝐦𝐆𝐞𝐧! We got humanoid robots to perform totally new 𝑣𝑒𝑟𝑏𝑠 in new environments through video world models. We believe video world models will solve the data problem in robotics. Bringing the paradigm of scaling human hours to GPU hours. Quick 🧵

Yiyang Zhou (@aiyiyangz) 's Twitter Profile Photo

🔥 ReAgent-V Released! 🔥

A unified video framework with reflection and reward-driven optimization.

✨ Real-time self-correction.
✨ Triple-view reflection.
✨ Auto-selects high-reward samples for training.
Peng (Richard) Xia ✈️ ICLR 2025 (@richardxp888) 's Twitter Profile Photo

🚑 Introducing MedAgentRL – a dynamic, RL-powered multi-agent framework for medical multimodal reasoning! 🤝🧠 Tired of AI models that fumble across specialties? MedAgentRL boosts performance by 20% over SFT, delivering smarter, collaborative diagnosis. 🏥

#MedAI #LLM #Agent #RL
Huaxiu Yao✈️ICLR 2025🇸🇬 (@huaxiuyaoml) 's Twitter Profile Photo

🎓 Meet EduVisAgent: the next-gen AI teaching assistant for STEM education. We're shifting from static answers to dynamic, visual-guided reasoning. 📊 We introduce EduVisBench, the first benchmark for visual teaching quality with 1,154 STEM questions, and EduVisAgent, a