Jiajun Wu (@jiajunwu_cs)'s Twitter Profile
Jiajun Wu

@jiajunwu_cs

Assistant Professor at @Stanford CS

ID: 27546246

Link: https://jiajunwu.com · Joined: 30-03-2009 00:51:31

158 Tweets

9.9K Followers

1.1K Following

Jiaman Li (@jiaman01)'s Twitter Profile Photo

🔥 Introducing MVLift: Generate realistic 3D motion without any 3D training data - just using 2D poses from monocular videos! Applicable to human motion, human-object interaction & animal motion. Joint work w/ Jiajun Wu & Karen 💡 How? We reformulate 3D motion estimation as…
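
The thread is cut off, but a minimal sketch (not MVLift's actual pipeline) can show why 2D poses alone can supervise 3D motion: a candidate 3D pose is pushed through a pinhole camera, and disagreement with the observed 2D pose gives a gradient signal without any 3D labels. All shapes and numbers below are illustrative.

```python
# Minimal sketch (not MVLift's method): 2D poses supervising 3D via reprojection.
import torch

def project(joints_3d: torch.Tensor, f: float = 1000.0) -> torch.Tensor:
    """Pinhole projection of (J, 3) camera-space joints to (J, 2) pixels."""
    z = joints_3d[:, 2:3].clamp(min=1e-6)   # avoid division by zero
    return f * joints_3d[:, :2] / z

def reprojection_loss(joints_3d, joints_2d_obs):
    """L2 distance between projected and observed 2D joints: no 3D GT needed."""
    return ((project(joints_3d) - joints_2d_obs) ** 2).mean()

# Toy usage: refine a noisy 3D pose until its projection matches the 2D observation.
gt = torch.randn(17, 3) + torch.tensor([0.0, 0.0, 3.0])  # 17 joints in front of camera
obs_2d = project(gt)
est = (gt + 0.1 * torch.randn_like(gt)).requires_grad_()
opt = torch.optim.Adam([est], lr=1e-2)
for _ in range(200):
    opt.zero_grad()
    loss = reprojection_loss(est, obs_2d)
    loss.backward()
    opt.step()
```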

Hong-Xing "Koven" Yu (@koven_yu)'s Twitter Profile Photo

🤩Forget MoCap -- Let’s generate human interaction motions with *Real-world 3D scenes*!🏃🏞️ Introducing ZeroHSI: Zero-Shot 4D Human-Scene Interaction by Video Generation. No training, No MoCap data! 🧵1/5 Web: awfuact.github.io/zerohsi/

Joy Hsu (@joycjhsu)'s Twitter Profile Photo

Excited to bring back the 2nd Workshop on Visual Concepts at #CVPR2025, this time with a call for papers! We welcome submissions on the following topics. See our website for more info: sites.google.com/stanford.edu/w… Join us & a fantastic lineup of speakers in Tennessee!

Gim Hee Lee (@gimhee_lee)'s Twitter Profile Photo

🚀 3 more days for #3DV2025 early registration! Don’t miss out - register now! We’re featuring:
✨ Keynotes: Fei-Fei Li, Jon Barron, Noah Snavely
🗣️ Panels: Embodied AI & Large Models
📜 Accepted Papers: Oral talks & posters
🌟 Nectar Track: Spotlight talks & posters
International Conference on 3D Vision

Yunfan Jiang (@yunfanjiang)'s Twitter Profile Photo

🤖 Ever wondered what robots need to truly help humans around the house? 🏡 Introducing BEHAVIOR Robot Suite (BRS), a comprehensive framework for mastering mobile whole-body manipulation across diverse household tasks! 🧹🫧 From taking out the trash to…

International Conference on 3D Vision (@3dvconf)'s Twitter Profile Photo

#3DV2025 is happening in 10 days in Singapore, but we can't wait to give some spoilers for the award!! 8 papers were selected as award candidates, congrats 🥳. The final awards will be announced during the main conference. 3dvconf.github.io/2025/awards/

Kyle Sargent (@kylesargentai)'s Twitter Profile Photo

Modern generative models of images and videos rely on tokenizers. Can we build a state-of-the-art discrete image tokenizer with a diffusion autoencoder? Yes! I’m excited to share FlowMo, with Kyle Hsu, Justin Johnson, Fei-Fei Li, Jiajun Wu. A thread 🧵:
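
As a rough illustration of the ingredients the tweet names (an encoder, a discrete bottleneck, and a diffusion-style decoder), here is a toy sketch. It is not FlowMo's architecture; the sign-based quantizer, the noise schedule, and all layer sizes are assumptions.

```python
# Toy sketch of a discrete tokenizer with a diffusion-style decoder (NOT FlowMo).
import torch
import torch.nn as nn

class DiscreteDiffusionAE(nn.Module):
    def __init__(self, dim=64, code_bits=16):
        super().__init__()
        self.enc = nn.Sequential(nn.Linear(dim, 128), nn.ReLU(), nn.Linear(128, code_bits))
        # Denoising decoder: predicts the clean input from (noisy input, tokens, t).
        self.dec = nn.Sequential(nn.Linear(dim + code_bits + 1, 128), nn.ReLU(),
                                 nn.Linear(128, dim))

    def quantize(self, h):
        # Binary codes via sign, with a straight-through gradient estimator.
        q = torch.sign(h)
        return h + (q - h).detach()

    def forward(self, x):
        tokens = self.quantize(self.enc(x))              # discrete bottleneck
        t = torch.rand(x.shape[0], 1)                    # random noise level per sample
        noisy = torch.sqrt(1 - t) * x + torch.sqrt(t) * torch.randn_like(x)
        x_hat = self.dec(torch.cat([noisy, tokens, t], dim=-1))
        return ((x_hat - x) ** 2).mean()                 # denoising reconstruction loss

model = DiscreteDiffusionAE()
loss = model(torch.randn(8, 64))   # loss for one toy training step
loss.backward()
```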

Fan-Yun Sun (@sunfanyun)'s Twitter Profile Photo

Spatial reasoning is a major challenge for foundation models today, even in simple tasks like arranging objects in 3D space. #CVPR2025 Introducing LayoutVLM, a differentiable optimization framework that uses VLMs to spatially reason about diverse scene layouts from unlabeled…
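
A hedged sketch of what differentiable layout optimization can look like: spatial relations become soft losses over object positions, which gradient descent then minimizes. The constraint set below is hypothetical, not what LayoutVLM actually emits.

```python
# Illustrative differentiable layout optimization (constraints are hypothetical
# stand-ins for what a VLM might produce from a scene description).
import torch

pos = {name: torch.randn(2, requires_grad=True) for name in ["sofa", "table", "lamp"]}

def near(a, b, target=1.0):      # keep two objects ~target meters apart
    return (torch.norm(pos[a] - pos[b]) - target) ** 2

def apart(a, b, min_dist=2.0):   # penalize objects closer than min_dist
    return torch.relu(min_dist - torch.norm(pos[a] - pos[b])) ** 2

# Plausible constraints for "lamp next to sofa, table across the room".
constraints = [lambda: near("lamp", "sofa", 0.5),
               lambda: near("sofa", "table", 3.0),
               lambda: apart("lamp", "table")]

opt = torch.optim.Adam(list(pos.values()), lr=0.05)
for _ in range(300):
    opt.zero_grad()
    loss = sum(c() for c in constraints)
    loss.backward()
    opt.step()

print({k: v.detach().round(decimals=2) for k, v in pos.items()})
```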

Manling Li (@manlingli_)'s Twitter Profile Photo

Introducing T* and LV-Haystack -- targeting needle-in-the-haystack for long videos! 🤗 LV-Haystack annotated 400+ hours of videos and 15,000+ samples. 🧩 Lightweight plugin for any proprietary and open-source VLMs: T* boosts LLaVA-OV-72B [56→62%] and GPT-4o [50→53%] within…
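
The tweet does not spell out T*'s algorithm, so the sketch below only illustrates the generic idea of a lightweight keyframe-search plugin: score a coarse grid of frames, then spend the remaining budget around the best hits. `score_frame` is a hypothetical stand-in for a VLM relevance call.

```python
# Generic coarse-to-fine keyframe search for long videos (sketch of the idea only).
import random

def score_frame(frame_idx: int, question: str) -> float:
    return random.random()   # placeholder: a VLM's relevance score in [0, 1]

def search_keyframes(n_frames: int, question: str, budget: int = 64, top_k: int = 8):
    # Round 1: uniformly sample a coarse grid across the whole video.
    grid = list(range(0, n_frames, max(1, n_frames // (budget // 2))))
    scored = [(score_frame(i, question), i) for i in grid]
    scored.sort(reverse=True)
    # Round 2: spend the rest of the budget densely around the best coarse hits.
    seeds = [i for _, i in scored[:top_k]]
    for s in seeds:
        for i in range(max(0, s - 4), min(n_frames, s + 5)):
            scored.append((score_frame(i, question), i))
    scored.sort(reverse=True)
    return [i for _, i in scored[:top_k]]   # keyframes to hand to the VLM

print(search_keyframes(90_000, "When does the cat knock over the vase?"))
```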

Hong-Xing "Koven" Yu (@koven_yu)'s Twitter Profile Photo

🔥Spatial intelligence requires world generation, and now we have the first comprehensive evaluation benchmark📏 for it! Introducing WorldScore: Unifying evaluation for 3D, 4D, and video models on world generation! 🧵1/7 Web: haoyi-duan.github.io/WorldScore/ arxiv: arxiv.org/abs/2504.00983
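
For intuition on what unifying evaluation across heterogeneous metrics involves, here is a hypothetical aggregation sketch; the metric names and the min-max normalization are assumptions, not WorldScore's actual protocol.

```python
# How a unified benchmark score *could* be assembled from mixed-scale metrics.
import numpy as np

# Per-model raw metrics on different axes (names are hypothetical).
raw = {"model_a": {"camera_control": 0.81, "3d_consistency": 0.64, "quality": 0.72},
       "model_b": {"camera_control": 0.55, "3d_consistency": 0.90, "quality": 0.68}}

metrics = sorted(next(iter(raw.values())))
table = np.array([[raw[m][k] for k in metrics] for m in raw])

# Min-max normalize each metric column so differently scaled axes are comparable.
lo, hi = table.min(axis=0), table.max(axis=0)
normed = (table - lo) / np.where(hi > lo, hi - lo, 1.0)

for model, row in zip(raw, normed):
    print(f"{model}: unified score = {row.mean():.3f}")
```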

Stanford AI Lab (@stanfordailab)'s Twitter Profile Photo

Stanford AI Lab (SAIL) is excited to announce new SAIL Postdoctoral Fellowships! We are looking for outstanding candidates excited to advance the frontiers of AI with our professors and vibrant community. Applications received by the end of April 30 will receive full…

Joy Hsu (@joycjhsu)'s Twitter Profile Photo

We'll be presenting Deep Schema Grounding at ICLR 2025 🇸🇬 on Thursday (session 1 #98). Come chat about abstract visual concepts, structured decomposition, & what makes a maze a maze! And test your models on our challenging Visual Abstractions Benchmark: stanford.edu/~joycj/project…

Wenlong Huang (@wenlong_huang)'s Twitter Profile Photo

How do we scale visual affordance learning that is fine-grained, task-conditioned, works in-the-wild, and handles dynamic environments? Introducing Unsupervised Affordance Distillation (UAD), which distills affordances from off-the-shelf foundation models, *all without manual labels*. Very excited this…
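
A sketch of the general label-free distillation recipe the tweet gestures at (not UAD's actual pipeline): cluster frozen foundation-model features into pseudo-labels, then train a small affordance head on them. `extract_features` is a placeholder for the frozen model.

```python
# Label-free affordance distillation, in spirit only (NOT the UAD pipeline).
import torch
import torch.nn as nn
import torch.nn.functional as F
from sklearn.cluster import KMeans

def extract_features(images):
    """Placeholder for frozen foundation-model patch features: (N, patches, dim)."""
    return torch.randn(images.shape[0], 196, 384)

images = torch.zeros(32, 3, 224, 224)          # unlabeled images
feats = extract_features(images)               # (32, 196, 384)

# Step 1: cluster patch features into K pseudo-affordance classes (no labels).
flat = feats.reshape(-1, feats.shape[-1])      # (32*196, 384)
pseudo = KMeans(n_clusters=8, n_init=10).fit_predict(flat.numpy())
pseudo = torch.as_tensor(pseudo, dtype=torch.long)

# Step 2: distill into a lightweight per-patch affordance head.
head = nn.Linear(384, 8)
opt = torch.optim.Adam(head.parameters(), lr=1e-3)
for _ in range(10):
    opt.zero_grad()
    loss = F.cross_entropy(head(flat), pseudo)
    loss.backward()
    opt.step()
```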

Ndea (@ndea)'s Twitter Profile Photo

Neuro-symbolic concepts (object, action, relation) are represented by a hybrid of neural nets & symbolic programs. Because they are composable, grounded, and typed, agents can recombine them to solve tasks like robotic manipulation. By J. Tenenbaum, Jiayuan Mao, Jiajun Wu (Massachusetts Institute of Technology, MIT). arxiv.org/abs/2505.06191
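
Reading "composable, grounded, and typed" literally, one tiny way to realize it is a symbol that carries a type signature and delegates its truth value to a small neural scorer; composition is then ordinary program composition. This is an illustration of the idea, not the paper's framework.

```python
# Tiny illustration of typed, composable neuro-symbolic concepts (not the paper's code).
import torch
import torch.nn as nn

class Concept:
    """A symbol with a type signature, grounded by a small neural scorer."""
    def __init__(self, name: str, signature: tuple, feat_dim: int = 16):
        self.name, self.signature = name, signature
        self.net = nn.Sequential(nn.Linear(feat_dim * len(signature), 32),
                                 nn.ReLU(), nn.Linear(32, 1))

    def __call__(self, *args: torch.Tensor) -> torch.Tensor:
        assert len(args) == len(self.signature), f"{self.name} arity mismatch"
        return torch.sigmoid(self.net(torch.cat(args, dim=-1)))   # soft truth score

red  = Concept("red",  ("object",))            # unary attribute
left = Concept("left", ("object", "object"))   # binary relation

# Composition: a symbolic program over neural groundings, "red thing left of b".
a, b = torch.randn(16), torch.randn(16)
score = red(a) * left(a, b)    # conjunction as a product of soft truth values
```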

Guangzhao (Alex) He (@alexhe00880585)'s Twitter Profile Photo

💫 Animating 4D objects is complex: traditional methods rely on handcrafted, category-specific rigging representations. 💡 What if we could learn unified, category-agnostic, and scalable 4D motion representations from raw, unlabeled data? 🚀 Introducing CANOR at #CVPR2025: a…

Zhao Dong (@flycooler_zd)'s Twitter Profile Photo

🚀 Excited to announce our CVPR 2025 Workshop: 3D Digital Twin: Progress, Challenges, and Future Directions 🗓 June 12, 2025 · 9:00 AM–5:00 PM 📢 Incredible lineup: Richard Newcombe, Andrea Vedaldi (Visual Geometry Group, VGG), Hao (Richard) Zhang, Qianqian Wang, Dr. Xiaoshuai Zhang (Hillbot), …

Yunzhi Zhang (@zhang_yunzhi)'s Twitter Profile Photo

(1/n) Time to unify your favorite visual generative models, VLMs, and simulators for controllable visual generation—Introducing a Product of Experts (PoE) framework for inference-time knowledge composition from heterogeneous models.
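
The PoE idea itself is compact: the composed distribution is p(x) ∝ ∏_i p_i(x)^{w_i}, so candidates are ranked by a weighted sum of expert log-probabilities. Below is a minimal sketch with hypothetical experts (a generative prior, a VLM judge, a physics check); none of these are the paper's actual components.

```python
# Product of experts: p(x) ∝ ∏_i p_i(x)^w_i, i.e. rank candidates by a weighted
# SUM of expert log-probabilities. Experts here are hypothetical stand-ins.
import math

def poe_rank(candidates, experts, weights):
    """Score each candidate by sum_i w_i * log p_i(x); higher is better."""
    def score(x):
        return sum(w * math.log(max(p(x), 1e-12)) for p, w in zip(experts, weights))
    return sorted(candidates, key=score, reverse=True)

# Toy experts over scene-layout candidates (each returns a probability-like score).
gen_prior   = lambda x: 0.9 if x["style"] == "modern" else 0.4
vlm_judge   = lambda x: 0.8 if x["objects"] <= 5 else 0.3
sim_physics = lambda x: 1.0 if x["stable"] else 0.05   # a simulator can near-veto

candidates = [{"style": "modern", "objects": 4, "stable": True},
              {"style": "rustic", "objects": 9, "stable": False}]
best = poe_rank(candidates, [gen_prior, vlm_judge, sim_physics], [1.0, 1.0, 2.0])[0]
print(best)
```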