Yiming Dou (@_yimingdou)'s Twitter Profile
Yiming Dou

@_yimingdou

Ph.D. student at UMich | B.Eng. from SJTU | Previous intern at Stanford | Computer Vision, Multimodal, Robotics

ID: 1507890461398253575

Website: https://dou-yiming.github.io/ | Joined: 27-03-2022 01:21:26

64 Tweets

695 Followers

867 Following

Zichen Wang (@zichen2501)'s Twitter Profile Photo

Differentiable rendering made SIMPLE❗️ Differentiating physically based renderers is hard: Dirac-delta discontinuities arise at object silhouettes. Our #SIGGRAPHAsia2024 work shows how a simple relaxation can save the day, enabling easy 3D reconstruction and relighting! (1/N)

Ayush Shrivastava (@ayshrv)'s Twitter Profile Photo

We present Global Matching Random Walks, a simple self-supervised approach to the Tracking Any Point (TAP) problem, accepted to #ECCV2024. We train a global matching transformer to find cycle-consistent tracks through video via contrastive random walks (CRW).

Junyi Zhang (@junyi42)'s Twitter Profile Photo

Excited to share MonST3R! -- a simple way to estimate geometry from unposed videos of dynamic scenes. We achieve competitive results on several downstream tasks (video depth, camera pose) and believe this is a promising step toward feed-forward 4D reconstruction. monst3r-project.github.io

Daniel Geng (@dangengdg)'s Twitter Profile Photo

What happens when you train a video generation model to be conditioned on motion? Turns out you can perform "motion prompting," just like you might prompt an LLM! Doing so enables many different capabilities. Here are a few examples – check out this thread 🧵 for more results!

Yuanchen_Ju (@ju_yuanchen)'s Twitter Profile Photo

🍌We present DenseMatcher! 🤖️DenseMatcher enables robots to acquire generalizable skills across diverse object categories from just one demo, by finding correspondences between 3D objects even with different types, shapes, and appearances.

Sarah Jabbour (@sarahjabbour_)'s Twitter Profile Photo

I’m on the PhD internship market for Spr/Summer 2025! I have experience in multimodal AI (EHR, X-ray, text), explainability for image models w/ genAI, clinician-AI interaction (surveyed 700+ doctors), and tabular foundation models. Please reach out if you think there’s a fit!

Yuanchen_Ju (@ju_yuanchen)'s Twitter Profile Photo

🧩#CVPR2025🌷Introducing Two By Two✌️: The First Large-Scale Daily Pairwise Assembly Dataset with SE(3)-Equivariant Pose Estimation. 🤖2BY2 helps robots master daily 3D assembly tasks—like plugging sockets or arranging flowers—across diverse objects! 🐨Co-led by Yu Qi

Chris Rockwell (@_crockwell)'s Twitter Profile Photo

Ever wish YouTube had 3D labels? 🚀Introducing🎥DynPose-100K🎥, an Internet-scale collection of diverse videos annotated with camera pose! Applications include camera-controlled video generation🤩and learned dynamic pose estimation😯 Download: huggingface.co/datasets/nvidi…

Daniel Geng (@dangengdg)'s Twitter Profile Photo

Hello! If you like pretty images and videos and want a rec for a CVPR oral session, you should def go to Image/Video Gen, Friday at 9am: I'll be presenting "Motion Prompting", Ryan Burgert will be presenting "Go with the Flow", and Pascal CHANG will be presenting "LookingGlass"

Jeongsoo Park (@jespark0)'s Twitter Profile Photo

Can AI image detectors keep up with new fakes? Mostly, no. Existing detectors are trained using a handful of models. But there are thousands in the wild! Our work, Community Forensics, uses 4800+ generators to train detectors that generalize to new fakes. #CVPR2025 🧵 (1/5)

Ayush Shrivastava (@ayshrv)'s Twitter Profile Photo

Excited to share our CVPR 2025 paper on cross-modal space-time correspondence! We present a method to match pixels across different modalities (RGB-Depth, RGB-Thermal, Photo-Sketch, and cross-style images) — trained entirely using unpaired data and self-supervision.

Linyi Jin (@jin_linyi)'s Twitter Profile Photo

Hello! If you are interested in dynamic 3D or 4D, don't miss oral session 3A at 9 am on Saturday: Zhengqi Li will be presenting "MegaSaM", I'll be presenting "Stereo4D", and Qianqian Wang will be presenting "CUT3R"

Paul Liang (@pliang279)'s Twitter Profile Photo

Despite much progress in AI, the ability for AI to 'smell' like humans remains elusive. Smell AIs 🤖👃 can be used for allergen sensing (e.g., peanuts or gluten in food), hormone detection for health, safety & environmental monitoring, quality control in manufacturing, and more.