Noah Snavely (@jimantha) Twitter Tweets • TwiCopy

Gemmechu Hassena

a year ago

Excited to share our work, ObjectCarver! Given multiview images and click points on one image, ObjectCarver decomposes scenes into separate objects, providing high-quality 3D surfaces while handling occlusion and close-contact objects. (1/6) website: objectcarver.github.io

thumb_up_off_alt61

chat_bubble_outline5

repeat12

shareShare

Noah Snavely

@jimantha

a year ago

I'm a big fan of work on visual discovery, and this work on using diffusion models for data mining is really cool!

thumb_up_off_alt32

chat_bubble_outline1

repeat1

shareShare

Haian Jin

@haian_jin

a year ago

Novel view synthesis has long been a core challenge in 3D vision. But how much 3D inductive bias is truly needed? —Surprisingly, very little! Introducing "LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias"—a fully transformer-based approach that enables scalable,

thumb_up_off_alt581

chat_bubble_outline23

repeat94

shareShare

Dmytro Mishkin 🇺🇦

@ducha_aiki

a year ago

Extreme Rotation Estimation in the Wild Hana Bezalel, Dotan Ankri, Ruojin Cai Hadar Averbuch-Elor tl;dr: MegaDepth/Scenes subset with small/large/no overlap image pairs, the task is R prediction arxiv.org/abs/2411.07096

Extreme Rotation Estimation in the Wild
Hana Bezalel, Dotan Ankri, <a href="/ruojin8/">Ruojin Cai</a> <a href="/ElorHadar/">Hadar Averbuch-Elor</a>

tl;dr: MegaDepth/Scenes subset with small/large/no overlap image pairs, the task is R prediction
arxiv.org/abs/2411.07096

thumb_up_off_alt37

chat_bubble_outline1

repeat8

shareShare

Gene Chou

@gene_ch0u

a year ago

We've released our paper "Generating 3D-Consistent Videos from Unposed Internet Photos"! Video models like Luma generate pretty videos, but sometimes struggle with 3D consistency. We can do better by scaling them with 3D-aware objectives. 1/N page: genechou.com/kfcw

thumb_up_off_alt227

chat_bubble_outline6

repeat46

shareShare

International Conference on 3D Vision

@3dvconf

a year ago

#3DV2025AMA Third guest on the Ask Me Anything series: Noah Snavely Noah Snavely from Cornell & Google DeepMind! 🌟 🕒 You have now 24 HOURS to ask him anything — drop your questions in the comments below! We have also the AMA happening on Bluesky: bsky.app/profile/3dvcon…

thumb_up_off_alt17

chat_bubble_outline2

repeat6

shareShare

Zhengqi Li

@zhengqi_li

a year ago

Introducing MegaSaM! 🎥 Accurate, fast, & robust structure + camera estimation from casual monocular videos of dynamic scenes! MegaSaM outputs camera parameters and consistent video depth, scaling to long videos with unconstrained camera paths and complex scene dynamics!

thumb_up_off_alt490

chat_bubble_outline8

repeat90

shareShare

Yuanbo Xiangli

@ambie_kk

a year ago

Introducing Doppelgangers++! 🚀 An enhanced pairwise image classifier that tackles visual aliasing (doppelgangers) to improve 3D reconstruction accuracy across diverse, real-world scenes. 🌍✨ 🔗Project page: bit.ly/3VAPMJc. Code is also available.

thumb_up_off_alt107

chat_bubble_outline2

repeat21

shareShare

Linyi Jin

@jin_linyi

a year ago

Introducing 👀Stereo4D👀 A method for mining 4D from internet stereo videos. It enables large-scale, high-quality, dynamic, *metric* 3D reconstructions, with camera poses and long-term 3D motion trajectories. We used Stereo4D to make a dataset of over 100k real-world 4D scenes.

thumb_up_off_alt524

chat_bubble_outline13

repeat102

shareShare

Haian Jin

@haian_jin

a year ago

I can’t attend the #NeurIPS conference this year, but Yuanbo Xiangli will present Neural Gaffer in person. Drop by our poster at West Ballroom A-D #7001 if you are interested! Time: Fri 13 Dec 4:30 p.m. — 7:30 p.m.

thumb_up_off_alt41

chat_bubble_outline0

repeat4

shareShare

Ruojin Cai

@ruojin8

a year ago

🤔Can Generative Video Models Help Pose Estimation? ✅Yes! We find that generative video models can hallucinate plausible intermediate frames that provide useful context for pose estimators (e.g. DUSt3R), especially for images with little to no overlap. 🔗 inter-pose.github.io

thumb_up_off_alt223

chat_bubble_outline2

repeat34

shareShare

Noah Snavely

@jimantha

9 months ago

These days, I only get around to trying every third or fourth social media site that comes along. The one I really like right now is Bluesky. It's a bit hard to describe, kind of like a new-fangled Usenet, but I highly recommend it! snavely.bsky.social

thumb_up_off_alt7

chat_bubble_outline0

repeat1

shareShare

Noah Snavely

@jimantha

8 months ago

Really cool work from Hadar Averbuch-Elor and co!

thumb_up_off_alt17

chat_bubble_outline0

repeat1

shareShare

Haian Jin

@haian_jin

7 months ago

Our paper LVSM has been accepted as an oral presentation at #ICLR2025! See you in Singapore! We’ve just released the code and checkpoints—check it out here: github.com/haian-jin/LVSM.🚀

thumb_up_off_alt127

chat_bubble_outline2

repeat19

shareShare

Boyang Deng

@boyang_deng

7 months ago

Curious about how cities have changed in the past decade? We use MLLMs to analyse 40 million Street View images to answer this. Do you know that "juice shops became a thing in NYC" and "miles of overpasses were painted BLUE in SF"? More at→boyangdeng.com/visual-chronic… (vid ↓ w/ 🔊)

thumb_up_off_alt88

chat_bubble_outline1

repeat15

shareShare

Linyi Jin

@jin_linyi

7 months ago

We have released the Stereo4D dataset! Explore the real-world dynamic 3D tracks: github.com/Stereo4d/stere…

thumb_up_off_alt227

chat_bubble_outline0

repeat38

shareShare

Rundong Luo

@luorundong0122

7 months ago

1/6 🔍➡️ How to transform standard videos into immersive 360° panoramas? We've designed a new AI system for video-to-360° panorama generation! Our key insight: large-scale data is crucial for robust panoramic synthesis across diverse scenes.

thumb_up_off_alt26

chat_bubble_outline1

repeat4

shareShare

Shiry Ginosar

@shiryginosar

7 months ago

Think LMMs can reason like a 3-year-old? Think again! Our Kid-Inspired Visual Analogies benchmark reveals where young children still win: ey242.github.io/kiva.github.io/ Catch our #ICLR2025 poster today to see where models still fall short! Thurs. April 24 3-5:30 pm Halls 3 + 2B #312

thumb_up_off_alt15

chat_bubble_outline0

repeat5

shareShare

Haian Jin

@haian_jin

7 months ago

Excited to attend #ICLR2025 in person this year! I’ll be presenting two papers: 1. LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias 🔹 Oral Presentation: Session 3C (Garnet 216-218) — Apr 25 (Fri), 11:06–11:18 a.m. 🔹 Poster: Hall 3 + Hall 2B, Poster #593 — Apr

thumb_up_off_alt25

chat_bubble_outline1

repeat3

shareShare

Aleksander Holynski

@holynski_

5 months ago

MegaSaM got an award! Big congrats to the team!!!!! 🥳🥳🎉🎉 Zhengqi Li, Richard, Forrester Cole, Linyi Jin, Qianqian Wang, Vickie Angjoo Kanazawa, Noah Snavely

MegaSaM got an award! Big congrats to the team!!!!! 🥳🥳🎉🎉

<a href="/zhengqi_li/">Zhengqi Li</a>, Richard, <a href="/forrestercole2/">Forrester Cole</a>, <a href="/jin_linyi/">Linyi Jin</a>, <a href="/QianqianWang5/">Qianqian Wang</a>, Vickie <a href="/akanazawa/">Angjoo Kanazawa</a>, <a href="/Jimantha/">Noah Snavely</a>

thumb_up_off_alt132

chat_bubble_outline4

repeat6

shareShare