Saurabh Saxena (@srbhsxn) 's Twitter Profile
Saurabh Saxena

@srbhsxn

Researcher at Google Deepmind

ID: 1876211365

Link: http://saurabhsaxena.org · Joined: 17-09-2013 16:59:15

129 Tweets

846 Followers

373 Following

Ben Poole (@poolio) 's Twitter Profile Photo

ReconFusion = 3D Reconstruction + Diffusion prior for novel view synthesis

reconfusion.github.io

Better NeRFs, less data.

Ruiqi Gao (@ruiqigao) 's Twitter Profile Photo

Looking for diffusion model advancements at #NeurIPS2023? Come to check our oral work "Understanding Diffusion Objectives as the ELBO with Simple Data Augmentation" w/ Durk Kingma.

New theoretical understanding, SOTA empirical results, and more! 

Arxiv: arxiv.org/abs/2303.00848
AK (@_akhaliq) 's Twitter Profile Photo

NeRFiller: Completing Scenes via Generative 3D Inpainting

paper page: huggingface.co/papers/2312.04…

propose NeRFiller, an approach that completes missing portions of a 3D capture via generative 3D inpainting using off-the-shelf 2D visual generative models. Often parts of a captured

Saurabh Saxena (@srbhsxn) 's Twitter Profile Photo

Excited to share that our work was accepted for an oral presentation at #NeurIPS2023. If you are interested in diffusion models or computer vision, please drop by our talk and poster on Thursday! nips.cc/virtual/2023/o…

Ethan Weber (@ethanjohnweber) 's Twitter Profile Photo

Excited to release NeRFiller with my amazing collaborators Aleksander Holynski, Varun Jampani, Saurabh Saxena, Noah Snavely, Abhishek Kar, and Angjoo Kanazawa! The project page is available at ethanweber.me/nerfiller/. We focus on scene completion by using a 2D inpainter.

AI Bites | YouTube Channel (@ai_bites) 's Twitter Profile Photo

DMD (Diffusion for Metric Depth) is a state-of-the-art diffusion model for monocular absolute depth estimation.

Innovations include:
👉use of log-scale depth parameterization to enable joint modeling of indoor and outdoor scenes, 
👉conditioning on the field-of-view (FOV) to
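
The log-scale depth parameterization is easy to illustrate. Below is a minimal sketch (mine, not DMD's code), assuming a hypothetical 0.5–80 m depth range so that indoor and outdoor scenes map into one shared [0, 1] target:

```python
import numpy as np

# Hypothetical depth range wide enough to cover indoor (~0.5 m) and outdoor (~80 m) scenes.
D_MIN, D_MAX = 0.5, 80.0

def depth_to_log_target(depth_m):
    """Map metric depth (meters) to a [0, 1] log-scale prediction target."""
    d = np.clip(depth_m, D_MIN, D_MAX)
    return (np.log(d) - np.log(D_MIN)) / (np.log(D_MAX) - np.log(D_MIN))

def log_target_to_depth(t):
    """Invert the parameterization back to meters."""
    return np.exp(t * (np.log(D_MAX) - np.log(D_MIN)) + np.log(D_MIN))

# Round-trip check: indoor and outdoor depths land in the same normalized range.
for d in [0.7, 3.0, 25.0, 60.0]:
    t = depth_to_log_target(d)
    print(f"{d:5.1f} m -> {t:.3f} -> {log_target_to_depth(t):5.1f} m")
```

Compared with a linear parameterization over the same range, the log scale spends a comparable share of the target interval on nearby indoor depths and on distant outdoor depths, which is the stated motivation for joint modeling.
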
Alex Carlier (@alexcarliera) 's Twitter Profile Photo

Google just revealed an ABSOLUTE depth estimation model 🤯 As opposed to recent depth models (Marigold, PatchFusion), which aim for maximum detail, DMD aims to estimate the ABSOLUTE depth (in meters) within the image. More details below ⬇️⬇️

Shek Azizi (@azizishekoofeh) 's Twitter Profile Photo

Hiring Research Scientists within Google DeepMind - Toronto to join our team & advance the next generation of medical AI, develop cutting-edge LLMs & Multi-modal models to tackle real-world healthcare challenges. Please submit your interest through: forms.gle/2cSbBotUwSfVfu…

Daniel Watson (@watson_nn) 's Twitter Profile Photo

[[THREAD]] Happy to announce 4DiM, our diffusion model for novel view synthesis of scenes! 4DiM allows camera+time control with as few as one input image. Joint work with Saurabh Saxena*, Lala Li*, Andrea Tagliasacchi 🇨🇦, and David Fleet (*equal contribution).

Sander Dieleman (@sedielem) 's Twitter Profile Photo

Diffusion is the rising tide that eventually submerges all frequencies, high and low 🌊 Diffusion is the gradual decomposition into feature scales, fine and coarse 🗼 Diffusion is just spectral autoregression 🤷🌈
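
The "spectral autoregression" framing can be sanity-checked numerically: natural images have power spectra that fall off steeply with frequency, while Gaussian diffusion noise has a flat spectrum, so as the noise level rises the high frequencies are submerged first. A rough sketch of that comparison (my own illustration, not from the tweet; the toy image stands in for a real photo):

```python
import numpy as np

def radial_power_spectrum(img):
    """Radially averaged power spectrum of a 2D array."""
    f = np.fft.fftshift(np.fft.fft2(img))
    power = np.abs(f) ** 2
    h, w = img.shape
    yy, xx = np.indices((h, w))
    r = np.hypot(yy - h / 2, xx - w / 2).astype(int)
    return np.bincount(r.ravel(), power.ravel()) / np.bincount(r.ravel())

rng = np.random.default_rng(0)

# Toy "natural image": smooth, low-frequency content (swap in a real photo for a better demo).
x = np.linspace(0, 4 * np.pi, 256, endpoint=False)
image = np.outer(np.sin(x), np.cos(x))
noise = rng.standard_normal(image.shape)   # one unit of diffusion noise

img_spec = radial_power_spectrum(image)
noise_spec = radial_power_spectrum(noise)

# Image power sits at low frequencies; noise power is flat across frequencies.
# So as the noise level grows, the high-frequency bands are the first to be drowned out.
for radius in [2, 30, 100]:
    print(f"radius {radius:3d}:  image power={img_spec[radius]:12.3g}  noise power={noise_spec[radius]:12.3g}")
```
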

Jeff Dean (@jeffdean) 's Twitter Profile Photo

My Google colleague and longtime UC Berkeley faculty member David Patterson has a great essay out in this month's Communications of the ACM (Association for Computing Machinery): 🎉

"Life Lessons from the First Half-Century of My Career
Sharing 16 life lessons, and nine magic words."

I saw an
Michael Tschannen (@mtschannen) 's Twitter Profile Photo

Have you ever wondered how to train an autoregressive generative transformer on text and raw pixels, without a pretrained visual tokenizer (e.g. VQ-VAE)?

We have been pondering this during summer and developed a new model: JetFormer 🌊🤖

arxiv.org/abs/2411.19722

A thread 👇

1/
Saurabh Saxena (@srbhsxn) 's Twitter Profile Photo

SfM failing on dynamic videos? 😠 RoMo to the rescue! 💪 Our simple method uses epipolar cues and semantic features for robustly estimating motion masks, boosting dynamic SfM performance 🚀 Plus, a new dataset of dynamic scenes with ground truth cameras! 🤯 #computervision 🧵👇
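
The epipolar cue here is standard two-view geometry: points on static geometry must satisfy the epipolar constraint between frames, so correspondences with large epipolar (Sampson) error are likely on moving objects. A simplified sketch of that cue alone (not RoMo's actual pipeline; `pts1`/`pts2` are assumed pixel correspondences between two frames):

```python
import cv2
import numpy as np

def motion_mask_from_epipolar_error(pts1, pts2, thresh=2.0):
    """Flag correspondences that violate the epipolar constraint as dynamic.

    pts1, pts2: (N, 2) float arrays of matched pixel coordinates (N >= 8).
    Returns a boolean array, True where a point is likely moving.
    """
    # Robustly fit a fundamental matrix; RANSAC treats dynamic points as outliers.
    F, _ = cv2.findFundamentalMat(pts1, pts2, cv2.FM_RANSAC, 1.0, 0.999)

    # Sampson distance: first-order geometric error of the epipolar constraint.
    ones = np.ones((len(pts1), 1))
    x1 = np.hstack([pts1, ones])          # homogeneous coordinates, (N, 3)
    x2 = np.hstack([pts2, ones])
    Fx1 = x1 @ F.T                        # epipolar lines in image 2
    Ftx2 = x2 @ F                         # epipolar lines in image 1
    num = np.sum(x2 * Fx1, axis=1) ** 2
    den = Fx1[:, 0] ** 2 + Fx1[:, 1] ** 2 + Ftx2[:, 0] ** 2 + Ftx2[:, 1] ** 2
    sampson = num / den                   # roughly squared pixel error

    return sampson > thresh ** 2
```

A single fundamental matrix only models a rigid scene seen from two views, which is presumably why the method combines this cue with semantic features rather than relying on it alone.
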

Saumya Saxena (@saxena_saumya) 's Twitter Profile Photo

Can 3D scene graphs act as effective online memory for solving EQA tasks in⚡️real-time? Presenting GraphEQA🤖, a framework for grounding Vision Language Models using multimodal memory for real-time embodied question answering.

Andrea Tagliasacchi 🇨🇦 (@taiyasaki) 's Twitter Profile Photo

📢📢📢 Consider applying to SFU, where we have one of the largest Graphics/Vision groups in the world. There's still time! The deadline is Jan 18, 2025. The language of Vancouver is English, and despite being in 🇨🇦... it is not that cold (similar to Paris, Berlin, London). Links in 🧵

Ricardo Martin-Brualla (@rmbrualla) 's Twitter Profile Photo

Excited about 3D GenAI? There’s something super exciting brewing… If you know of researchers, or 3D / ML / infra engineers, there are positions open in Munich and London. Reach out!

Saurabh Saxena (@srbhsxn) 's Twitter Profile Photo

Our team in Google DeepMind Toronto is hiring a Student Researcher for Summer 2025 to work on projects in video generative models and 3D computer vision. If you are interested, please apply at: forms.gle/Yj1jmbvjBFQCzC…