Michael Niemeyer (@mi_niemeyer) 's Twitter Profile
Michael Niemeyer

@mi_niemeyer

ML/AI Research Scientist at Google.

ID: 1134512764486070274

Link: https://m-niemeyer.github.io/ · Joined: 31-05-2019 17:31:36

289 Tweets

2.2K Followers

126 Following

Boyuan Chen (@boyuanchen0) 's Twitter Profile Photo

Announcing Diffusion Forcing Transformer (DFoT), our new video diffusion algorithm that generates ultra-long videos of 800+ frames. DFoT enables History Guidance, a simple add-on to any existing video diffusion models for a quality boost. Website: boyuan.space/history-guidan… (1/7)
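The tweet doesn't spell out the mechanism, but "guidance" add-ons of this kind typically follow the classifier-free guidance recipe: blend a conditioned and an unconditioned noise prediction at each denoising step. A minimal sketch under that assumption (the names `eps_hist`, `eps_free`, and `scale` are illustrative, not taken from the DFoT codebase):

```python
import numpy as np

def guided_noise(eps_hist: np.ndarray, eps_free: np.ndarray, scale: float) -> np.ndarray:
    """Classifier-free-guidance-style blend: push the denoiser's prediction
    toward the history-conditioned branch by `scale`.
    scale = 0 ignores history; scale = 1 uses only the conditioned prediction."""
    return eps_free + scale * (eps_hist - eps_free)

# Toy example: at scale 1.0 we recover the history-conditioned prediction.
eps_h = np.array([1.0, 2.0])
eps_f = np.array([0.0, 0.0])
print(guided_noise(eps_h, eps_f, 1.0))  # -> [1. 2.]
```

Scales above 1.0 extrapolate past the conditioned branch, the usual way guidance trades diversity for sample quality.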

Jon Barron (@jon_barron) 's Twitter Profile Photo

I just pushed a new paper to arXiv. I realized that a lot of my previous work on robust losses and nerf-y things was dancing around something simpler: a slight tweak to the classic Box-Cox power transform that makes it much more useful and stable. It's this f(x, λ) here:
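For context, the classic Box-Cox power transform that the tweet's f(x, λ) tweaks is (the paper's modified, more stable variant is not reproduced here):

```latex
f(x, \lambda) =
\begin{cases}
\dfrac{x^{\lambda} - 1}{\lambda} & \lambda \neq 0 \\[6pt]
\ln x & \lambda = 0
\end{cases}
```

The λ = 0 case is the limit of the first case as λ → 0, which keeps the family continuous in λ and lets a single parameter sweep from log-like to linear-like transforms.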

Angjoo Kanazawa (@akanazawa) 's Twitter Profile Photo

Exciting news! MegaSAM code is out 🔥 & the updated Shape of Motion results with MegaSAM are really impressive! A year ago I didn't think we could make any progress on these videos: shape-of-motion.github.io/results.html Huge congrats to everyone involved and the community 🎉

Inception Labs (@inceptionailabs) 's Twitter Profile Photo

We are excited to introduce Mercury, the first commercial-grade diffusion large language model (dLLM)! dLLMs push the frontier of intelligence and speed with parallel, coarse-to-fine text generation.

Luma AI (@lumalabsai) 's Twitter Profile Photo

Today, we release Inductive Moment Matching (IMM): a new pre-training paradigm breaking the algorithmic ceiling of diffusion models. Higher sample quality. 10x more efficient. Single-stage, single network, stable training. Read more: lumalabs.ai/news/imm

Matthias Niessner (@mattniessner) 's Twitter Profile Photo

📢📢 Want to build 3D Foundation Models? 📢📢 ➡️ We're looking for Diffusion/3D/ML/Infra engineers and scientists in Munich & London. Get in touch for details! #3D #GenAI #spatialintelligence #foundationmodels

Jensen (Jinghao) Zhou (@jensenzhoujh) 's Twitter Profile Photo

Hi there, 🎉 We are thrilled to introduce Stable Virtual Camera, a generalist diffusion model designed to address the exciting challenge of Novel View Synthesis (NVS). With just one or a few images, it allows you to create a smooth trajectory video from any viewpoint you desire.

Philipp Henzler (@philipphenzler) 's Twitter Profile Photo

From image(s) to 3D scenes in SECONDS! Bolt3D ⚡️ uses a latent diffusion transformer to generate both image and geometry latents from which we can directly decode 3D Gaussians - no optimization needed.

MrNeRF (@janusch_patas) 's Twitter Profile Photo

SplatVoxel: History-Aware Novel View Streaming without Temporal Training Contributions: • We propose a hybrid Splat-Voxel feed-forward reconstruction framework that leverages historical information to enable novel view streaming, without relying on multi-view video datasets for …

Michael Niemeyer (@mi_niemeyer) 's Twitter Profile Photo

On my way back from 3DV in Singapore. What a blast! Thanks to all the organizers of this year's International Conference on 3D Vision as well as all the speakers and presenters, I had such a fantastic time!

Shubham Tulsiani (@shubhtuls) 's Twitter Profile Photo

Excited to share this dataset with registered aerial and ground images with dense geometry and correspondence supervision. Please see Khiem's thread for some cool applications this enables!

Sherwin Bahmani (@sherwinbahmani) 's Twitter Profile Photo

📢 Excited to be at #ICLR2025 for our paper: VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control Poster: Thu 3-5:30 PM (#134) Website: snap-research.github.io/vd3d/ Code: github.com/snap-research/… Also check out our #CVPR2025 follow-up AC3D: snap-research.github.io/ac3d/

Songyou Peng (@songyoupeng) 's Twitter Profile Photo

📢 Unposed few-view 3D reconstruction has never been so easy, and SOTA pose estimation as a byproduct! Check out our #ICLR2025 ORAL paper (top 1.8%): NoPoSplat! Catch the amazing Botao Ye at: Oral: Thu 4:18 pm Poster: Thu 10 am (#204) Website: noposplat.github.io

Google DeepMind (@googledeepmind) 's Twitter Profile Photo

Video, meet audio. 🎥🤝🔊 With Veo 3, our new state-of-the-art generative video model, you can add soundtracks to clips you make. Create talking characters, include sound effects, and more while developing videos in a range of cinematic styles. 🧵

SpAItial AI (@spaitial_ai) 's Twitter Profile Photo

🚀🚀🚀 Announcing our $13M funding round to build the next generation of AI: Spatial Foundation Models that can generate entire 3D environments anchored in space & time. 🚀🚀🚀 Interested? Join our world-class team: 🌐 spaitial.ai #GenAI #3DAI

Michael Niemeyer (@mi_niemeyer) 's Twitter Profile Photo

Rendering large-scale scenes even on mobile! Make sure to check out the internship project LODGE of the rising star in computer vision, Jonas. It was such a blast having you with us! 🎉

MrNeRF (@janusch_patas) 's Twitter Profile Photo

Is Google taking initial steps to enhance Street View? For some reason, Street View seems stuck in technology that feels outdated. I wonder if we'll see such improvements on the product side. Also, note how much better it performs in all aspects compared to Zip-NeRF in their …

Ben Mildenhall (@benmildenhall) 's Twitter Profile Photo

At World Labs, we built a new Gaussian splatting web renderer with all the bells and whistles we needed to make splats a first-class citizen of the incredible Three.js ecosystem. Today, we're open sourcing Forge under the MIT license.

Haofei Xu (@haofeixu) 's Twitter Profile Photo

Excited to present our #CVPR2025 paper DepthSplat next week! DepthSplat is a feed-forward model that achieves high-quality Gaussian reconstruction and view synthesis in just 0.6 seconds. Looking forward to great conversations at the conference!