Saining Xie (@sainingxie) Twitter Tweets • TwiCopy

Saining Xie

@sainingxie

researcher in #deeplearning #computervision | assistant professor at @NYU_Courant @nyuniversity | previous: research scientist @metaai (FAIR) @UCSanDiego

ID: 1283081795890626560

Link: http://www.sainingxie.com | Joined: 14-07-2020 16:51:59

479 Tweets

20,20K Followers

1,1K Following

Willis (Nanye) Ma

@ma_nanye

5 months ago

Come and check out our paper, Inference-Time Scaling for Diffusion Models Beyond Denoising Steps, at Poster Session 1 at #CVPR2025, slot 226, happening right now!

Likes: 72 | Replies: 0 | Retweets: 8

Georgia Gkioxari

I had to cancel my trip to #CVPR2025 because I caught the stupid flu 🙃 So sad to miss everyone! But if you are in Nashville go check out Damiano's project tomorrow on improving 3D spatial reasoning from a single image with the power of LLMs 🌟 #CVPR2025 Damiano Marsili

Likes: 62 | Replies: 4 | Retweets: 7

Andrei Bursuc

@abursuc

5 months ago

Visuals from slides of Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces #cvpr2025 #startikz

Likes: 24 | Replies: 0 | Retweets: 7

Deedy

@deedydas

5 months ago

LLMs are far worse at competitive programming than we thought. Every one scored 0% on Hard problems. LiveCodeBench-Pro is a new benchmark with 584 always updating problems from IOI, ICPC and Codeforces. What's most interesting is the categories they perform really poorly on:

Likes: 2,2K | Replies: 80 | Retweets: 219

Deedy

@deedydas

5 months ago

I rarely see benchmark papers with this depth. It has links to problems, solutions and LLM attempts from the PDF directly. And it goes deep into each problem category. Great read. Live leaderboard: livecodebenchpro.com Paper: arxiv.org/pdf/2506.11928

Likes: 238 | Replies: 7 | Retweets: 15

Rohan Paul

@rohanpaul_ai

5 months ago

This is really BAD news of LLM's coding skill. ☹️ The best Frontier LLM models achieve 0% on hard real-life Programming Contest problems, domains where expert humans still excel. LiveCodeBench Pro, a benchmark composed of problems from Codeforces, ICPC, and IOI (“International

Likes: 1,1K | Replies: 100 | Retweets: 312

Mathurin Massias

@mathusmassias

5 months ago

New paper on the generalization of Flow Matching arxiv.org/abs/2506.03719 🤯 Why does flow matching generalize? Did you know that the flow matching target you're trying to learn **can only generate training points**? with Quentin Bertrand, Anne Gagneux & Rémi Emonet 👇👇👇

Likes: 1,1K | Replies: 15 | Retweets: 202

Benjamin Feuer

@feuerbenjamin

5 months ago

So excited to announce the DCVLR (Data Curation for Vision-Language Reasoning) competition at NeurIPS 2025, led by Oumi and sponsored by Lambda! 🌟open-data 🌟 🤖 open-models 🤖 💻 open-source 💻 💪anyone can compete for free 💪 dcvlr-neurips.github.io 🧵 1 / n

Likes: 36 | Replies: 1 | Retweets: 11

Saining Xie

@sainingxie

5 months ago

wait, speaking of false dichotomies---during your phd, you *can* write code, dive into data and systems, collaborate with a team, and build useful things---all while enjoying complete openness and the freedom to pursue what *genuinely* excites you.

Likes: 298 | Replies: 11 | Retweets: 10

Saining Xie

@sainingxie

5 months ago

guys, real geospatial data is a total goldmine for digital agents. step away from the web browser and get real. (we explored a bit in virl-platform.github.io, but building a simulation-ready pipeline like this could take things way further)

Likes: 104 | Replies: 4 | Retweets: 17

Tal Linzen

@tallinzen

5 months ago

I'm hiring at least one post-doc! We're interested in creating language models that process language more like humans than mainstream LLMs do, through architectural modifications and interpretability-style steering.

Likes: 275 | Replies: 12 | Retweets: 49

Andrej Karpathy

@karpathy

5 months ago

Do people *feel* how much work there is still to do. Like wow.

Likes: 2,2K | Replies: 98 | Retweets: 68

Saining Xie

@sainingxie

5 months ago

metaquery is now open-source — with both the data and code available.

Likes: 54 | Replies: 2 | Retweets: 7

Saining Xie

@sainingxie

5 months ago

awesome work by Jiacheng Chen and Sanghyun Woo on 3D-grounded visual compositing (and nice demos!)

Likes: 55 | Replies: 4 | Retweets: 9

Manling Li

@manlingli_

5 months ago

Can VLMs build Spatial Mental Models like humans? Reasoning from limited views? Reasoning from partial observations? Reasoning about unseen objects behind furniture / beyond current view? Check out MindCube! 🌐mll-lab-nu.github.io/mind-cube/ 📰arxiv.org/pdf/2506.21458

Likes: 280 | Replies: 5 | Retweets: 56
