Xihui Liu (@xihuiliu) Twitter Tweets • TwiCopy

Prithvijit

8 months ago

World Models have gained significant momentum in the research community over the past few years. However, we still lack systematic approaches for evaluating them properly for downstream applications and making informed design decisions. We're organizing the WorldModelBench

Likes: 5 · Replies: 0 · Reposts: 5

Xihui Liu

@xihuiliu

8 months ago

Check out our new work introducing generation chain-of-thought (GoT) reasoning for visual generation and editing. Code and data released! Arxiv: arxiv.org/abs/2503.10639 Code and data: github.com/rongyaofang/GoT Huggingface: huggingface.co/papers/2503.10…

Likes: 114 · Replies: 1 · Reposts: 28

Xihui Liu

@xihuiliu

8 months ago

Excited to introduce our new work TokenBridge. We hope to inspire a new pathway to autoregressive visual generation by bridging discrete and continuous tokens. arxiv.org/abs/2503.16430 github.com/yuqingwang1029… yuqingwang1029.github.io/TokenBridge/ huggingface.co/papers/2503.16…

Likes: 52 · Replies: 2 · Reposts: 12

Xihui Liu

@xihuiliu

8 months ago

Thanks to AK for sharing our work!

Likes: 6 · Replies: 0 · Reposts: 0

Xihui Liu

@xihuiliu

7 months ago

My Ph.D. student, Mr. Tianwei Xiong, will introduce our long-take video dataset LVD-2M (NeurIPS 2024) in this webinar. silentview.github.io/LVD-2M/ arxiv.org/pdf/2410.10816

Likes: 44 · Replies: 2 · Reposts: 3

Xihui Liu

@xihuiliu

7 months ago

Check out our latest work SEED-Bench-R1!

Likes: 18 · Replies: 0 · Reposts: 2

Xihui Liu

@xihuiliu

7 months ago

Why are discrete visual tokenizers difficult to scale? In GigaTok, we study the key factors for scaling tokenizers and scale VQ tokenizers to 3B parameters for better reconstruction, AR generation, and representation. Code and models released: huggingface.co/papers/2504.08… silentview.github.io/GigaTok/

Likes: 75 · Replies: 1 · Reposts: 10

Xihui Liu

@xihuiliu

7 months ago

Arrived in Singapore! Tengyao will present our work SJD at #ICLR tomorrow. Come chat with us at Poster #161 tomorrow morning, 10:00-12:30.

Likes: 6 · Replies: 0 · Reposts: 0

Xintao Wang

@xinntao

6 months ago

(2/2) We release a survey of Interactive Generative Video (IGV). Paper: arxiv.org/abs/2504.21853 2. Framework of IGV, consisting of five main components: Generation, Control, Memory, Dynamics and Intelligence. Research work from Kling AI.

Likes: 4 · Replies: 0 · Reposts: 1

Xihui Liu

@xihuiliu

6 months ago

Excited to release the survey of interactive generative videos!

Likes: 9 · Replies: 0 · Reposts: 0

Xihui Liu

@xihuiliu

6 months ago

Introducing GoT-R1, which enhances visual generation by using reinforcement learning to improve semantic-spatial chain-of-thought reasoning. Code available! Arxiv: arxiv.org/abs/2505.17022 Code: github.com/gogoduan/GoT-R1 huggingface.co/papers/2505.17…

Likes: 14 · Replies: 0 · Reposts: 3

Xihui Liu

@xihuiliu

6 months ago

Thanks to TwelveLabs for organizing and inviting us! My Ph.D. student, Ms. Yi Chen, will share our recent research on benchmarking and improving the reasoning abilities of multimodal LLMs. Fri, May 30, 2025, 9:30-10:30 AM PST. Registration: mailchi.mp/twelvelabs/mul…

Likes: 5 · Replies: 1 · Reposts: 3

Hermann

@kumbonghermann

5 months ago

Excited to be presenting our new work, HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation, at #CVPR2025 this week. VAR (Visual Autoregressive Modelling) introduced a very nice way to formulate autoregressive image generation as a next-scale prediction task (from

Likes: 49 · Replies: 1 · Reposts: 21

Prithvijit

@prithvijitch

5 months ago

The WorldModelBench workshop is happening tomorrow (June 12th) at #CVPR2025! We have an exciting series of talks, so do attend! Place: Room 108. Time: Morning Session. #NVIDIAResearch

Likes: 19 · Replies: 1 · Reposts: 9

Xihui Liu

@xihuiliu

5 months ago

I am at #CVPR2025 this week and looking forward to chatting. Parallelized Autoregressive Visual Generation (Highlight, yuqingwang1029.github.io/PAR-project/): Sat 10:30 am, #220. T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation (t2v-compbench-2025.github.io): Fri 4:00 pm, #290.

Likes: 40 · Replies: 0 · Reposts: 6

Xihui Liu

@xihuiliu

5 months ago

#CVPR2025 We will present our CVPR Highlight paper, Parallelized Autoregressive Visual Generation (arxiv.org/abs/2412.15119), at ExHall D #220 this morning. We are also happy to chat about our recent work, TokenBridge: Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation

Likes: 20 · Replies: 0 · Reposts: 2

Yukun Huang

@yukun6414

5 months ago

🎉 Our paper DreamCube has been accepted to #ICCV2025! Thanks to AK for sharing our work! Project page: yukun-huang.github.io/DreamCube/ Code: github.com/yukun-huang/Dr… Model: huggingface.co/KevinHuang/Dre… Video: youtube.com/watch?v=7x4Elc… Special thanks to my co-authors: Xihui Liu, Kaiyi HUANG

Likes: 51 · Replies: 2 · Reposts: 9