Xihui Liu (@xihuiliu)'s Twitter Profile
Xihui Liu

@xihuiliu

Assistant Professor @ HKU. Previous Postdoc @ UC Berkeley and PhD @ CUHK MMLab

ID: 3329738772

Link: https://xh-liu.github.io/ Joined: 24-08-2015 20:11:40

41 Tweets

1.1K Followers

198 Following

Prithvijit (@prithvijitch)

World Models have gained significant momentum in the research community over the past few years. However, we still lack systematic approaches for evaluating them properly for downstream applications and making informed design decisions. We're organizing the WorldModelBench

Xihui Liu (@xihuiliu)

Check out our new work introducing generation chain-of-thought (GoT) reasoning for visual generation and editing. Code and data released! Arxiv: arxiv.org/abs/2503.10639 Code and data: github.com/rongyaofang/GoT Huggingface: huggingface.co/papers/2503.10…

Xihui Liu (@xihuiliu)

Excited to introduce our new work TokenBridge. We hope to inspire a new pathway to autoregressive visual generation by bridging discrete and continuous tokens. arxiv.org/abs/2503.16430 github.com/yuqingwang1029… yuqingwang1029.github.io/TokenBridge/ huggingface.co/papers/2503.16…

Xihui Liu (@xihuiliu)

My Ph.D. student, Mr. Tianwei Xiong, will introduce our long-take video dataset LVD-2M (NeurIPS 2024) in this webinar. silentview.github.io/LVD-2M/ arxiv.org/pdf/2410.10816

Xihui Liu (@xihuiliu)

Why are discrete visual tokenizers difficult to scale? In GigaTok, we study the key factors for scaling tokenizers, and scale VQ tokenizers to 3B for better reconstruction, AR generation, and representation. Code and models released huggingface.co/papers/2504.08… silentview.github.io/GigaTok/

Xihui Liu (@xihuiliu)

Arrived in Singapore! Tengyao will present our work SJD at #ICLR tomorrow! Come chat with us at Poster #161 tomorrow morning, 10:00-12:30.

Xintao Wang (@xinntao)

(2/2) We release a survey of Interactive Generative Video (IGV). Paper: arxiv.org/abs/2504.21853 2. Framework of IGV, consisting of five main components: Generation, Control, Memory, Dynamics and Intelligence. Research work from Kling AI.

Xihui Liu (@xihuiliu)

Introducing GoT-R1, which enhances visual generation by using reinforcement learning to improve semantic-spatial chain-of-thought reasoning. Code available! Arxiv: arxiv.org/abs/2505.17022 Code: github.com/gogoduan/GoT-R1 huggingface.co/papers/2505.17…

Xihui Liu (@xihuiliu)

Thanks to TwelveLabs for organizing and inviting us! My Ph.D. student, Ms. Yi Chen, will share our recent research on benchmarking and improving the reasoning abilities of multimodal LLMs. Fri, May 30, 2025, 9:30-10:30 AM PST. Registration: mailchi.mp/twelvelabs/mul…

Hermann (@kumbonghermann)

Excited to be presenting our new work, HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation, at #CVPR2025 this week. VAR (Visual Autoregressive Modelling) introduced a very nice way to formulate autoregressive image generation as a next-scale prediction task (from

Prithvijit (@prithvijitch)

The WorldModelBench workshop is happening tomorrow (June 12th) at #CVPR2025! We have an exciting series of talks, so do attend! Place: Room 108. Time: Morning Session. #NVIDIAResearch

Xihui Liu (@xihuiliu)

I am at #CVPR2025 this week and look forward to chatting. Parallelized Autoregressive Visual Generation (Highlight, yuqingwang1029.github.io/PAR-project/): Sat 10:30 am, #220. T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation (t2v-compbench-2025.github.io): Fri 4:00 pm, #290.

Xihui Liu (@xihuiliu)

#CVPR2025 We will present our CVPR Highlight, Parallelized Autoregressive Visual Generation (arxiv.org/abs/2412.15119), at ExHall D #220 this morning. You are also welcome to chat about our recent work, TokenBridge: Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation.
