Hao Zhang (@haozhangml) 's Twitter Profile
Hao Zhang

@haozhangml

Asst. Prof. @HDSIUCSD and @ucsd_cse running @haoailab. Cofounder and runs @lmsysorg. 20% with @SnowflakeDB

ID: 1415378240772976650

linkhttps://cse.ucsd.edu/~haozhang calendar_today14-07-2021 18:30:49

598 Tweet

4,4K Followers

441 Following

Hao Zhang (@haozhangml) 's Twitter Profile Photo

Super cool work by my buddies at snowflake! 😀 On gap in the speculative decoding (SD) community is that the reported performance in papers often lags behind what observed in a real-world production system, due to new complexity in real serving. This is perhaps one of the

Hao Zhang (@haozhangml) 's Twitter Profile Photo

PD disaggregation shines at all scales, but its true power is fully unleashed only at relative large scale, conditioned on *very meticulous* engineering. OSS folks have seen glimpses of this, but LMSYS Org just showed what it really looks like: a 96-GPU PD setup that nearly

Diyi Yang (@diyi_yang) 's Twitter Profile Photo

🚀 Introducing CAVA: The Comprehensive Assessment for Voice Assistants A new benchmark for evaluating end-to-end, speech-in-speech-out voice assistants in real-world scenarios. We go beyond single tasks or metrics to test the capabilities required for voice assistants:

🚀 Introducing CAVA: The Comprehensive Assessment for Voice Assistants

A new benchmark for evaluating end-to-end, speech-in-speech-out voice assistants in real-world scenarios.

We go beyond single tasks or metrics to test the capabilities required for voice assistants:
PyTorch (@pytorch) 's Twitter Profile Photo

PyTorch Foundation has expanded into an umbrella foundation. vLLM and DeepSpeed have been accepted as hosted projects, advancing community-driven AI across the full lifecycle. Supporting quotes provided by the following members: AMD, Arm, Amazon Web Services, Google, Huawei,

PyTorch Foundation has expanded into an umbrella foundation. <a href="/vllm_project/">vLLM</a> and <a href="/DeepSpeedAI/">DeepSpeed</a> have been accepted as hosted projects, advancing community-driven AI across the full lifecycle.

Supporting quotes provided by the following members: <a href="/AMD/">AMD</a>, <a href="/Arm/">Arm</a>, <a href="/AWS/">Amazon Web Services</a>, <a href="/Google/">Google</a>, <a href="/Huawei/">Huawei</a>,
Hao Zhang (@haozhangml) 's Twitter Profile Photo

Was casually chatting with a few buddies at snow the other day and realized that Snowflake might just have the best text2sql team and capabilities on the planet NOW? 😎😀🔥 ✅ #1 on BIRD (single-model, an extremely competitive benchmark) — with our own post-trained

Hao Zhang (@haozhangml) 's Twitter Profile Photo

FastVideo v1 is here! 🎬 Our FastVideo team have been working hard and cooking up something new ☕️☕️: a unified, programmable API for video generation that simplifies model authoring and integrates various DiT-related optimizations. We hope to make video generation as seamless

Tianqi Chen (@tqchenml) 's Twitter Profile Photo

#MLSys2025 make sure to attend 10:30am keynote Ion Stoica An AI stack: from scaling AI workloads to evaluating LLMs. Checkout full schedule at mlsys.org/virtual/2025/c…

#MLSys2025 make sure to attend 10:30am keynote <a href="/istoica05/">Ion Stoica</a>  An AI stack: from scaling AI workloads to evaluating LLMs. Checkout full schedule at mlsys.org/virtual/2025/c…
MBZUAI (@mbzuai) 's Twitter Profile Photo

An exceptional morning at #IFMLaunch! From @EricXing's vision for world models to Yejin Choi 's insights on "bending scaling laws," we're exploring how collaboration between industry and academia will shape AI's future. More exciting sessions coming this afternoon with

An exceptional morning at #IFMLaunch! From @EricXing's vision for world models to <a href="/YejinChoinka/">Yejin Choi</a> 's insights on "bending scaling laws," we're exploring how collaboration between industry and academia will shape AI's future. More exciting sessions coming this afternoon with
Snowflake (@snowflakedb) 's Twitter Profile Photo

Solving real enterprise AI pain points! Our AI Research just shared two impactful new open-source efforts: ➡️ Arctic-Text2SQL-R1: Generate reliable SQL from natural language that truly executes on complex enterprise schemas. It’s trained for actual execution correctness using

Solving real enterprise AI pain points! Our AI Research just shared two impactful new open-source efforts:

➡️ Arctic-Text2SQL-R1: Generate reliable SQL from natural language that truly executes on complex enterprise schemas. It’s trained for actual execution correctness using
Perry Zhang (@py_z001) 's Twitter Profile Photo

I will be giving a talk in GPU MODE tomorrow (May 31 12pm PST) about FastVideo/STA/VSA. Come if you're interested! youtube.com/watch?v=x44iGp…

I will be giving a talk in <a href="/GPU_MODE/">GPU MODE</a> tomorrow (May 31 12pm PST) about FastVideo/STA/VSA. 
Come if you're interested!

youtube.com/watch?v=x44iGp…
Hao Zhang (@haozhangml) 's Twitter Profile Photo

Wondering if the latest open-weight Qwen3 and Deepseek-R1-0528 performs on games? Check this thread out. Also, stay tuned for a new release of our game benchmark soon...🧑‍🍳👩‍🍳👨‍🍳