Victor Zhong (@hllo_wrld) 's Twitter Profile
Victor Zhong

@hllo_wrld

ML+NLP AP @UWCheritonCS, @cifar_news AIChair @vectorinst. Former @MSFTResearch @MetaAI, @SFResearch via @MetamindIO, @uwnlp, @StanfordNLP, @eceuoft.

ID: 257287707

linkhttp://victorzhong.com calendar_today25-02-2011 03:19:57

1,1K Tweet

4,4K Followers

481 Following

Taco Cohen (@tacocohen) 's Twitter Profile Photo

Imagine inventing attention and getting the runner up test of time award 😂 🥈 Congrats to all first and second place winners!

Ximing Lu (@gximing) 's Twitter Profile Photo

With the rise of R1, search seems out of fashion? We prove the opposite! 😎 Introducing Retro-Search 🌈: an MCTS-inspired search algorithm that RETROspectively revises R1’s reasoning traces to synthesize untaken, new reasoning paths that are better 💡, yet shorter in length ⚡️.

With the rise of R1, search seems out of fashion? We prove the opposite! 😎

Introducing Retro-Search 🌈: an MCTS-inspired search algorithm that RETROspectively revises R1’s reasoning traces to synthesize untaken, new reasoning paths that are better 💡, yet shorter in length ⚡️.
XLANG NLP Lab (@xlangnlp) 's Twitter Profile Photo

🚀 Exciting news! OpenAI's o3 & o4-mini, the most capable reasoning models, are now live on Computer Agent Arena! Test, vote, and explore their full potential with CUAs at arena.xlang.ai! Join the community and dive in!

🚀 Exciting news! <a href="/OpenAI/">OpenAI</a>'s o3 &amp; o4-mini, the most capable reasoning models, are now live on Computer Agent Arena!
Test, vote, and explore their full potential with CUAs at arena.xlang.ai! Join the community and dive in!
Ion Stoica (@istoica05) 's Twitter Profile Photo

This journey has been a blast, and I'm very much looking forward to an exciting future, driven by our incredible community.

Victor Zhong (@hllo_wrld) 's Twitter Profile Photo

Some emergency PhD/MS admissions: if you are a recent ML/NLP applicant having second thoughts about grad school in the US or recently had your funding pulled, please consider reaching out at r2llab.com/openings.

Tao Yu (@taoyds) 's Twitter Profile Photo

🤔Static CUA benchmarks enable fast model dev but lack task variety and risk overfitting. Computer Agent Arena tests crowdsourced real-world tasks. OSWorld: 🥇UI-Tars1.5🥈Operator🥉Claude 3.7 CUA Arena: 🥇Claude 3.7🥈Operator🥉UI-Tars1.5 🚀Rankings likely to evolve quickly

🤔Static CUA benchmarks enable fast model dev but lack task variety and risk overfitting. 

Computer Agent Arena tests crowdsourced real-world tasks.

OSWorld: 🥇UI-Tars1.5🥈Operator🥉Claude 3.7
CUA Arena: 🥇Claude 3.7🥈Operator🥉UI-Tars1.5

🚀Rankings likely to evolve quickly
Victor Zhong (@hllo_wrld) 's Twitter Profile Photo

Fantastic and very inspirational talk by NVIDIA ‘s Bill Dally on the personal history of GPUs at the CCC Computing Futures Symposium

Michael Solodko (@saladcoinc) 's Twitter Profile Photo

Honored to have received the Vector Scholarship in AI and thank you to my supervisor, Victor Zhong (Victor Zhong), for the nomination and support!

XLANG NLP Lab (@xlangnlp) 's Twitter Profile Photo

🔥New Computer Agent Arena Leaderboard Updates (2k+ user votes)! 🤔Which VLMs act better as computer use agents (CUAs)? 1, Claude Sonnet 4 🥇 2, Claude 3.7 Sonnet 🥈 3, UI-TARS-1.5 🥉 4, Operator More insights in the thread 👇 arena.xlang.ai

🔥New Computer Agent Arena Leaderboard Updates (2k+ user votes)!
🤔Which VLMs act better as computer use agents (CUAs)?

1, Claude Sonnet 4 🥇
2, Claude 3.7 Sonnet 🥈
3, UI-TARS-1.5 🥉
4, Operator

More insights in the thread 👇
arena.xlang.ai
Allen Nie (🇺🇦☮️) (@allen_a_nie) 's Twitter Profile Photo

Decision-making with LLM can be studied with RL! Can an agent solve a task with text feedback (OS terminal, compiler, a person) efficiently? How can we understand the difficulty? We propose a new notion of learning complexity to study learning with language feedback only. 🧵👇

Decision-making with LLM can be studied with RL! Can an agent solve a task with text feedback (OS terminal, compiler, a person) efficiently? How can we understand the difficulty? We propose a new notion of learning complexity to study learning with language feedback only. 🧵👇
Yu Xiang (@yuxiang_irvl) 's Twitter Profile Photo

“As a PHD student, your job is not publishing a paper every quarter. Focus on a problem in deep understanding and solve it in years under the protect of your adviser” from Russ Tedrake #RSS2025

“As a PHD student, your job is not publishing a paper every quarter. Focus on a problem in deep understanding and solve it in years under the protect of your adviser” from <a href="/RussTedrake/">Russ Tedrake</a> #RSS2025
Christopher Manning (@chrmanning) 's Twitter Profile Photo

I’ve joined AIX Ventures as a General Partner, working on investing in deep AI startups. Looking forward to working with founders on solving hard problems in AI and seeing products come out of that!  Thank you Yuliya Chernova at The Wall Street Journal for covering the news: wsj.com/articles/ai-re…

Dan Roy (@roydanroy) 's Twitter Profile Photo

Would love help identifying amazing ML researchers with strong connections to Canada who are currently outside Canada (thus potentially targets for recruitment as US situation deteriorates). DMs please. Retweet please.

Victor Zhong (@hllo_wrld) 's Twitter Profile Photo

I’m planning a year 1 retrospective on life as a new assistant professor to help those considering academia. What would you like me to cover? Teaching, research, work-life balance, service roles, lab building…? #AcademicTwitter

Victor Zhong (@hllo_wrld) 's Twitter Profile Photo

I will be giving a talk at the ICML Conference Workshop on Computer Use Agents from July 18-19. Please let me know if you'd like to chat! If you are a prospective student, the R2L Lab will be looking for 2 grad students + 1 postdoc (Dec ddl for 2026). Details @ r2llab.com/openings.