Qian Huang (@qhwang3) 's Twitter Profile
Qian Huang

@qhwang3

@xai | CS PhD student @StanfordAILab

ID: 847263121865441281

linkhttps://q-hwang.github.io/ calendar_today30-03-2017 01:44:04

145 Tweet

8,8K Followers

307 Following

Yusuf Roohani (@yusufroohani) 's Twitter Profile Photo

Amid growing interest in closed-loop design for biological experiments, we demonstrate how LLM-powered agents can enhance both effectiveness and interpretability. We develop BioDiscoveryAgent to design genetic perturbation experiments 🧵1/6 Jian Vora Qian Huang Percy Liang Jure Leskovec

Qian Huang (@qhwang3) 's Twitter Profile Photo

I will present my paper “MLAgentBench: Evaluating Language Agents on Machine Learning Experimentation” tomorrow at 1:30 pm local time! Also dm me to chat!

lmarena.ai (formerly lmsys.org) (@lmarena_ai) 's Twitter Profile Photo

Woah, another exciting update from Chatbot Arena❤️‍🔥 The results for @xAI’s sus-column-r (Grok 2 early version) are now public**! With over 12,000 community votes, sus-column-r has secured the #3 spot on the overall leaderboard, even matching GPT-4o! It excels in Coding (#2),

Woah, another exciting update from Chatbot Arena❤️‍🔥

The results for @xAI’s sus-column-r (Grok 2 early version) are now public**!

With over 12,000 community votes, sus-column-r has secured the #3 spot on the overall leaderboard, even matching GPT-4o! It excels in Coding (#2),
xAI (@xai) 's Twitter Profile Photo

We are excited to bring together a group of exceptional engineers and product builders who are intrigued by our mission to build maximally truth-seeking AI Join our open house to meet our team, learn more about xAI, and enjoy a fun evening brought you by the creators of the

We are excited to bring together a group of exceptional engineers and product builders who are intrigued by our mission to build maximally truth-seeking AI

Join our open house to meet our team, learn more about xAI, and enjoy a fun evening brought you by the creators of the
Yuhuai (Tony) Wu (@yuhu_ai_) 's Twitter Profile Photo

Three components of Reasoning for AI: 1. Foundation (Pre-training) 2. Self-improvement (RL) 3. Test-time compute (planning). xAI will soon have the best foundation in the world - Grok3. Join us to advance reasoning to the next-level! 🔥🔥 grnh.se/ddabc23e7us

Ethan Knight (@__eknight__) 's Twitter Profile Photo

Earlier today, we released a new model, code-named Aurora, that gives Grok the ability to generate extremely photorealistic images (and in the future, even edit them). It's free to use for all of 𝕏, try it out and send us what you're creating! This model was trained entirely

Earlier today, we released a new model, code-named Aurora, that gives Grok the ability to generate extremely photorealistic images (and in the future, even edit them). It's free to use for all of 𝕏, try it out and send us what you're creating!

This model was trained entirely
Cristiano Giardina (@crisgiardina) 's Twitter Profile Photo

This kinda blew my mind. Grok 3 having both DeepSearch and Think is so powerful: I prompted DeepSearch to create a p5.js clone of Flappy Bird, finding sprites and assets online. Then switched on Think in the same chat to have it implement it. One shot. Wild.

Qian Huang (@qhwang3) 's Twitter Profile Photo

It’s been quite an unbelievable ride since I paused my PhD at Stanford and joined xAI almost a year ago. The journey (building the tool use/agent stack from scratch to demoing DeepSearch as a little research project to converting it into a product launched to millions of people)