Shang Zhu (@shangzhu18) 's Twitter Profile
Shang Zhu

@shangzhu18

AI Researcher at Together AI @togethercompute | alumni of @UMich @CMUEngineering and Xi'an Jiao-Tong University, China

Opinions are my own.

ID: 1017435298035355649

calendar_today12-07-2018 15:47:32

75 Tweet

190 Followers

292 Following

Together AI (@togethercompute) 's Twitter Profile Photo

2/ Why does this matter? MoAA makes high-quality AI more accessible & scalable—no need for massive proprietary models! 📝Check out our blog post: together.ai/blog/moaa 🔗 Full details in our paper: arxiv.org/abs/2505.03059 By Junlin Wang Roy Xie Shang Zhu Jue WANG

James Zou (@james_y_zou) 's Twitter Profile Photo

Our new #icml2025 paper w/Together AI shows how to use synthetic data from Mixture-of-Agents to boost LM fine-tuning + RL. Turns out a mixture of small agents is much more effective/cheaper than using a large LM as teacher 🌐together.ai/blog/moaa 📜arxiv.org/abs/2505.03059

Our new #icml2025 paper w/<a href="/togethercompute/">Together AI</a> shows how to use synthetic data from Mixture-of-Agents to boost LM fine-tuning + RL.

Turns out a mixture of small agents is much more effective/cheaper than using a large LM as teacher
🌐together.ai/blog/moaa
📜arxiv.org/abs/2505.03059
Shang Zhu (@shangzhu18) 's Twitter Profile Photo

Please check out our freshly accepted #ICML2025 paper on LLM post-training research using Mixture-of-Agents. We released our SFT data and fine-tuned models for the community to try out!

Together AI (@togethercompute) 's Twitter Profile Photo

1/ We built an open-source AI agent that can reason like a data scientist. It loads data, writes Python code, retrains when models fail, and solves real Kaggle + DABStep tasks. Here’s how we did it (and how you can too): 👇

Shang Zhu (@shangzhu18) 's Twitter Profile Photo

We built a data science agent from scratch and made it accessible to everyone (pip install & one command-line call). Open-source codebase: github.com/togethercomput… We look forward to your feedback! With Federico Bianchi Zain Ben Athiwaratkun James Zou

Zain (@zainhasan6) 's Twitter Profile Photo

Check out our fully open source data science agent! It performs complex tasks like hyper-parameter optimization, runs experiments and can even complete kaggle competitions!

James Zou (@james_y_zou) 's Twitter Profile Photo

Excited to introduce Open Data Scientist: ✅outperforms Gemini data science agent ✅solves real Kaggle tasks ✅fully open source, easy to adapt ✅sandbox for safe exec Step-by-step tutorial on building our agent together.ai/blog/building-… Great job Federico Bianchi Shang Zhu

Tanishq Mathew Abraham, Ph.D. (@iscienceluvr) 's Twitter Profile Photo

Shrinking the Generation-Verification Gap with Weak Verifiers "we introduce Weaver, a framework for designing a strong verifier by combining multiple weak, imperfect verifiers." "Weaver leverages weak supervision to estimate each verifier’s accuracy and combines their outputs

Shrinking the Generation-Verification Gap with Weak Verifiers

"we introduce Weaver, a framework for designing a strong verifier by combining multiple weak, imperfect verifiers."

"Weaver leverages weak supervision to estimate each verifier’s accuracy and combines their outputs
Zach Xu (@nehzux) 's Twitter Profile Photo

LLMs are getting more powerful, but they still struggle with super long documents. A common trick is "Divide and Conquer" - chop it up, process chunks, and combine. But... when does this actually work? And when does it fail catastrophically? We investigated. 🧵

Jon Saad-Falcon (@jonsaadfalcon) 's Twitter Profile Photo

How can we close the generation-verification gap when LLMs produce correct answers but fail to select them? 🧵 Introducing Weaver: a framework that combines multiple weak verifiers (reward models + LM judges) to achieve o3-mini-level accuracy with much cheaper non-reasoning

How can we close the generation-verification gap when LLMs produce correct answers but fail to select them? 
🧵 Introducing Weaver: a framework that combines multiple weak verifiers (reward models + LM judges) to achieve o3-mini-level accuracy with much cheaper non-reasoning
Jon Saad-Falcon (@jonsaadfalcon) 's Twitter Profile Photo

📝 Paper: arxiv.org/abs/2506.18203 ✍️ Blog: hazyresearch.stanford.edu/blog/2025-06-1… 🔧 Code: github.com/HazyResearch/s… 🤗 Datasets and Models: huggingface.co/collections/ha… Joint work with Kelly Buchanan, Mayee Chen, Tzu-Heng Huang, Brendan McLaughlin, Tanvir Bhathal, Shang Zhu, Ben Athiwaratkun, Fred Sala,

Mayee Chen (@mayeechen) 's Twitter Profile Photo

LLMs often generate correct answers but struggle to select them. Weaver tackles this by combining many weak verifiers (reward models, LM judges) into a stronger signal using statistical tools from Weak Supervision—matching o3-mini-level accuracy with much cheaper models! 📊

LLMs often generate correct answers but struggle to select them. Weaver tackles this by combining many weak verifiers (reward models, LM judges) into a stronger signal using statistical tools from Weak Supervision—matching o3-mini-level accuracy with much cheaper models! 📊
Together AI (@togethercompute) 's Twitter Profile Photo

Introducing the Open Deep Research app! Generate detailed reports on any topic with open source LLMs. Free & fully open source. We’re releasing everything: evaluation dataset, code, app, and blog.🔥

Hassan (@nutlope) 's Twitter Profile Photo

Announcing Open Deep Research! Generate detailed reports on any topic with open source LLMs. 100% free and open source. opendeepresearch​.dev

Azalia Mirhoseini (@azaliamirh) 's Twitter Profile Photo

Introducing Weaver, a test time scaling method for verification! Weaver shrinks the generation-verification gap through a low-overhead weak-to-strong optimization of a mixture of verifiers (e.g., LM judges and reward models). The Weavered mixture can be distilled into a tiny

Introducing Weaver, a test time scaling method for verification! 

Weaver shrinks the generation-verification gap through a low-overhead weak-to-strong  optimization of a mixture of verifiers (e.g., LM judges and reward models). The Weavered mixture can be distilled into a tiny