Shang Zhu (@shangzhu18) Twitter Tweets • TwiCopy

Together AI

6 months ago

2/ Why does this matter? MoAA makes high-quality AI more accessible & scalable—no need for massive proprietary models! 📝Check out our blog post: together.ai/blog/moaa 🔗 Full details in our paper: arxiv.org/abs/2505.03059 By Junlin Wang Roy Xie Shang Zhu Jue WANG

thumb_up_off_alt2

chat_bubble_outline1

repeat1

shareShare

James Zou

@james_y_zou

6 months ago

Our new #icml2025 paper w/Together AI shows how to use synthetic data from Mixture-of-Agents to boost LM fine-tuning + RL. Turns out a mixture of small agents is much more effective/cheaper than using a large LM as teacher 🌐together.ai/blog/moaa 📜arxiv.org/abs/2505.03059

Our new #icml2025 paper w/<a href="/togethercompute/">Together AI</a> shows how to use synthetic data from Mixture-of-Agents to boost LM fine-tuning + RL.

Turns out a mixture of small agents is much more effective/cheaper than using a large LM as teacher
🌐together.ai/blog/moaa
📜arxiv.org/abs/2505.03059

thumb_up_off_alt110

chat_bubble_outline7

repeat18

shareShare

Shang Zhu

@shangzhu18

6 months ago

Please check out our freshly accepted #ICML2025 paper on LLM post-training research using Mixture-of-Agents. We released our SFT data and fine-tuned models for the community to try out!

thumb_up_off_alt5

chat_bubble_outline1

repeat0

shareShare

Together AI

@togethercompute

6 months ago

1/ We built an open-source AI agent that can reason like a data scientist. It loads data, writes Python code, retrains when models fail, and solves real Kaggle + DABStep tasks. Here’s how we did it (and how you can too): 👇

thumb_up_off_alt189

chat_bubble_outline8

repeat28

shareShare

Shang Zhu

@shangzhu18

6 months ago

We built a data science agent from scratch and made it accessible to everyone (pip install & one command-line call). Open-source codebase: github.com/togethercomput… We look forward to your feedback! With Federico Bianchi Zain Ben Athiwaratkun James Zou

thumb_up_off_alt19

chat_bubble_outline1

repeat2

shareShare

Zain

@zainhasan6

6 months ago

Check out our fully open source data science agent! It performs complex tasks like hyper-parameter optimization, runs experiments and can even complete kaggle competitions!

thumb_up_off_alt7

chat_bubble_outline0

repeat2

shareShare

James Zou

@james_y_zou

6 months ago

Excited to introduce Open Data Scientist: ✅outperforms Gemini data science agent ✅solves real Kaggle tasks ✅fully open source, easy to adapt ✅sandbox for safe exec Step-by-step tutorial on building our agent together.ai/blog/building-… Great job Federico Bianchi Shang Zhu

thumb_up_off_alt83

chat_bubble_outline1

repeat9

shareShare

Tanishq Mathew Abraham, Ph.D.

@iscienceluvr

5 months ago

Shrinking the Generation-Verification Gap with Weak Verifiers "we introduce Weaver, a framework for designing a strong verifier by combining multiple weak, imperfect verifiers." "Weaver leverages weak supervision to estimate each verifier’s accuracy and combines their outputs

thumb_up_off_alt124

chat_bubble_outline3

repeat24

shareShare

Zach Xu

@nehzux

5 months ago

LLMs are getting more powerful, but they still struggle with super long documents. A common trick is "Divide and Conquer" - chop it up, process chunks, and combine. But... when does this actually work? And when does it fail catastrophically? We investigated. 🧵

thumb_up_off_alt10

chat_bubble_outline1

repeat5

shareShare

Jon Saad-Falcon

@jonsaadfalcon

5 months ago

How can we close the generation-verification gap when LLMs produce correct answers but fail to select them? 🧵 Introducing Weaver: a framework that combines multiple weak verifiers (reward models + LM judges) to achieve o3-mini-level accuracy with much cheaper non-reasoning

thumb_up_off_alt204

chat_bubble_outline11

repeat56

shareShare

Jon Saad-Falcon

@jonsaadfalcon

5 months ago

📝 Paper: arxiv.org/abs/2506.18203 ✍️ Blog: hazyresearch.stanford.edu/blog/2025-06-1… 🔧 Code: github.com/HazyResearch/s… 🤗 Datasets and Models: huggingface.co/collections/ha… Joint work with Kelly Buchanan, Mayee Chen, Tzu-Heng Huang, Brendan McLaughlin, Tanvir Bhathal, Shang Zhu, Ben Athiwaratkun, Fred Sala,

thumb_up_off_alt26

chat_bubble_outline0

repeat5

shareShare

Mayee Chen

@mayeechen

5 months ago

LLMs often generate correct answers but struggle to select them. Weaver tackles this by combining many weak verifiers (reward models, LM judges) into a stronger signal using statistical tools from Weak Supervision—matching o3-mini-level accuracy with much cheaper models! 📊

thumb_up_off_alt223

chat_bubble_outline15

repeat33

shareShare

Changwen Xu

@changwen_xu98

5 months ago

1/n📢 New preprint: CLOUD — scalable & physics-informed foundation model for crystals! - Pretrained on 6M structures - Symmetry-consistent strings (SCOPE) - Integrates Debye model for thermodynamic consistency 🔥 👉 arxiv.org/abs/2506.17345 #AI4Science #materials #FoundationModels

thumb_up_off_alt3

chat_bubble_outline1

repeat1

shareShare

Changwen Xu

@changwen_xu98

5 months ago

8/n Our preprint and code are available now: Paper: arxiv.org/abs/2506.17345 Code: github.com/ChangwenXu98/C… Thanks Shang Zhu and Venkat Viswanathan for the great support!

thumb_up_off_alt1

chat_bubble_outline0

repeat1

shareShare

Together AI

@togethercompute

5 months ago

Introducing the Open Deep Research app! Generate detailed reports on any topic with open source LLMs. Free & fully open source. We’re releasing everything: evaluation dataset, code, app, and blog.🔥

thumb_up_off_alt280

chat_bubble_outline9

repeat43

shareShare

Shang Zhu

@shangzhu18

5 months ago

Thrilled to see our deep research efforts turning to an app! Built by Hassan !

thumb_up_off_alt7

chat_bubble_outline0

repeat0

shareShare

Hassan

@nutlope

5 months ago

Announcing Open Deep Research! Generate detailed reports on any topic with open source LLMs. 100% free and open source. opendeepresearch.dev

thumb_up_off_alt351

chat_bubble_outline17

repeat43

shareShare

Ben Athiwaratkun @ ICLR

@ben_athi

5 months ago

Open Deep Research app + fully open recipe ☺️

thumb_up_off_alt11

chat_bubble_outline0

repeat3

shareShare

Azalia Mirhoseini

@azaliamirh

5 months ago

Introducing Weaver, a test time scaling method for verification! Weaver shrinks the generation-verification gap through a low-overhead weak-to-strong optimization of a mixture of verifiers (e.g., LM judges and reward models). The Weavered mixture can be distilled into a tiny

thumb_up_off_alt169

chat_bubble_outline2

repeat35

shareShare