
Vipul Ved Prakash
@vipulved
Building AI factories. Co-founder, CEO @togethercompute
ID: 14418455
17-04-2008 09:27:43
2.2K Tweets
5.5K Followers
970 Following


Together AI is possibly the best place to transition to B200 and GB200 with our new infrastructure and deep support for Blackwell kernels and optimization.



Qwen 3 235B now on Together AI API! Qwen 3 is a reasoning model that has a non-reasoning instruct mode with allowance for setting a thinking budget. It’s efficient ($0.20/M input & $0.60/M output on our throughput-optimized endpoint) and fantastic on a variety of

Join us this Monday, May 5th at 10:00 AM PT for a talk on Matryoshka Principles for Adaptive Intelligence by Aditya Kusupati, Staff Research Scientist at Google DeepMind. Register below!



There’s a lot of FUD around the safety of OSS models, but these models allow us to study the safety implications of AI in an open and transparent way. Excited for the partnership with General Analysis to comprehensively stress test OSS (and closed) models.


A new effort led by Percy Liang to build open models in a radically participatory way. The project is now live, including the first Marin models. Here's the project website: marin.community. And the 8B model is live on Together's model platform:

Together Code Sandbox and Code Interpreter, the fastest VM technology on the planet, designed for AI-model-generated code execution, text-to-app platforms and RL pipelines, are GA today!

The new R1 is live on Together AI API as well as chat.together.ai!



Bulk intelligence now available on Together AI. Our new Batch API offers broad access to SOTA open source LLMs for high-throughput uses. This is great for synthetic data generation, benchmarking, content review and summarization, document extraction and more. Also intro


Together AI is building 2 gigawatts of AI factories (~100,000 GPUs) in the EU over the next 4 years with the first phase live in H2 2025. AI compute is at <1% saturation relative to our 2035 forecast and we are starting early to build a large-scale sustainable AI cloud
