Sanyam Bhutani (@bhutanisanyam1) Twitter Tweets • TwiCopy

We are hiring on the PyTorch team! 🙏 Partner Engineer is a mix of building applications with real use cases, applied research, and software engineering. I work on the Llama wing but learn so much every time I speak with the PyTorch org. They are some of the smartest and most humble

Likes: 381 · Replies: 5 · Reposts: 21

Sanyam Bhutani

@bhutanisanyam1

5 months ago

Lake Tahoe is beautiful! ❤️

Likes: 47 · Replies: 4 · Reposts: 1

Suhail

@suhail

5 months ago

Playbook to defeat frontier AI labs without billions of dollars initially:
- build an app on top of their models
- solve an important, large problem for humanity
- resume training on top OSS models to reduce dependency, lower costs for certain tasks, increase performance
-

Likes: 1.1K · Replies: 41 · Reposts: 125

Mark Saroufim

@marksaroufim

5 months ago

x.com/i/article/1904…

Likes: 394 · Replies: 9 · Reposts: 67

Sanyam Bhutani

@bhutanisanyam1

4 months ago

Llama 4 supports 10M context length! 🙏 Reading AN ENTIRE GitHub repo of 900k tokens and writing a guide on it takes under 3 minutes! We are launching two new models, Scout and Maverick:
- Up to 10M context length
- Scout fits on a single H100 with int4 quant
- Up to 5 images
-

Likes: 304 · Replies: 5 · Reposts: 20
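
The "Scout fits on single H100 with int4 quant" claim above can be sanity-checked with back-of-the-envelope arithmetic. This is only a rough sketch: the parameter count of about 109 billion and the 80 GB of H100 memory are assumptions that do not appear in the tweet, and the estimate covers weights only, ignoring KV cache, activations, and quantization metadata.

```python
# Rough weight-memory estimate; not an official sizing from Meta.
# ASSUMPTION: Scout has roughly 109e9 total parameters (not stated in the tweet).
# ASSUMPTION: a single H100 offers 80 GB of memory.

def weight_memory_gb(num_params: float, bits_per_param: int) -> float:
    """Memory needed for the weights alone, in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bits_per_param / 8 / 1e9

ASSUMED_PARAMS = 109e9
ASSUMED_H100_GB = 80

int4_gb = weight_memory_gb(ASSUMED_PARAMS, 4)   # 4 bits per weight
bf16_gb = weight_memory_gb(ASSUMED_PARAMS, 16)  # 16 bits per weight

fits_int4 = int4_gb < ASSUMED_H100_GB
fits_bf16 = bf16_gb < ASSUMED_H100_GB

print(f"int4 weights: {int4_gb:.1f} GB, fits on one GPU: {fits_int4}")
print(f"bf16 weights: {bf16_gb:.1f} GB, fits on one GPU: {fits_bf16}")
```

Under these assumptions the int4 weights take well under 80 GB while 16-bit weights do not, which is consistent with the tweet. Long contexts would still need extra memory for the KV cache.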

rdyro

@rdyro128523

4 months ago

Llama 4 inference in pure JAX! Expert/tensor parallelism with int8 quantization. Contributions welcome!

Likes: 131 · Replies: 2 · Reposts: 14

Artificial Analysis

@artificialanlys

4 months ago

Llama 4 Intelligence Index Update: We have now replicated Meta’s claimed values for MMLU Pro and GPQA Diamond, pushing our Intelligence Index scores for both Scout and Maverick higher.
Key update details:
➤ We noted in our first post 48 hours ago that we noticed discrepancies

Likes: 742 · Replies: 49 · Reposts: 195

Sanyam Bhutani

@bhutanisanyam1

4 months ago

1.5M tokens to website in 5 minutes 🙏
- Upload an entire repo of apps
- Upload multiple sketches website
- Use the repo content to populate the template
Llama 4 supports 10M context + up to 10 images in a session:

Likes: 45 · Replies: 5 · Reposts: 7

Daniel Han

@danielhanchen

4 months ago

Also note if you're not getting good Llama 4 results, there are a few bugs:
1. QK Norm eps should be 1e-5 - collabed with HF on the fix! github.com/huggingface/tr…
2. RoPE scaling for Scout changed: github.com/ggml-org/llama…
3. vLLM +2% acc shared QK norm fix: github.com/vllm-project/v…

Likes: 188 · Replies: 5 · Reposts: 19
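
To illustrate why the epsilon in the first bug above can matter, here is a minimal sketch of an unweighted RMS-style normalization in plain Python. It is not the Hugging Face or vLLM implementation. The function name, the input vector, and the comparison value 1e-6 are all hypothetical and chosen only for illustration; only the 1e-5 value comes from the tweet.

```python
import math

def rms_norm(vec, eps):
    """Scale vec by 1 / sqrt(mean(v^2) + eps). Hypothetical helper, no learned weight."""
    mean_sq = sum(v * v for v in vec) / len(vec)
    scale = 1.0 / math.sqrt(mean_sq + eps)
    return [v * scale for v in vec]

# Hypothetical small-magnitude vector: here mean(v^2) is comparable to eps,
# so the choice of eps changes the output noticeably.
small = [1e-3, -2e-3, 3e-3, -4e-3]

out_tweet_eps = rms_norm(small, 1e-5)  # value named in the tweet
out_other_eps = rms_norm(small, 1e-6)  # illustrative comparison value only

# Ratio between the two outputs for the first element.
ratio = out_other_eps[0] / out_tweet_eps[0]
print(f"outputs differ by a factor of {ratio:.3f}")
```

For inputs with large magnitude the two settings give nearly identical results. The difference shows up only when the mean square of the vector is close to the epsilon itself.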

Unsloth AI

@unslothai

4 months ago

We’re excited to showcase all the amazing ways you’ve been building with Llama + Unsloth at LlamaCon 2025! 🦥🦙 Get ready for surprises and exciting announcements from Meta and us on Apr 29 in SF. 👀 Also big thanks to AI at Meta for the support and awesome merch!

Likes: 117 · Replies: 5 · Reposts: 4

Sanyam Bhutani

@bhutanisanyam1

3 months ago

Super excited to launch Synthetic-Data-Kit! 🙏

Fine-tuning LLMs is easy, there are many packages to get started, Unsloth AI is my absolute favorite.

However, there is still a BIG HURDLE when working on fine-tuning: Data preparation

Today I’m super grateful to be launching a

Likes: 463 · Replies: 15 · Reposts: 53

Unsloth AI

@unslothai

3 months ago

We partnered with AI at Meta on a free notebook that turns your documents into high-quality synthetic datasets using Llama!

Features:
• Parses PDFs, websites, videos
• Use Llama to generate QA pairs + auto-filter data
• Fine-tunes dataset with Llama

🔗colab.research.google.com/github/unsloth…

Likes: 772 · Replies: 17 · Reposts: 142

Sanyam Bhutani

@bhutanisanyam1

3 months ago

Llama Synthetic Data Fine-Tuning Guide! 🙏 My favourite thing about this tutorial: everything is powered by a 3B model. It covers a step overlooked everywhere: data preparation and generation for fine-tuning. Thanks Unsloth AI team for this gem: colab.research.google.com/github/unsloth…

Likes: 159 · Replies: 2 · Reposts: 26

Unsloth AI

@unslothai

3 months ago

You can now fine-tune TTS models with Unsloth! Train, run and save models like Sesame-CSM and OpenAI's Whisper locally with our free notebooks. Unsloth makes TTS training 1.5x faster with 50% less VRAM. GitHub: github.com/unslothai/unsl… Docs & Notebooks: docs.unsloth.ai/basics/text-to…

Likes: 1.1K · Replies: 25 · Reposts: 170

Daniel Han

@danielhanchen

3 months ago

We're bringing the Unsloth magic to TTS and audio models! There are multiple free Colab notebooks with free GPUs for Whisper, Sesame, Orpheus, Spark, Llasa & Oute on our docs! docs.unsloth.ai/basics/text-to…

Likes: 171 · Replies: 6 · Reposts: 26

Sanyam Bhutani

@bhutanisanyam1

2 months ago

Every birthday, I recap my favourite learnings 🙏 This time, I’ve been laser focussed on simple ideas for fine-tuning and data generation. A roadmap of 8 ideas from papers that I’ve enjoyed the most, along with our implementations: 1. How much data do you need to fine-tune?

Likes: 111 · Replies: 17 · Reposts: 8