Prateek Yadav (@prateeky2806)'s Twitter Profile
Prateek Yadav

@prateeky2806

Incoming pre-training @AlatMeta,
PhD at @unccs, @GoogleDeepMind,

Efficient and Adaptive LLMs, MoE, Model Merging, RLHF, Efficiency

ID: 2598124981

Link: http://prateeky2806.github.io
Joined: 01-07-2014 14:48:47

992 Tweets

2.2K Followers

2.2K Following

Prateek Yadav (@prateeky2806)

So does this mean the secret sauce is spilled? GDM's long context was the longest standing AI moat (apart from this) that anyone had in the last couple of years when sota models were being dethroned every other day. The Llama team did great! Congratulations Aston Zhang

Artificial Analysis (@artificialanlys)

Llama 4 Intelligence Index Update: We have now replicated Meta’s claimed values for MMLU Pro and GPQA Diamond, pushing our Intelligence Index scores for both Scout and Maverick higher. Key update details: ➤ We noted in our first post 48 hours ago that we noticed discrepancies

finbarr (@finbarrtimbers)

I feel like Google’s TPU marketing strategy is to have a legion of ex-deepminders go out and join ai startups only to be extremely disappointed by the state of large scale GPU cluster tooling

Elias Stengel-Eskin (on the faculty job market) (@eliaseskin)

🚨Announcing TaCQ 🚨 a new mixed-precision quantization method that identifies critical weights to preserve. We integrate key ideas from circuit discovery, model editing, and input attribution to improve low-bit quant., w/ 96% 16-bit acc. at 3.1 avg bits (~6x compression)

Hanqi Xiao (@hanqi_xiao)

Excited to share my first paper as first author: "Task-Circuit Quantization" 🎉 I led this work to explore how interpretability insights can drive smarter model compression. Big thank you to Elias Stengel-Eskin, Yi Lin Sung (on job market), and Mohit Bansal for mentorship and collaboration. More to come!

yi (@agihippo)

I think people shouldn't do phds anymore. Just focus on hardcore engineering / infra in a big frontier company and branch off to research if you're interested. All the research is happening in frontier labs anyway.

Jialu Li (@jialuli96)

🚀New paper out - We present Video-MSG (Multimodal Sketch Guidance), a novel planning-based training-free guidance method for T2V models, improving control of spatial layout and object trajectories. 🔧 Key idea: • Generate a Video Sketch — a spatio-temporal plan with

Arthur Douillard (@ar_douillard)

30+ accepted papers, 6 oral papers, 6 guest speakers: join us at ICLR 2026 on the 27th, Hall 4 #3, for a full day of workshop on Modularity for Collaborative, Decentralized, and Continual Learning. sites.google.com/corp/view/mcdc… Lucio Dery Jnr Mwinm, Fengyuan Liu, and myself will be organizing

Elias Stengel-Eskin (on the faculty job market) (@eliaseskin)

Extremely excited to announce that I will be joining UT Austin Computer Science at UT Austin in August 2025 as an Assistant Professor! 🎉 I’m looking forward to continuing to develop AI agents that interact/communicate with people, each other, and the multimodal world. I’ll be recruiting PhD

Ari K (@arikuschnir)

WE CAN TALK! I spent 2 hours playing with Veo 3 @googledeepmind and it blew my mind now that it can do sound! It can talk, and this is all out of the box...

Jaehong Yoon (on the faculty job market) (@jaeh0ng_yoon)

Thrilled to share that I’ll be joining the College of Computing and Data Science at Nanyang Technological University (NTU) (NTU Singapore) as an Assistant Professor, starting in August 2025 🇸🇬🥳 I’ll continue my research on building trustworthy and continually adaptable multimodal AI,

Prateek Jain (@jainprateek_)

We are hiring Technical Program Manager to organize and enable our research teams to be the best at what they do and to make fast-paced progress towards our mission of bringing AGI responsibly. Ideal candidates should have a demonstrable record of strong program management

Prateek Yadav (@prateeky2806)

I've officially joined Meta Superintelligence Labs (MSL) org in the Bay Area. I'll be working on critical aspects of pre-training, synthetic data and RL for the next generation of models. Humbled and eager to contribute to the quest for superintelligence. AI at Meta