Azalia Mirhoseini (@azaliamirh)'s Twitter Profile
Azalia Mirhoseini

@azaliamirh

Assistant Professor of CS at Stanford, Senior Staff Research Scientist at Google DeepMind. Prev: Anthropic, Google Brain

ID: 1469058794

Link: https://scalingintelligence.stanford.edu/ | Joined: 30-05-2013 06:13:58

298 Tweets

13.13K Followers

461 Following

Infini-AI-Lab (@infiniailab):

🥳 Happy to share our new work – Kinetics: Rethinking Test-Time Scaling Laws

🤔 How to effectively build a powerful reasoning agent?

Existing compute-optimal scaling laws suggest 64K thinking tokens + 1.7B model > 32B model.
But that is only half of the picture!

🚨 The O(N²)
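
A rough back-of-the-envelope sketch of the trade-off the thread points at. The 1.7B/64K and 32B figures come from the tweet; the cost model, layer count, and KV width below are illustrative assumptions, not numbers from the paper:

```python
# Illustrative cost model (assumptions, not the paper's):
# per-token dense compute ~ 2 * params, plus attention work that grows
# with the number of cached tokens -> an O(N^2) term in generation length N.

def generation_cost(params, n_tokens, n_layers=28, d_kv=1024):
    dense = 2 * params * n_tokens                            # parameter FLOPs
    attn = n_layers * d_kv * n_tokens * (n_tokens + 1) // 2  # O(N^2) KV/attention term
    return dense + attn

small_long = generation_cost(params=1.7e9, n_tokens=64_000)  # 1.7B model, 64K thinking tokens
big_short  = generation_cost(params=32e9,  n_tokens=4_000)   # 32B model, shorter output

print(f"1.7B @ 64K tokens: {small_long:.2e} FLOPs")
print(f"32B  @ 4K tokens:  {big_short:.2e} FLOPs")
```

Under these assumptions the quadratic attention/KV term grows to rival the dense compute at long generation lengths, eroding much of the small model's apparent advantage, which is presumably the other half of the picture the O(N²) point refers to.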
Ryan Ehrlich (@ryansehrlich):

Giving LLMs very large amounts of context can be really useful, but it can also be slow and expensive. Could scaling inference-time compute help? In our latest work, we show that allowing models to spend test-time compute to “self-study” a large corpus can >20x decode
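
The tweet is cut off, but one plausible reading of “self-study” is spending test-time compute once to condense a corpus, then decoding against the condensed form. A minimal sketch under that assumption; `generate` is a hypothetical stand-in for any LLM completion call, not the paper's API:

```python
# Hypothetical sketch of test-time "self-study" (one reading of the tweet).

def self_study_answer(generate, corpus_chunks, question):
    # Phase 1 (one-time): spend test-time compute condensing each chunk into notes.
    notes = [generate(f"Summarize the key facts:\n{chunk}") for chunk in corpus_chunks]
    digest = "\n".join(notes)
    # Phase 2 (per query): answer from the much shorter digest, so decoding
    # attends over far fewer context tokens than the raw corpus would require.
    return generate(f"Using these notes:\n{digest}\n\nAnswer this question: {question}")
```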

Hermann (@kumbonghermann):

Excited to be presenting our new work, HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation, at #CVPR2025 this week.

VAR (Visual Autoregressive Modelling) introduced a very nice way to formulate autoregressive image generation as a next-scale prediction task (from
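
For readers new to VAR, here is a schematic of next-scale prediction; `predict_scale` is a hypothetical stand-in for the model, and this is not the HMAR implementation:

```python
import torch
import torch.nn.functional as F

# Schematic of VAR-style next-scale prediction: generate latent maps
# coarse-to-fine, conditioning each scale on the upsampled earlier scales.

def next_scale_generation(predict_scale, scales=(1, 2, 4, 8, 16), dim=16):
    canvas = torch.zeros(1, dim, scales[0], scales[0])
    for s in scales:
        cond = F.interpolate(canvas, size=(s, s), mode="nearest")  # upsample context
        canvas = cond + predict_scale(cond, s)  # model adds residual detail at scale s
    return canvas  # final latent map; VAR decodes it to pixels with a VQ-VAE decoder
```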
Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞) (@teortaxestex):

I like this idea very much and have long advocated for something like this. A synthetically enriched «KV prefix» is a natural augmentation to modern long-context models.

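A minimal sketch of what a synthetically enriched KV prefix could mean in practice. All three callables (`generate`, `prefill`, `decode_with_cache`) are hypothetical stand-ins for an LLM completion call, a cache-building prefill pass, and decoding against a cached prefix:

```python
# Hypothetical sketch: pay once to synthesize notes and prefill their KV cache,
# then reuse that cache across queries so the enrichment is free at decode time.

def build_enriched_prefix(generate, prefill, corpus):
    notes = generate(f"Write dense, factual study notes for:\n{corpus}")
    return prefill(notes)  # reusable KV cache covering the synthetic notes

def answer_with_prefix(decode_with_cache, kv_prefix, question):
    # Per-query work only decodes the question against the cached prefix;
    # the synthetic enrichment itself is never recomputed.
    return decode_with_cache(kv_prefix, question)
```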
Soumith Chintala (@soumithchintala):

This is a proper vibe-coding setup for GPU programmers, and it can get you surprisingly far! I honestly think that if this authoring experience is v1, then v10 might become the normal way GPU experts start writing serious custom kernels! Great work Anne Ouyang! (finally

Tanishq Mathew Abraham, Ph.D. (@iscienceluvr):

Shrinking the Generation-Verification Gap with Weak Verifiers

"we introduce Weaver, a framework for designing a strong verifier by combining multiple weak, imperfect verifiers."

"Weaver leverages weak supervision to estimate each verifier’s accuracy and combines their outputs
Alex Ratner (@ajratner):

Very exciting work on using weak supervision for RL: closing the “generation-verification gap”! Once again, principled approaches to labeling and data development are the key!

Oscar Hong (@oscrhong):

Interesting tidbit from Prof. Christopher Manning: the first mention of “Large Language Model” comes from a 1998 NLP workshop in Taiwan!

Paper by Chun-Liang Chen, Bo-Ren Bai, Lee-Feng Chien, Lin-Shan Lee.

“Large” in 1998 = a 20M-word corpus
Azalia Mirhoseini (@azaliamirh):

See Jon Saad-Falcon's post for more details: x.com/JonSaadFalcon/…
Paper: arxiv.org/abs/2506.18203
Blog: hazyresearch.stanford.edu/blog/2025-06-1…
github.com/HazyResearch/s…
Datasets and Models: huggingface.co/collections/ha…

Christopher Manning (@chrmanning):

I’ve joined AIX Ventures as a General Partner, where I'll be investing in deep AI startups. Looking forward to working with founders on solving hard problems in AI and seeing products come out of that! Thank you Yuliya Chernova at The Wall Street Journal for covering the news: wsj.com/articles/ai-re…