Jacob Austin (@jacobaustin132) Twitter Tweets • TwiCopy

Some awesome stuff here about LLM scaling (esp. on GPUs). Their LLAMA sharding/memory diagram is great. Glad to see it becoming easier to understand scaling in the open

The White House

@whitehouse

6 months ago

"CONGESTION PRICING IS DEAD. Manhattan, and all of New York, is SAVED. LONG LIVE THE KING!" –President Donald J. Trump

Miles Brundage

@miles_brundage

6 months ago

Sycophancy is not just a problem with AI models

rdyro

@rdyro128523

5 months ago

Deepseek R1 inference in pure JAX! Currently on TPU, with GPU and distilled models in-progress. Features MLA-style attention, expert/tensor parallelism & int8 quantization. Contributions welcome!

It's bizarre when relatively techno-utopian people are asked about how to solve declining fertility and instead of talking about artificial wombs, extended fertility spans, AI-assisted childcare, UBI, etc. they're suddenly like "well we just need to return to the 50s".

Jacob Austin

@jacobaustin132

4 months ago

Anyone done the Whitney mountaineers route or the east buttress? Planning to do one or the other in a few weeks but wanted to talk to someone who's done it

Transluce

@transluceai

4 months ago

To interpret AI benchmarks, we need to look at the data. Top-level numbers don't mean what you think: there may be broken tasks, unexpected behaviors, or near-misses. We're introducing Docent to accelerate analysis of AI agent transcripts. It can spot surprises in seconds. 🧵👇

lmarena.ai (formerly lmsys.org)

@lmarena_ai

4 months ago

BREAKING: Gemini 2.5 Pro is now #1 on the Arena leaderboard - the largest score jump ever (+40 pts vs Grok-3/GPT-4.5)! 🏆 Tested under codename "nebula"🌌, Gemini 2.5 Pro ranked #1🥇 across ALL categories and UNIQUELY #1 in Math, Creative Writing, Instruction Following, Longer

Jacob Austin

@jacobaustin132

4 months ago

Most exciting news of the year so far!

Miles Brundage

@miles_brundage

4 months ago

🫠

Jacob Austin

@jacobaustin132

4 months ago

I'm glad to see at least one university has the slightest semblance of principle left.

The New York Times

@nytimes

4 months ago

Breaking News: A Columbia student activist, a legal permanent resident, was arrested by ICE at a meeting he thought was a step to becoming a U.S. citizen. nyti.ms/4j9w7JR

Jacob Austin

@jacobaustin132

2 months ago

An interesting phenomenon: because LLM API providers are generally not able to log or view customer traffic, jailbreaks can in theory exist undetected unless the API customer has sufficient monitoring in place

Jacob Austin

@jacobaustin132

2 months ago

These talks are awesome and all on YouTube!

koray kavukcuoglu

@koraykv

15 days ago

Advanced version of Gemini Deep Think (announced at #GoogleIO) using parallel inference time computation achieved gold-medal performance at IMO, solving 5/6 problems with rigorous proofs as verified by official IMO judges! Congrats to all involved! deepmind.google/discover/blog/…

Rohan Pandey

@khoomeik

15 days ago

it’s so over

Jacob Austin

@jacobaustin132

9 days ago

I just stumbled across this awesome book, which covers a lot of the nitty gritty details of GPU hardware, SLURM, cloud providers, and LLM training/serving. Probably the most practical guide to the infrastructure of LLM scaling I've seen
