Jacob Austin (@jacobaustin132)'s Twitter Profile
Jacob Austin

@jacobaustin132

Research at @GoogleDeepMind. Currently making LLMs go fast. I also play piano and climb. NYC. Opinions my own

ID: 842566566394941445

Link: http://www.jacobaustin.org
Joined: 17-03-2017 02:41:38

506 Tweets

5.5K Followers

889 Following

Jacob Austin (@jacobaustin132)

Some awesome stuff here about LLM scaling (esp. on GPUs). Their LLAMA sharding/memory diagram is great. Glad to see it becoming easier to understand scaling in the open

rdyro (@rdyro128523)

Deepseek R1 inference in pure JAX! Currently on TPU, with GPU and distilled models in-progress. Features MLA-style attention, expert/tensor parallelism & int8 quantization. Contributions welcome!

Amanda Askell (@amandaaskell)

It's bizarre when relatively techno-utopian people are asked about how to solve declining fertility and instead of talking about artificial wombs, extended fertility spans, AI-assisted childcare, UBI, etc. they're suddenly like "well we just need to return to the 50s".

Jacob Austin (@jacobaustin132)

Anyone done the Whitney Mountaineer's Route or the East Buttress? Planning to do one or the other in a few weeks but wanted to talk to someone who's done it

Transluce (@transluceai)

To interpret AI benchmarks, we need to look at the data. Top-level numbers don't mean what you think: there may be broken tasks, unexpected behaviors, or near-misses. We're introducing Docent to accelerate analysis of AI agent transcripts. It can spot surprises in seconds. 🧵👇

lmarena.ai (formerly lmsys.org) (@lmarena_ai)

BREAKING: Gemini 2.5 Pro is now #1 on the Arena leaderboard - the largest score jump ever (+40 pts vs Grok-3/GPT-4.5)! 🏆 Tested under codename "nebula"🌌, Gemini 2.5 Pro ranked #1🥇 across ALL categories and UNIQUELY #1 in Math, Creative Writing, Instruction Following, Longer

The New York Times (@nytimes)

Breaking News: A Columbia student activist, a legal permanent resident, was arrested by ICE at a meeting he thought was a step to becoming a U.S. citizen. nyti.ms/4j9w7JR

Jacob Austin (@jacobaustin132)

An interesting phenomenon: because LLM API providers are generally not able to log or view customer traffic, jailbreaks can in theory exist undetected unless the API customer has sufficient monitoring in place

koray kavukcuoglu (@koraykv)

Advanced version of Gemini Deep Think (announced at #GoogleIO) using parallel inference time computation achieved gold-medal performance at IMO, solving 5/6 problems with rigorous proofs as verified by official IMO judges! Congrats to all involved! deepmind.google/discover/blog/…

Jacob Austin (@jacobaustin132)

I just stumbled across this awesome book, which covers a lot of the nitty gritty details of GPU hardware, SLURM, cloud providers, and LLM training/serving. Probably the most practical guide to the infrastructure of LLM scaling I've seen