Gavin Uberti (@ubertigavin) 's Twitter Profile
Gavin Uberti

@ubertigavin

Building model-specific AI chips @ Etched

ID: 1499947604037283841

linkhttps://etched.com/ calendar_today05-03-2022 03:19:34

128 Tweet

3,3K Followers

206 Following

Gavin Uberti (@ubertigavin) 's Twitter Profile Photo

It’s only reasoning if it’s from the Reasonique region of the human brain. Otherwise it’s just a sparkling stochastic parrot.

Gavin Uberti (@ubertigavin) 's Twitter Profile Photo

We’re proud to launch Oasis with Decart, a video diffusion transformer that runs in real time. It’s a 500M param model that runs in real time on H100s, but our upcoming Sohu ASIC will be able to run 100B+ param models in real time.

Gavin Uberti (@ubertigavin) 's Twitter Profile Photo

For intelligence, FLOPs are all you need. So I'm excited to announce our Inference Time Compute Hackathon with @cognition, Mercor, CoreWeave, and Anthropic. When exaFLOPs are too cheap to meter, what will we build?

Gavin Uberti (@ubertigavin) 's Twitter Profile Photo

Using more FLOPs should make transformers smarter. DeepSeek R1 currently uses ~8 routed experts per token. So would selecting more (and possibly scaling by the router) improve performance? Come test it out at the Inference Time Compute Hackathon and win up to $60k in prizes!

Brendan (can/do) (@brendanfoody) 's Twitter Profile Photo

Mercor (Mercor) scaled from $1-500M in revenue run rate in the last 17 months, making us the fastest growing company of all time. Our growth is accelerating. We averaged 11% week over week growth in July, 18% WoW growth in August, and 19% WoW growth in September. One trend