Asher Trockman (@ashertrockman) 's Twitter Profile
Asher Trockman

@ashertrockman

CS PhD student at Carnegie Mellon University

ID: 4098537032

Link: http://ashertrockman.com | Joined: 02-11-2015 05:32:00

185 Tweets

657 Followers

216 Following

Jeff Dean (@jeffdean) 's Twitter Profile Photo

We've had an account for Google Research for a while, but we're going to start posting more info about the work done by Google Research here. Follow for awesome research content!

Gabriel Bianconi (@gabrielbianconi) 's Twitter Profile Photo

SuperDial was the first company to deploy TensorZero in production. Really exciting to see the progress and impact they've made over the past year+! Congratulations to Sam Schwager, Harrison Caruthers, and the SuperDial team — well deserved! 🍾

Delip Rao e/σ (@deliprao) 's Twitter Profile Photo

Gemini 2.5 Pro is the most underrated model. Extraordinary intelligence for free. I am not sure why people don’t talk about this all the time.

Shane Gu (@shaneguml) 's Twitter Profile Photo

NeurIPS workshop proposal rejected. We had AMAZING speakers, and a great list of organizers. No rebuttal phase. No feedback. No dense reward :( Since I work full-time on Gemini and have zero time to publish, this was my chance to contribute to academia 😞

Simo Ryu (@cloneofsimo) 's Twitter Profile Photo

It takes LITERALLY 2 min to set up gemini-cli and I'm telling you it's incredible. No need to set up a credit card or anything. It's also super intuitive and safe. You should definitely try it out. 100% recommend.

```
npm install -g @google/gemini-cli
gemini
```
Albert Gu (@_albertgu) 's Twitter Profile Photo

I converted one of my favorite talks I've given over the past year into a blog post.

"On the Tradeoffs of SSMs and Transformers"
(or: tokens are bullshit)

In a few days, we'll release what I believe is the next major advance for architectures.
Aya Somai (@aya_somai_) 's Twitter Profile Photo

My favorite reading of the week, by Yiding Jiang: the next era is not about learning from data but about deciding what data to learn from. yidingjiang.github.io/blog/post/expl…

Simone Scardapane (@s_scardapane) 's Twitter Profile Photo

*Antidistillation Sampling* by Yash Savani Asher Trockman Zico Kolter et al. They modify the logits of a model with a penalty term that poisons potential distillation attempts (by estimating the downstream distillation loss). arxiv.org/abs/2504.13146

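The logit-perturbation idea in the tweet can be sketched in a few lines. This is a toy illustration only, not the paper's actual estimator: `distill_grad` stands in for a hypothetical per-token proxy of how much sampling each token would reduce a student's downstream distillation loss, which the real method estimates rather than assumes.

```python
import numpy as np

def softmax(x):
    # numerically stable softmax over a 1-D logit vector
    z = x - x.max()
    e = np.exp(z)
    return e / e.sum()

def antidistillation_logits(logits, distill_grad, lam=1.0):
    """Toy sketch: subtract a penalty proportional to a (hypothetical)
    estimate of the downstream distillation loss reduction per token,
    so tokens most useful to a would-be student are sampled less often."""
    return logits - lam * distill_grad

# Tokens that would help a student most (high distill_grad) get down-weighted.
logits = np.array([2.0, 1.0, 0.0])
distill_grad = np.array([5.0, 0.0, 0.0])  # token 0 is most useful to distill
p_plain = softmax(logits)
p_poisoned = softmax(antidistillation_logits(logits, distill_grad))
```

With the penalty applied, the probability mass shifts away from token 0 while the output remains a valid distribution, so the teacher still produces fluent samples.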
Sukjun (June) Hwang (@sukjun_hwang) 's Twitter Profile Photo

Tokenization has been the final barrier to truly end-to-end language models. We developed the H-Net: a hierarchical network that replaces tokenization with a dynamic chunking process directly inside the model, automatically discovering and operating over meaningful units of data

Albert Gu (@_albertgu) 's Twitter Profile Photo

Tokenization is just a special case of "chunking" - building low-level data into high-level abstractions - which is in turn fundamental to intelligence. Our new architecture, which enables hierarchical *dynamic chunking*, is not only tokenizer-free, but simply scales better.

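The chunking idea described above can be illustrated with a toy sketch. This is not the actual H-Net: in the real architecture a learned module predicts chunk boundaries end-to-end, whereas here `boundary_probs` is simply given, and each chunk is mean-pooled into one higher-level vector.

```python
import numpy as np

def dynamic_chunk(x, boundary_probs, threshold=0.5):
    """Toy dynamic chunking: cut the sequence of vectors `x` wherever
    the predicted boundary probability crosses `threshold`, then
    mean-pool each chunk into a single higher-level representation."""
    chunks, start = [], 0
    for i, p in enumerate(boundary_probs):
        if p >= threshold:
            chunks.append(x[start:i + 1].mean(axis=0))
            start = i + 1
    if start < len(x):  # trailing chunk with no closing boundary
        chunks.append(x[start:].mean(axis=0))
    return np.stack(chunks)

# 6 byte-level embeddings get compressed into 3 chunk embeddings.
x = np.arange(12.0).reshape(6, 2)
probs = [0.1, 0.9, 0.2, 0.1, 0.8, 0.3]
out = dynamic_chunk(x, probs)
```

Because the boundaries depend on the data rather than a fixed vocabulary, the same mechanism can discover units at whatever granularity the sequence calls for.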
Nicholas Roberts (@nick11roberts) 's Twitter Profile Photo

🎉 Excited to share that our paper "Pretrained Hybrids with MAD Skills" was accepted to Conference on Language Modeling 2025! We introduce Manticore - a framework for automatically creating hybrid LMs from pretrained models without training from scratch. 🧵[1/n]

Dylan Foster 🐢 (@canondetortugas) 's Twitter Profile Photo

For those at ICML, Audrey will be presenting this paper at the 4:30 poster session this afternoon! West Exhibition Hall B2-B3 W-1009

Aditi Raghunathan (@adtraghunathan) 's Twitter Profile Photo

Huge congratulations to Vaishnavh, Chen and Charles on the outstanding paper award 🎉 We will be presenting our #ICML2025 work on creativity in the Oral 3A Reasoning session (West Exhibition Hall C) 10 - 11 am PT. Or please stop by our poster right after @ East Exhibition

Prima Mente (@primamente) 's Twitter Profile Photo

1/ Today we announce Pleiades, a series of epigenetic foundation models (90M→7B params) trained on 1.9T tokens of human methylation & genomic data. Pleiades accurately models epigenetics for genomic track prediction, generation & neurodegenerative disease detection from cfDNA,
