Avi Schwarzschild (@a_v_i__s) 's Twitter Profile
Avi Schwarzschild

@a_v_i__s

Postdoc at CMU. Trying to learn about deep learning faster than deep learning can learn about me.

ID: 1308460181999714304

Link: http://avischwarzschild.com · Joined: 22-09-2020 17:37:55

141 Tweets

513 Followers

229 Following

Zico Kolter (@zicokolter) 's Twitter Profile Photo

Excited about this work with Asher Trockman Yash Savani (and others) on antidistillation sampling. It uses a nifty trick to efficiently generate samples that make student models _worse_ when you train on them. I spoke about it at Simons this past week. Links below.

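The tweet only hints at the mechanism. A minimal sketch of the idea, under my own assumptions (the penalty weight `lam` and the per-token `distill_score` are hypothetical stand-ins for the paper's actual distillability estimate): down-weight next-token logits in proportion to how much each token would help a proxy student, then sample from the adjusted distribution.

```python
import numpy as np

def softmax(x):
    """Numerically stable softmax."""
    e = np.exp(x - x.max())
    return e / e.sum()

def antidistill_sample(teacher_logits, distill_score, lam=1.0, rng=None):
    """Sketch of antidistillation-style sampling (names are assumptions).

    teacher_logits: next-token logits from the teacher model.
    distill_score:  per-token estimate of how much training on that token
                    would improve a proxy student (higher = more helpful).
    lam:            penalty strength; lam=0 recovers ordinary sampling.
    """
    rng = rng if rng is not None else np.random.default_rng(0)
    # Penalize tokens that would most benefit a distilling student.
    adjusted = teacher_logits - lam * distill_score
    probs = softmax(adjusted)
    token = rng.choice(len(probs), p=probs)
    return token, probs
```

With `lam=0` this reduces to standard temperature-1 sampling from the teacher, so the knob trades off generation quality against distillability.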
Alex Robey (@alexrobey23) 's Twitter Profile Photo

A few days ago, we dropped 𝗮𝗻𝘁𝗶𝗱𝗶𝘀𝘁𝗶𝗹𝗹𝗮𝘁𝗶𝗼𝗻 𝘀𝗮𝗺𝗽𝗹𝗶𝗻𝗴 🚀

. . . and we've gotten a little bit of pushback.

But whether you're at a frontier lab or developing smaller, open-source models, this research should be on your radar. Here's why 🧵
Ashwinee Panda (@pandaashwinee) 's Twitter Profile Photo

thrilled to receive the outstanding paper award for our work on shallow alignment! i’ll be giving the talk at 10:42am tomorrow (Thursday) in oral session 1D. the poster will be Friday 3PM.

Marc Finzi (@m_finzi) 's Twitter Profile Photo

Why do larger language models generalize better? In our new ICLR paper, we derive an interpretable generalization bound showing that compute-optimal LLMs provably generalize better with scale! 📄arxiv.org/abs/2504.15208 1/7🧵

Yutong (Kelly) He (@electronickale) 's Twitter Profile Photo

✨ Love 4o-style image generation but prefer to use Midjourney? Tired of manual prompt crafting from inspo images? PRISM to the rescue! 🖼️→📝→🖼️ We automate black-box prompt engineering—no training, no embeddings, just accurate, readable prompts from your inspo images! 1/🧵
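The tweet describes PRISM as black-box prompt engineering from inspiration images. A toy sketch of that loop under my own assumptions (the callables `propose_prompt`, `generate_image`, and `similarity` are hypothetical placeholders, not PRISM's actual components): iteratively propose a candidate prompt, render it, score it against the inspo image, and keep the best.

```python
def prism_search(inspo_image, propose_prompt, generate_image, similarity,
                 n_iters=5):
    """Hypothetical sketch of a PRISM-style black-box prompt search.

    propose_prompt(inspo_image, feedback) -> candidate prompt string
    generate_image(prompt)                -> rendered image
    similarity(generated, inspo_image)    -> higher is better
    """
    best_prompt, best_score = None, float("-inf")
    feedback = None
    for _ in range(n_iters):
        prompt = propose_prompt(inspo_image, feedback)
        score = similarity(generate_image(prompt), inspo_image)
        if score > best_score:
            best_prompt, best_score = prompt, score
        # Feed the last attempt back so the proposer can refine it.
        feedback = (prompt, score)
    return best_prompt, best_score
```

Because everything is behind callables, the same loop works whether the generator is Midjourney, a diffusion model, or a stub in tests.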

Zhili Feng (@zhilifeng) 's Twitter Profile Photo

I'm very excited to talk about compression-based memorization with Pratyush Maini this Friday at the OpenAI Security Research Conference! Let's chat about compression, memorization, and also our new antidistillation sampling antidistillation.com!

Pratyush Maini (@pratyushmaini) 's Twitter Profile Photo

Looking forward to giving a talk this Friday at OpenAI with Zhili Feng on some of our privacy & memorization research + how it applies to production LLMs! We've been gaining momentum on detecting, quantifying & erasing memorization; excited to explore its real-world impact!

Niloofar (on faculty job market!) (@niloofar_mire) 's Twitter Profile Photo

📣Thrilled to announce I’ll join Carnegie Mellon University (CMU Engineering & Public Policy & Language Technologies Institute | @CarnegieMellon) as an Assistant Professor starting Fall 2026! Until then, I’ll be a Research Scientist at AI at Meta FAIR in SF, working with Kamalika Chaudhuri’s amazing team on privacy, security, and reasoning in LLMs!

Dimitris Papailiopoulos (@dimitrispapail) 's Twitter Profile Photo

I find it interesting that people who believe LLMs/autoregressive models are a dead end base their arguments either on philosophical hypotheses that are hard to test or rebut, or on micro failures (e.g., 9.11 vs 9.9) used to predict paradigm-level macro failures. All the while the

Tanishq Mathew Abraham, Ph.D. (@iscienceluvr) 's Twitter Profile Photo

Mean Flows for One-step Generative Modeling

"We introduce the notion of average velocity to characterize flow fields, in contrast to instantaneous velocity modeled by Flow Matching methods. A well-defined identity between average and instantaneous velocities is derived and

Zhengyang Geng (@zhengyanggeng) 's Twitter Profile Photo

Excited to share our work with my amazing collaborators, Goodeat, Xingjian Bai, Zico Kolter, and Kaiming. In a word, we show an “identity learning” approach for generative modeling, by relating the instantaneous/average velocity in an identity. The resulting model,

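As I read the abstract quoted above, the "identity" relating the two velocities can be written as follows: the average velocity over an interval is the integral of the instantaneous velocity, and differentiating that definition in t yields a self-consistency condition the model can be trained on.

```latex
% Average velocity over [r, t], defined from the instantaneous velocity v:
u(z_t, r, t) \;=\; \frac{1}{t - r} \int_r^t v(z_\tau, \tau)\, d\tau
% Differentiating (t - r)\,u(z_t, r, t) with respect to t gives the identity:
u(z_t, r, t) \;=\; v(z_t, t) \;-\; (t - r)\,\frac{d}{dt}\, u(z_t, r, t)
```

Setting r = t recovers u = v, while evaluating u over the full interval gives a one-step map, which is the one-step generation the thread describes.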
Eitan Borgnia (@eborgnia) 's Twitter Profile Photo

We're now merging code edits at 4300 tok/s, over 2x faster than the Llama 70b deployment on Cerebras. docs.relace.ai/docs/instant-a…

Ruchit Rawal (@rawalruchit) 's Twitter Profile Photo

Introducing ARGUS 👁️ A benchmark for measuring hallucinations and omissions in free-form captions generated by Video-LLMs.
