Yash Savani (@yashsavani_)'s Twitter Profile
Yash Savani

@yashsavani_

PhD student @CSDatCMU with Zico Kolter | prev research scientist @abacusai, ml eng @primer_ai | prev prev CS+Stats @Stanford @StanfordAILab

ID: 162099298

Website: https://www.yashsavani.com | Joined: 02-07-2010 18:08:25

45 Tweets

262 Followers

679 Following

Dmytro Mishkin 🇺🇦 (@ducha_aiki)'s Twitter Profile Photo

Deep Equilibrium Optical Flow Estimation Shaojie Bai, Zhengyang Geng, Yash Savani, Zico Kolter tl;dr: DEQs ("infinite depth aka single layer", arxiv.org/abs/1909.01377) look like a natural fit for optical flow estimation. arxiv.org/abs/2204.08442 github.com/locuslab/deq-f…

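The "infinite depth aka single layer" view replaces a stack of identical layers with a single fixed-point solve. A minimal PyTorch sketch of that idea (not the DEQ-Flow code; the cell `f`, the naive solver, and the tolerances are illustrative assumptions):

```python
import torch
import torch.nn as nn

class DEQLayer(nn.Module):
    """Toy deep equilibrium layer: solve z* = f(z*, x) instead of
    stacking L copies of f (the "infinite depth, single layer" view)."""

    def __init__(self, dim, max_iter=50, tol=1e-4):
        super().__init__()
        self.f = nn.Sequential(nn.Linear(2 * dim, dim), nn.Tanh())
        self.max_iter = max_iter
        self.tol = tol

    def forward(self, x):
        z = torch.zeros_like(x)
        # Plain fixed-point iteration; real DEQs use Anderson/Broyden solvers.
        for _ in range(self.max_iter):
            z_next = self.f(torch.cat([z, x], dim=-1))
            if (z_next - z).norm() < self.tol:
                return z_next
            z = z_next
        return z

x = torch.randn(8, 32)
layer = DEQLayer(32)
print(layer(x).shape)  # torch.Size([8, 32])
```

Real DEQs backprop through the fixed point via the implicit function theorem, so memory stays constant in "depth"; in this sketch gradients simply flow through the unrolled loop.
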
Zhengyang Geng (@zhengyanggeng)'s Twitter Profile Photo

Happy to share our latest DEQ work with Shaojie Bai, Yash Savani, and Zico Kolter! DEQ flow now sets SOTA zero-shot generalization performance on KITTI-15, with over 20% error reduction and super strong efficiency! Paper and Code available at paperswithcode.com/paper/deep-equ…. #CVPR2022

Zico Kolter (@zicokolter)'s Twitter Profile Photo

Excited about this work with Asher Trockman and Yash Savani (and others) on antidistillation sampling. It uses a nifty trick to efficiently generate samples that make student models _worse_ when you train on them. I spoke about it at Simons this past week. Links below.

Jeremy Cohen (@deepcohen)'s Twitter Profile Photo

I’ll be at ICLR next week presenting this paper co-written with Alex Damian. Would love to meet and chat about optimization in deep learning! My DMs are open - please reach out via DM or email. openreview.net/forum?id=sIE2r…

YixuanEvenXu (@yixuanevenxu)'s Twitter Profile Photo

✨ Did you know that NOT using all generated rollouts in GRPO can boost your reasoning LLM? Meet PODS! We down-sample rollouts and train on just a fraction, delivering notable gains over vanilla GRPO. (1/7)

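A rough sketch of the down-sampling step described above, assuming the selection rule keeps the highest- and lowest-reward rollouts (a max-variance heuristic); the function names and random rewards are illustrative, not the PODS code:

```python
import numpy as np

def downsample_rollouts(rewards, m):
    """Pick m of the n generated rollouts before the GRPO update.
    Sketch of max-variance selection: keep the m//2 lowest- and
    highest-reward rollouts, preserving the contrast that the
    group-normalized advantages rely on."""
    order = np.argsort(rewards)
    return np.concatenate([order[: m // 2], order[-(m - m // 2):]])

def grpo_advantages(rewards):
    """Group-relative advantages: z-score rewards within the group."""
    r = np.asarray(rewards, dtype=float)
    return (r - r.mean()) / (r.std() + 1e-8)

rewards = np.random.rand(16)          # stand-in for per-rollout rewards
idx = downsample_rollouts(rewards, m=4)
adv = grpo_advantages(rewards[idx])   # advantages for the kept subset only
print(idx, adv)
```

Keeping the extremes preserves the reward spread inside the group, which is one way training on just a fraction of the rollouts can still beat vanilla GRPO.
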
Simone Scardapane (@s_scardapane)'s Twitter Profile Photo

*Antidistillation Sampling* by Yash Savani, Asher Trockman, Zico Kolter, et al. They modify the logits of a model with a penalty term that poisons potential distillation attempts (by estimating the downstream distillation loss). arxiv.org/abs/2504.13146

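A minimal sketch of the sampling rule as described: perturb the teacher's next-token logits with a penalty before sampling, so the emitted traces poison downstream distillation. The penalty here is a random placeholder; in the paper it comes from an efficient estimate of the downstream distillation loss:

```python
import torch

def antidistillation_sample(teacher_logits, penalty, lam=1.0):
    """Sample from penalized logits. penalty[v] is assumed to estimate
    how much emitting token v would *help* a student distilling from
    these traces; lam trades teacher quality against poisoning strength."""
    adjusted = teacher_logits - lam * penalty
    probs = torch.softmax(adjusted, dim=-1)
    return torch.multinomial(probs, num_samples=1)

vocab = 50_000
teacher_logits = torch.randn(vocab)
penalty = torch.randn(vocab)  # placeholder for the distillability estimate
token = antidistillation_sample(teacher_logits, penalty, lam=0.5)
print(token.item())
```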