tl;dr submit a training algorithm* that is faster** than Adam*** and win $10,000 💸🚀
*a set of hparams, self-tuning algorithm, and/or update rule
**see rules for how we measure speed
***beat all submissions; currently the best is NAdamW in wall-clock time and DistShampoo in steps
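To make the footnotes concrete, here is a loose Python sketch of the shape a submission takes: a set of hyperparameters plus an update rule, optionally with logic that tunes itself during training. The function names and signatures below (`init_optimizer_state`, `update_params`, `HPARAMS`) are simplified illustrations in the spirit of the AlgoPerf submission interface, not the exact API; see the official rules and repository for the real one.

```python
# Simplified sketch of the shape of a submission: a set of hparams plus an
# update rule. Signatures are approximate, not the exact AlgoPerf API.
import torch

HPARAMS = {"learning_rate": 1e-3, "beta1": 0.9, "beta2": 0.999,
           "weight_decay": 1e-2, "warmup_steps": 1000}  # the "set of hparams"

def init_optimizer_state(model_params, hparams):
    # Any state the update rule needs (here: a stock AdamW instance).
    return torch.optim.AdamW(model_params,
                             lr=hparams["learning_rate"],
                             betas=(hparams["beta1"], hparams["beta2"]),
                             weight_decay=hparams["weight_decay"])

def update_params(optimizer_state, model, batch, loss_fn, step, hparams):
    # One training step: this is the "update rule" the benchmark times.
    optimizer_state.zero_grad()
    loss = loss_fn(model(batch["inputs"]), batch["targets"])
    loss.backward()
    # A self-tuning submission could adapt its hparams here, e.g. a warmup schedule.
    for group in optimizer_state.param_groups:
        group["lr"] = hparams["learning_rate"] * min(1.0, (step + 1) / hparams["warmup_steps"])
    optimizer_state.step()
    return optimizer_state
```

Roughly speaking, a "self-tuning" submission is one where everything in HPARAMS is fixed or adapted internally, so no external hyperparameter search is run on its behalf.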
MLCommons #AlgoPerf results are in! 🏁
$50K prize competition yielded 28% faster neural net training with non-diagonal preconditioning beating Nesterov Adam. New SOTA for hyperparameter-free algorithms too! Full details in our blog. mlcommons.org/2024/08/mlc-al…
#AIOptimization #AI
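For readers wondering what "non-diagonal preconditioning" refers to: Adam-family optimizers (including NAdamW / Nesterov Adam) rescale each parameter coordinate independently, i.e. they apply a diagonal preconditioner, while Shampoo-family methods precondition whole weight matrices using row and column statistics. The toy NumPy sketch below contrasts the two updates for a single 2-D weight; it is an illustrative simplification, not the winning submission's code.

```python
# Toy sketch: Adam-style diagonal preconditioning vs. Shampoo-style
# non-diagonal (two-sided matrix) preconditioning for a 2-D gradient G.
import numpy as np

def diagonal_precondition(G, v, beta2=0.999, eps=1e-8):
    # Adam-style: running average of squared gradients per coordinate,
    # then rescale each coordinate independently (a diagonal preconditioner).
    v = beta2 * v + (1 - beta2) * G**2
    return G / (np.sqrt(v) + eps), v

def shampoo_precondition(G, L, R, eps=1e-4):
    # Shampoo-style: accumulate row and column statistics, then apply
    # inverse matrix roots on both sides (a non-diagonal preconditioner).
    L = L + G @ G.T          # left (row) statistics
    R = R + G.T @ G          # right (column) statistics
    def inv_quarter_root(M):
        # M^(-1/4) via eigendecomposition (fine for a toy example)
        w, Q = np.linalg.eigh(M + eps * np.eye(M.shape[0]))
        return Q @ np.diag(np.clip(w, eps, None) ** -0.25) @ Q.T
    return inv_quarter_root(L) @ G @ inv_quarter_root(R), L, R
```

For an m-by-n weight, the diagonal state v has shape (m, n) while L and R are m-by-m and n-by-n; practical Shampoo-style implementations recompute the matrix roots only every so many steps and factor the preconditioner per layer, which is what keeps the non-diagonal approach affordable at scale.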
Congratulations to everyone who submitted to the MLCommons AlgoPerf training algorithms competition! We were delighted to provide compute resources for evaluating so many exciting submissions.
Hi there! This account will post updates on the AlgoPerf benchmark and its leaderboard, which track faster neural network training via better training algorithms. But let's start with what AlgoPerf is, what we have done so far, and how you can train neural nets ~30% faster.
Lecture 11: benchmarking optimizers
1. the problem: comparing optimizers (sgd, adam, etc.) in deep learning is tricky.
2. challenge 1: defining "speed". curves cross, so use time-to-result.
3. challenge 2: the hyperparameter tuning trap. does the tuning protocol matter more than the algorithm? (choi et al.)