Ouail Kitouni (@wkitouni)'s Twitter Profile
Ouail Kitouni

@wkitouni

Member of technical staff @Anthropic. Prev: @MIT, @Meta, @MSFTResearch

ID: 1159813920988762113

Link: http://okitouni.github.io · Joined: 09-08-2019 13:09:21

110 Tweets

65 Followers

95 Following

Ouail Kitouni (@wkitouni)

Repo to reproduce Grokking in a few lines of code (Full batch GD, small MLP, modular addition): github.com/okitouni/simpl…
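
For context, a minimal sketch of the setup the tweet describes, assuming PyTorch: full-batch gradient descent with weight decay on modular addition, with a small MLP over concatenated token embeddings. The modulus, widths, learning rate, weight decay, and step count are illustrative guesses, not the repo's values; whether and when the test-accuracy jump appears depends on these choices.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

p = 97  # modulus: learn (a + b) mod p
pairs = torch.cartesian_prod(torch.arange(p), torch.arange(p))  # all (a, b)
labels = (pairs[:, 0] + pairs[:, 1]) % p

# Random 50/50 train/test split; grokking shows up as test accuracy
# jumping long after train accuracy has saturated.
perm = torch.randperm(len(pairs))
train_idx, test_idx = perm[: len(pairs) // 2], perm[len(pairs) // 2 :]

embed = nn.Embedding(p, 128)
mlp = nn.Sequential(nn.Linear(256, 256), nn.ReLU(), nn.Linear(256, p))
params = list(embed.parameters()) + list(mlp.parameters())
# Full-batch gradient descent; weight decay is what eventually pushes
# the network from the memorizing to the generalizing solution.
opt = torch.optim.SGD(params, lr=1e-2, weight_decay=1e-2)

def logits_for(idx):
    return mlp(embed(pairs[idx]).flatten(1))  # concat the two embeddings

for step in range(100_000):
    opt.zero_grad()
    loss = F.cross_entropy(logits_for(train_idx), labels[train_idx])
    loss.backward()
    opt.step()
    if step % 5_000 == 0:
        with torch.no_grad():
            acc = (logits_for(test_idx).argmax(-1)
                   == labels[test_idx]).float().mean()
        print(f"step {step}: train loss {loss.item():.4f}, test acc {acc:.3f}")
```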
Summer Yue (@summeryue0)

🚀 Introducing the SEAL Leaderboards! We rank LLMs using private datasets that can’t be gamed. Vetted experts handle the ratings, and we share our methods in detail openly! 

Check out our leaderboards at scale.com/leaderboard! 

Which evals should we build next?
Ouail Kitouni (@wkitouni)

You can just use a different model to prioritize higher-signal tokens and generalize quicker. RHO-LOSS literally just works. (The fraction is the ratio of top-k tokens kept to total tokens; a fraction of 1 is equivalent to not using RHO-LOSS.)
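
A hedged sketch of the RHO-LOSS-style selection being described, assuming PyTorch and that model and ref_model map token ids to next-token logits of shape (batch, seq_len, vocab); the function name and the default fraction of 0.5 are illustrative, not from the tweet:

```python
import torch
import torch.nn.functional as F

def rho_loss_step(model, ref_model, tokens, fraction=0.5):
    # tokens: (batch, seq_len) token ids; train next-token prediction.
    inputs, targets = tokens[:, :-1], tokens[:, 1:]

    with torch.no_grad():
        ref_loss = F.cross_entropy(
            ref_model(inputs).flatten(0, 1), targets.flatten(),
            reduction="none")

    per_tok = F.cross_entropy(
        model(inputs).flatten(0, 1), targets.flatten(), reduction="none")

    # Reducible loss: tokens the learner finds hard but the reference
    # model finds easy carry the most training signal.
    reducible = per_tok.detach() - ref_loss
    k = max(1, int(fraction * reducible.numel()))
    keep = reducible.topk(k).indices

    # fraction = 1 keeps every token, i.e. ordinary training.
    return per_tok[keep].mean()
```

Scoring uses detached losses, so the selection itself gets no gradient; backpropagating through the returned mean trains only on the kept tokens.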
Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞) (@teortaxestex)

Thesis from Ilya Sutskever: "to predict the next word, you have to predict the world"
Antithesis from Yann LeCun: "AR-LLMs suck! Reversal curse!"
Synthesis from FAIR: factorization-order-independent autoregressive model (MLM-U objective)

(a paper subtweeting the Mistral PrefixLM thing?)
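
One way to read "factorization-order-independent": an MLM-U-style objective where the masking ratio is drawn uniformly at random, so in expectation the model learns to predict any subset of tokens from any other subset rather than only left-to-right. A rough sketch under assumed names (mask_id and a bidirectional model returning per-position logits are illustrative, not from the tweet):

```python
import torch
import torch.nn.functional as F

def mlmu_loss(model, tokens, mask_id):
    # tokens: (batch, seq_len) token ids. Draw a masking ratio uniformly
    # in [0, 1]; varying the ratio is what covers all factorization orders.
    ratio = torch.rand(())
    mask = torch.rand(tokens.shape) < ratio

    inputs = tokens.clone()
    inputs[mask] = mask_id  # mask_id is an assumed special-token id

    logits = model(inputs)  # assumed: (batch, seq_len, vocab), bidirectional
    per_tok = F.cross_entropy(
        logits.flatten(0, 1), tokens.flatten(), reduction="none")

    # Only masked positions contribute, so no position order is privileged.
    m = mask.flatten().float()
    return (per_tok * m).sum() / m.sum().clamp(min=1)
```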