Elan Rosenfeld (@elanrosenfeld) Twitter Tweets • TwiCopy

Gate.io

5 hours ago

🔥The 9th Round of Easy Loan, Earn $40 Reward is in progress❗️ ⏰ Promotion Period: January 15th - Feburary 15th, 2025 👉 Register now and check more details at gate.io/campaigns/358

thumb_up_off_alt34

chat_bubble_outline39

repeat6

shareShare

A sequence of videos of Will Smith eating spaghetti, overlaid with the shutterstock logo. In some clips he uses a fork and in others his hands overflow with spaghetti as he shovels it into his mouth. In each clip he is wearing a different outfit. One clip has two Will Smiths.

thumb_up_off_alt19

chat_bubble_outline1

repeat2

shareShare

Francesco Orabona

@bremen79

a year ago

How you ever wondered why the KL divergence is in all the PAC-Bayes bounds? Are we sure is it the optimal choice? We now know: for sure KL is *not* the optimal one! New work with Ilja Kuzborskij, Kwang-Sung (Kwang) Jun , yulian wu, and Kyoungseok Jang x.com/StatMLPapers/s…

thumb_up_off_alt104

chat_bubble_outline1

repeat13

shareShare

Elan Rosenfeld

@elanrosenfeld

a year ago

thumb_up_off_alt4

chat_bubble_outline0

repeat0

shareShare

Stat.ML Papers

@statmlpapers

a year ago

Heavy-Tailed Class Imbalance and Why Adam Outperforms Gradient Descent on Language Models ift.tt/Pui1LcV

thumb_up_off_alt47

chat_bubble_outline0

repeat9

shareShare

Mathieu

@miniapeur

a year ago

thumb_up_off_alt2,2K

chat_bubble_outline28

repeat242

shareShare

Vaishnavh Nagarajan

@_vaishnavh

a year ago

🗣️ “Next-token predictors can’t plan!” ⚔️ “False! Every distribution is expressible as product of next-token probabilities!” 🗣️ In work w/ Gregor Bachmann , we carefully flesh out this emerging, fragmented debate & articulate a key new failure. 🔴 arxiv.org/abs/2403.06963

thumb_up_off_alt399

chat_bubble_outline14

repeat83

shareShare

Samuel Sokota

@ssokota

a year ago

SOTA AI for games like poker & Hanabi rely on search methods that don’t scale to games w/ large amounts of hidden information. In our ICLR paper, we introduce simple search methods that scale to large games & get SOTA for Hanabi w/ 100x less compute. 1/N arxiv.org/abs/2304.13138

thumb_up_off_alt329

chat_bubble_outline5

repeat52

shareShare

ML@CMU

@mlcmublog

a year ago

Imagine you're a data scientist who solves several related linear regression problems from the same application domain. Can you learn how to best use a combination of L1 and L2 regularization penalties? We show that you can! How much data is needed? blog.ml.cmu.edu/2024/04/12/how…

thumb_up_off_alt15

chat_bubble_outline0

repeat5

shareShare

Elan Rosenfeld

@elanrosenfeld

a year ago

I can't speak for other schools, but I can tell you this is definitely not the case at CMU. This seems like a surefire way to lose your school's status as a top program.

thumb_up_off_alt44

chat_bubble_outline3

repeat0

shareShare

Elan Rosenfeld

@elanrosenfeld

a year ago

Almost forgot the obligatory self-promotion... I'll be presenting this work this afternoon at ICLR, poster #148. Stop by to gain a new understanding of NN optimization!

thumb_up_off_alt41

chat_bubble_outline0

repeat5

shareShare

Chandler Squires

@chandlersquires

a year ago

If you are submitting to NeurIPS: reward yourself with a talk after! If you are not: no excuses not to attend this talk! Either way, join us this week at CARE to hear Sorawit (James) Saengkyongam speak about representation learning for extrapolation. portal.valencelabs.com/events/post/id…

thumb_up_off_alt14

chat_bubble_outline0

repeat4

shareShare

Zico Kolter

@zicokolter

a year ago

I'm thrilled to share that I will become the next Director of the Machine Learning Department at Carnegie Mellon. MLD is a true gem, a department dedicated entirely to ML. Faculty and past directors have been personal role models in my career. cs.cmu.edu/news/2024/kolt…

thumb_up_off_alt1,1K

chat_bubble_outline121

repeat79

shareShare

Aaron Roth

@aaroth

a year ago

Congrats to the best paper award winners at COLT 2024! learningtheory.org/colt2024/award… First up, The Price of Adaptivity in Stochastic Convex Optimization by Yair Carmon and Oliver Hinder:

thumb_up_off_alt157

chat_bubble_outline1

repeat16

shareShare

Clément Canonne (on Blue🦋Sky)

@ccanonne_

a year ago

"So, unimaginative theoreticians of the world, unite and pursue problems that have been studied only once." #rememberingLuca lucatrevisan.wordpress.com/2006/11/07/on-…

thumb_up_off_alt39

chat_bubble_outline1

repeat11

shareShare

Logan Kilpatrick

@officiallogank

10 months ago

Say hello to Gemini 1.5 Flash-8B ⚡️, now available for production usage with: - 50% lower price (vs 1.5 Flash) - 2x higher rate limits (vs 1.5 Flash) - lower latency on small prompts (vs 1.5 Flash) developers.googleblog.com/en/gemini-15-f…

thumb_up_off_alt1,1K

chat_bubble_outline97

repeat186

shareShare

QC

@qiaochuyuan

9 months ago

tried getting claude to write funny tweets starting from 9 examples. it generated some okay stuff but nothing that actually made me laugh. then i tried asking it to generate tweets written from its perspective rather than a human's and actually laughed. hmm

thumb_up_off_alt7,7K

chat_bubble_outline115

repeat657

shareShare

Logan Kilpatrick

@officiallogank

7 months ago

It’s still an early version, but check out how the model handles a challenging puzzle involving both visual and textual clues: (2/3)

thumb_up_off_alt948

chat_bubble_outline32

repeat51

shareShare

Elan Rosenfeld

@elanrosenfeld

a month ago

I learned a lot about LLM training dynamics during this project, led by Sara Kangaslahti. Surprisingly, we can find meaningful + interpretable breakthroughs in model capabilities which are non-obvious from just the aggregated loss. Check out the thread by Naomi Saphra for details!

thumb_up_off_alt19

chat_bubble_outline0

repeat1

shareShare

Delip Rao e/σ

@deliprao

18 days ago

Anthropic or Anthropic-sponsored safety papers

thumb_up_off_alt2,2K

chat_bubble_outline46

repeat213

shareShare

Demis Hassabis

@demishassabis

4 days ago

Official results are in - Gemini achieved gold-medal level in the International Mathematical Olympiad! 🏆 An advanced version was able to solve 5 out of 6 problems. Incredible progress - huge congrats to Thang Luong and the team! deepmind.google/discover/blog/…

thumb_up_off_alt6,6K

chat_bubble_outline199

repeat765

shareShare