Leo Du (@leoduw) Twitter Tweets • TwiCopy

Preetum Nakkiran

@preetumnakkiran

4 years ago

all theorems are correct, but some are useful

Talia Ringer 🟣 🎗️

@taliaringer

2 years ago

Except Rust. Rust is a goddamn miracle x.com/TaliaRinger/st…

Jason Eisner

Q: Does my LM leak probability onto infinite strings? A: For RNNs and PFSAs you need to test, but Transformers always generate EOS in finite time (prob=1). 🤔 First we need to formalize the question… cs.jhu.edu/~jason/papers/… #ACL2023 poster Tue 11am w/Leo Du @ryandcotterell et al

AI Coffee Break with Letitia

@aicoffeebreak

2 years ago

We summarized the #acl2023nlp Toronto conference for you with some poster recordings and author interviews! 👇 🎬 youtu.be/-Agcr0nawuk Featuring Szymon Tworkowski Jasivan Sivakumar Kundan Krishna (at ACL 2O25) Emanuele Bugliarello Leo Du Florian Mai Franz Nowak Paul Drm Moritz Plenz and Jay Alammar 👏

Leo Du

@leoduw

2 years ago

Following up a weekend effort by another weekend effort: llama2.rs 🦀 github.com/leo-du/llama2.… In a single Rust file w/
* zero dependencies (i.e. custom rng w/ PCG)
* zero lines of `unsafe` code (very 🦀!)
* support user prompts
* (almost) same performance

Dan Roy

@roydanroy

2 years ago

One of the fundamental problems with probability notation in machine learning is due to the fact that few people really have a firm grasp on conditioning from a measure theoretical perspective. Another issue: random variables versus indexed collections of probability spaces.

Distributed AI Research Institute is on Mastodon

@dairinstitute

2 years ago

Congratulations to Rylan Schaeffer, Brando Miranda, Sanmi Koyejo for winning a best paper award at NeurIPS for this insightful paper. Are Emergent Abilities of Large Language Models a Mirage? arxiv.org/abs/2304.15004

Afra Amini

@afra_amini

2 years ago

If you are interested in knowing how you can do energy-based sampling from language models, make sure to check our #NeurIPS23 paper titled “Structured Voronoi Sampling”...🧵 arxiv.org/pdf/2306.03061…

Sanmi Koyejo

@sanmikoyejo

2 years ago

"Are Emergent Abilities of Large Language Models a Mirage?" is a NeurIPS outstanding paper!🙌🏿 Congrats especially to the students Rylan Schaeffer Brando Miranda & other awardees. If you want to learn more, check out the oral & poster 👇🏿this afternoon (Dec 14) 1/2

Leo Du

@leoduw

2 years ago

A case for chasing the "low hanging fruit" in science...

Will Crichton

@tonofcrates

2 years ago

New paper out w/ Shriram Krishnamurthi (primary: Bluesky) accepted to OOPSLA'24: a psychometric analysis of programming language learning. We added ~200 quiz questions to a popular book on Rust and collected ~1,000,000 answers from ~60,000 people over 1 year. arxiv.org/abs/2401.01257

Justin T Chiu

@justintchiu

2 years ago

wrote a short note on using parallel scans for backprop: justintchiu.com/blog/pscan_dif… turns out there was already a paper on this too! arxiv.org/abs/1907.10134

Leo Du

@leoduw

2 years ago

You'd think that classes in the Borel hierarchy would be named in the same language...

Leo Du

@leoduw

2 years ago

"Let no one ignorant of geometry enter." Finally, a computer can enter Plato's academy.

Dan Roy

@roydanroy

2 years ago

So creative. arxiv.org/abs/2401.16657

Jordan Ellenberg

@jsellenberg

a year ago

I knew a lot of mathematicians were getting into formalization but I had no idea it was this much of a problem

Justin T Chiu

@justintchiu

a year ago

wrote a short blog post on the latent variable model behind STaR: justintchiu.com/blog/star/

Emily Riehl

@emilyriehl

a year ago

Dominic Verity, Mario Carneiro and I just announced a new project to formalize some aspects of ∞-category theory in #Lean via the notion of an ∞-cosmos.

Leo Du

@leoduw

9 months ago

- What do you call someone who trains LLMs on Apple Silicon in Metal instead of CUDA?
- FullMetal Alchemist

Leo Du

@leoduw

8 months ago

Love this distinction between “compiler optimization” and “programming model”. Coming back to a common example, part of Rust’s success is its programming model. Also, great talk!
