Moin Nadeem (@moinnadeem) 's Twitter Profile
Moin Nadeem

@moinnadeem

Co-Founder at @Phonic_Co. Previously @Stanford CS PhD Dropout, @MosaicML, CS @MIT. I tend to be wrong, but the learning process makes it enjoyable. 🇵🇰🇺🇲

ID: 82497649

linkhttps://phonic.co calendar_today15-10-2009 00:35:56

17,17K Tweet

2,2K Followers

1,1K Following

Charles 🎉 Frye (@charles_irl) 's Twitter Profile Photo

We've run thousands of LLM inference serving benchmarks at Modal. We're releasing the results so you don't have to. We're releasing the code so that you can. Introducing: The LLM Engineer's Almanac. Just in time for the AI Engineer World's Fair.

jack morris (@jxmnop) 's Twitter Profile Photo

new paper from our work at Meta! **GPT-style language models memorize 3.6 bits per param** we compute capacity by measuring total bits memorized, using some theory from Shannon (1953) shockingly, the memorization-datasize curves look like this: ___________ / / (🧵)

new paper from our work at Meta!

**GPT-style language models memorize 3.6 bits per param**

we compute capacity by measuring total bits memorized, using some theory from Shannon (1953)

shockingly, the memorization-datasize curves look like this:
      ___________
  /
/

(🧵)
Moin Nadeem (@moinnadeem) 's Twitter Profile Photo

Ah yes, the radical left-wing agenda of… teaching kids to count with a vampire, respect others with a giant yellow bird, and eat cookies with reckless abandon ☺️ Truly, Big Bird is the most dangerous socialist of our time

Ah yes, the radical left-wing agenda of… teaching kids to count with a vampire, respect others with a giant yellow bird, and eat cookies with reckless abandon ☺️

Truly, Big Bird is the most dangerous socialist of our time
Elias Torres (@eliast) 's Twitter Profile Photo

We work weekends so our team doesn't have to figure it out on Monday. 3 of us grinding Saturday/Sunday = 15 people starting Monday with clarity. We ping each other, find windows that work, respect everyone's time. But we ship. This is how small teams beat giants.

Moin Nadeem (@moinnadeem) 's Twitter Profile Photo

Using Claude Code for GH commit messages has been one of the surprisingly largest programming QoL improvements in a while.