Delip Rao e/σ (@deliprao) Twitter Tweets • TwiCopy

Delip Rao e/σ

@deliprao

Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈

ID: 16984977

Website: https://deliprao.com | Joined: 26-10-2008 20:04:45

52,52K Tweet

57,57K Followers

5,5K Following

David Ondrej

@davidondrej1

4 months ago

models like Kimi, DeepSeek and Qwen will cost the closed AI labs BILLIONS of dollars. that's why nobody is talking about them. despite these LLMs absolutely crushing all of the benchmarks. Claude 4 Opus is literally *100x* more expensive than Kimi K2 yet both models have

Likes: 1,1K · Replies: 94 · Retweets: 117

Nimer Sultany

@nimersultany

4 months ago

Likes: 23,23K · Replies: 45 · Retweets: 3,3K

Interesting things

@awkwardgoogle

4 months ago

They learnt about hugs today 🥹

Likes: 19,19K · Replies: 145 · Retweets: 1,1K

MR. OBVIOUS

@obviousrises

4 months ago

Companies are using fake humans and AI to do interviews now...

Likes: 54,54K · Replies: 289 · Retweets: 3,3K

Delip Rao e/σ

@deliprao

4 months ago

"Roomba of coding" is such an apt analogy for all current-generation coding agent tools.

Likes: 3 · Replies: 0 · Retweets: 0

Tamay Besiroglu

@tamaybes

4 months ago

If you give your AI model a French name, it is perhaps not surprising it will be offline 20% of the year.

Likes: 285 · Replies: 10 · Retweets: 7

Delip Rao e/σ

@deliprao

4 months ago

A decade ago, Richard Socher gave the "don't be a hero" advice on choosing hyperparams for DL model training. Today's version of "don't be a hero" is picking the right LLM for the task at hand. If people you trust have said some model works for a task, use it.

Likes: 3 · Replies: 1 · Retweets: 1

Delip Rao e/σ

@deliprao

4 months ago

if you poke a (inverted) bowl-shaped jello, it will wiggle in a certain way to eventually go back to being bowl-shaped. if something behaves this way consistently, it is undoubtedly a bowl-shaped jello.

Likes: 0 · Replies: 0 · Retweets: 0

Delip Rao e/σ

@deliprao

4 months ago

person with the longest update during standups has the weakest update

Likes: 94 · Replies: 1 · Retweets: 7

Delip Rao e/σ

@deliprao

4 months ago

Super dumb take from ICML. Adding a subversive prompt to deter robot reviews is morally no different than using a fake email address to deter spammers. If authors don’t resort to subversive measures, reviewers will have no incentive not to use LLMs for reviewing.

Likes: 12 · Replies: 4 · Retweets: 0

Delip Rao e/σ

@deliprao

4 months ago

There will be no other terminal UX agent better than this. It’s like Michelangelo sculpting your garden sculpture. Go kiss the toad!

Likes: 5 · Replies: 1 · Retweets: 0

Delip Rao e/σ

@deliprao

4 months ago

Likes: 6 · Replies: 0 · Retweets: 1

Delip Rao e/σ

@deliprao

4 months ago

models are learning from mistakes and, increasingly, from doing, while human learning is increasingly "by theory" (consuming model outputs). something to ponder.

Likes: 13 · Replies: 1 · Retweets: 4

Delip Rao e/σ

@deliprao

4 months ago

An important caveat is they provide problem-specific hints like “use combinatorics for this problem”.

Likes: 5 · Replies: 2 · Retweets: 1

Joseph Carlson

@joecarlsonshow

4 months ago

I present, Google's dying search business:

Likes: 5,5K · Replies: 187 · Retweets: 236

Delip Rao e/σ

@deliprao

4 months ago

Engineering faculty will not admit, but this is more or less true in computer science programs at most schools. Most courses today are creating ‘busy work’ and evaluating students on that, in exchange for reputation signals. Academics will not acknowledge this as it would require

Likes: 68 · Replies: 2 · Retweets: 3

Delip Rao e/σ

@deliprao

4 months ago

Two wrongs don't make a right, and the US (the most powerful military in the world) losing its moral grounds is worse for all of humanity.

Likes: 3 · Replies: 1 · Retweets: 0

Georgi Gerganov

@ggerganov

4 months ago

AMD teams contributing to the llama.cpp codebase. Great support from the community with the review process. Exciting to see this open-source collaboration!

Likes: 470 · Replies: 8 · Retweets: 38
