Delip Rao e/σ (@deliprao) 's Twitter Profile
Delip Rao e/σ

@deliprao

Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈

ID: 16984977

linkhttps://deliprao.com calendar_today26-10-2008 20:04:45

52,52K Tweet

57,57K Followers

5,5K Following

David Ondrej (@davidondrej1) 's Twitter Profile Photo

models like Kimi, DeepSeek and Qwen will cost the closed AI labs BILLIONS of dollars. that's why nobody is talking about them. despite these LLMs absolutely crushing all of the benchmarks. Claude 4 Opus is literally *100x* more expensive than Kimi K2 yet both models have

Delip Rao e/σ (@deliprao) 's Twitter Profile Photo

A decade ago, Richard Socher gave the "don't be a hero" advice on choosing hyperparams for DL model training. Today's version of "don't be a hero" is picking the right LLM for the task at hand. If people you trust have said some model works for a task, use it.

Delip Rao e/σ (@deliprao) 's Twitter Profile Photo

if you poke a (inverted) bowl-shaped jello, it will wiggle in a certain way to eventually go back to being bowl-shaped. if something behaves this way consistently, it is undoubtedly a bowl-shaped jello.

Delip Rao e/σ (@deliprao) 's Twitter Profile Photo

Super dumb take from ICML. Adding a subversive prompt to deter robot reviews is morally no different than using a fake email address to deter spammers. If authors don’t resort to subversive measures, reviewers will have no incentive not to use LLMs for reviewing.

Super dumb take from ICML. Adding a subversive prompt to deter robot reviews is morally no different than using a fake email address to deter spammers. If authors don’t resort to subversive measures, reviewers will have no incentive not to use LLMs for reviewing.
Delip Rao e/σ (@deliprao) 's Twitter Profile Photo

There will be no other terminal UX agent better than this. It’s like Michelangelo sculpting your garden sculpture. Go kiss the toad!

Delip Rao e/σ (@deliprao) 's Twitter Profile Photo

models are learning from mistakes and, increasingly, from doing, while human learning is increasingly "by theory" (consuming model outputs). something to ponder.

Delip Rao e/σ (@deliprao) 's Twitter Profile Photo

Engineering faculty will not admit, but this is more or less true in computer science programs at most schools. Most courses today are creating ‘busy work’ and evaluating students on that, in exchange for reputation signals. Academics will not acknowledge this as it would require

Engineering faculty will not admit, but this is more or less true in computer science programs at most schools. Most courses today are creating ‘busy work’ and evaluating students on that, in exchange for reputation signals. Academics will not acknowledge this as it would require
Delip Rao e/σ (@deliprao) 's Twitter Profile Photo

Two wrongs don't make a right, and the US (the most powerful military in the world) losing its moral grounds is worse for all of humanity.

Georgi Gerganov (@ggerganov) 's Twitter Profile Photo

AMD teams contributing to the llama.cpp codebase. Great support from the community with the review process. Exciting to see this open-source collaboration!