Dipam Chakraborty (@__dipam__)'s Twitter Profile
Dipam Chakraborty

@__dipam__

Machine Learning Engineer at @h2oai 🤖

Tweets about topics that interest me in Deep Learning/Reinforcement Learning

ID: 1199391111761915904

Link: http://dipam.in · Joined: 26-11-2019 18:15:06

1.1K Tweets

196 Followers

690 Following

Philipp Singer (@ph_singer)'s Twitter Profile Photo

Whenever you are contemplating participating in Kaggle competitions and have heard someone say it is too far removed from practical data science work, consider this example: in the recent Science LLM competition, participants learned, among many other things: - How to

Dipam Chakraborty (@__dipam__)'s Twitter Profile Photo

Yep, gradient checkpointing is probably the most underrated memory-saving trick that just works. It should definitely be well known by all GPU-poor folks.
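As a sketch of what the tweet refers to: in PyTorch, gradient checkpointing discards intermediate activations inside a checkpointed segment during the forward pass and recomputes them on the fly during backward, trading extra compute for a large cut in activation memory. The model, dimensions, and names below are made up purely for illustration; the only real API assumed is `torch.utils.checkpoint.checkpoint`.

```python
# Minimal gradient-checkpointing sketch (assumes PyTorch is installed).
# Activations inside each checkpointed block are not cached for backward;
# they are recomputed when gradients are needed.
import torch
import torch.nn as nn
from torch.utils.checkpoint import checkpoint

class Block(nn.Module):
    """A toy MLP block; stands in for e.g. a transformer layer."""
    def __init__(self, dim):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(dim, dim), nn.ReLU(), nn.Linear(dim, dim))

    def forward(self, x):
        return self.net(x)

class CheckpointedMLP(nn.Module):
    def __init__(self, dim=64, depth=4):
        super().__init__()
        self.blocks = nn.ModuleList(Block(dim) for _ in range(depth))

    def forward(self, x):
        for block in self.blocks:
            # Each block's intermediate activations are dropped after the
            # forward pass and recomputed during backward.
            x = checkpoint(block, x, use_reentrant=False)
        return x

model = CheckpointedMLP()
x = torch.randn(8, 64, requires_grad=True)
loss = model(x).sum()
loss.backward()  # triggers one extra forward pass per checkpointed block
```

The gradients are identical to a plain forward/backward; only the memory/compute trade-off changes, which is why the trick "just works".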

Sergey Levine (@svlevine)'s Twitter Profile Photo

Brand new CS285: Deep RL lecture on RL for sequence models and language models: youtu.be/egJgDbe5oaM?fe… Figured it was about time to add this lecture to the course🙂

Dipam Chakraborty (@__dipam__)'s Twitter Profile Photo

While I love LlamaIndex, I'm bearish on architecture-based RAG in the long term; this feels like brittle pre-ML over-engineering that will soon be replaced by one architecture to rule them all.

Linas Beliūnas (@linasbeliunas)'s Twitter Profile Photo

Sergey Brin is worth $105 billion yet he was a core contributor on the Gemini AI technical paper, coding basically every day. Legend.

Andrej Karpathy (@karpathy)'s Twitter Profile Photo

Idea: safeLinux. All the same programs you know and love but now upgraded with safety to stop bad actors right in their tracks.
$ ls
I'm sorry, I cannot list the files in this directory because one or more files may contain unsafe content. Can I help you with anything else? 😅

John Carmack (@id_aa_carmack)'s Twitter Profile Photo

Every solar panel is like a giant pixel in a camera sensor, turning photons into a voltage. With billions of them across the world, it is interesting to imagine what they could collectively “see” of the sun and atmosphere, even though they have no focusing machinery. Conversely,

Dipam Chakraborty (@__dipam__)'s Twitter Profile Photo

I'm a 100% EV proponent, but a friend pointed out an interesting paradox in this price dynamic: batteries get cheaper, so EV resale values drop, whereas ICE cars are getting more expensive, so they hold their resale value. Weird market dynamics.

Yi Tay (@yitayml)'s Twitter Profile Photo

not true, especially for language. if you trained a large & deep MLP language model with no self-attention, no matter how much data you feed it, you'll still be lagging behind a transformer (with much less data). will it get to the same point? i don't think so. your tokens

Andrej Karpathy (@karpathy)'s Twitter Profile Photo

LLM model size competition is intensifying… backwards! My bet is that we'll see models that "think" very well and reliably that are very very small. There is most likely a setting even of GPT-2 parameters for which most people will consider GPT-2 "smart". The reason current