Shrutimoy Das (@shrutimoy)'s Twitter Profile
Shrutimoy Das

@shrutimoy

PhD Student at IIT Gandhinagar

ID: 1405624883254493186

Joined: 17-06-2021 20:34:36

40 Tweets

79 Followers

335 Following

Tanishq Mathew Abraham, Ph.D. (@iscienceluvr)'s Twitter Profile Photo

Are you wondering how large language models like ChatGPT and InstructGPT actually work? One of the secret ingredients is RLHF - Reinforcement Learning from Human Feedback. Let's dive into how RLHF works in 8 tweets!
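
The 8-tweet thread itself isn't reproduced here. As a very rough sketch of the RLHF loop the tweet alludes to (not the actual ChatGPT/InstructGPT training code), the toy below samples responses from a tiny policy, scores them with a made-up stand-in for a learned reward model, and nudges the policy toward high-reward responses while penalizing drift from the frozen reference policy:

```python
# Toy RLHF-style update: sample -> score with a "reward model" -> nudge the
# policy toward high-reward outputs while staying close to the reference.
# The response list and reward values are invented for this illustration.
import numpy as np

rng = np.random.default_rng(0)

responses = ["helpful answer", "rude answer", "off-topic answer"]
reward = np.array([1.0, -1.0, -0.5])   # stand-in for a learned reward model

ref_logits = np.zeros(3)               # frozen reference (SFT) policy
logits = ref_logits.copy()             # trainable policy, starts at the reference

def softmax(z):
    z = z - z.max()
    e = np.exp(z)
    return e / e.sum()

beta, lr = 0.1, 0.5                    # KL penalty weight, step size
for step in range(200):
    probs = softmax(logits)
    i = rng.choice(3, p=probs)         # sample a response from the current policy
    # KL-shaped reward: reward-model score minus a penalty for drifting
    # away from the reference policy (the core RLHF objective).
    r = reward[i] - beta * (np.log(probs[i]) - np.log(softmax(ref_logits)[i]))
    # REINFORCE-style update: raise the log-prob of the sampled response
    # in proportion to its penalized reward.
    grad = -probs
    grad[i] += 1.0
    logits += lr * r * grad

print({resp: round(p, 2) for resp, p in zip(responses, softmax(logits))})
```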

Divy Thakkar (@divy93t)'s Twitter Profile Photo

Research Week with Google - officially a wrap! Extremely energising to be with students and see their research curiosity! Till next time! Special thanks to our amazing speakers, ACs, organisers and Program Chairs!

Ben Grimmer (@prof_grimmer)'s Twitter Profile Photo

I've proven the strangest result of my career... The classic idea that gradient descent's rate is best with constant stepsizes 1/L is wrong. The idea that we need stepsizes in (0,2/L) for convergence is wrong. Periodic long steps are better, provably. arxiv.org/abs/2307.06324
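
The paper's actual long-step schedule is a specific, carefully constructed pattern; the sketch below only illustrates the headline point on a random convex quadratic, interleaving an occasional step longer than 2/L with ordinary 1/L steps. The schedule [1/L, 1/L, 1/L, 3/L] is my own illustrative choice, not the paper's:

```python
# Toy comparison of constant-stepsize gradient descent vs. a schedule with
# periodic long steps, on a random smooth convex quadratic.
import numpy as np

rng = np.random.default_rng(1)
n = 50
A = rng.standard_normal((n, n))
H = A.T @ A / n                      # positive semidefinite Hessian
L = np.linalg.eigvalsh(H).max()      # smoothness constant
x_star = rng.standard_normal(n)
b = H @ x_star                       # minimizer of f(x) = 0.5 x'Hx - b'x is x_star

def f(x):
    return 0.5 * x @ H @ x - b @ x

def run(stepsizes, T=400):
    x = np.zeros(n)
    for t in range(T):
        x -= stepsizes[t % len(stepsizes)] * (H @ x - b)   # gradient step
    return f(x) - f(x_star)                                # optimality gap

print("constant 1/L      :", run([1 / L]))
print("periodic long step:", run([1 / L, 1 / L, 1 / L, 3 / L]))
```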

Petar Veličković (@petarv_93)'s Twitter Profile Photo

Scientific discovery in the Age of AI 🧪🤖🧑‍🔬✨ ...now published in nature! It's been fantastic writing this survey-spinoff of the AI for Science workshops with these amazing coauthors! Thanks Marinka Zitnik for always keeping our spirits high! 😊 nature.com/articles/s4158…

Shubhendu Trivedi (@_onionesque)'s Twitter Profile Photo

This probably keeps getting shared here all the time, but it's worth resharing: An excellent set of lectures on high dimensional probability and concentration inequalities by Roman Vershynin. These complement his great book well. math.uci.edu/~rvershyn/teac…
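
As a quick numerical taste of the concentration phenomena those lectures cover (this example is mine, not taken from the notes): the Euclidean norm of a standard Gaussian vector in R^n concentrates tightly around sqrt(n), with fluctuations that stay of constant order as the dimension grows.

```python
# Empirical check of a classic high-dimensional concentration fact:
# ||g||_2 for g ~ N(0, I_n) is close to sqrt(n), with O(1) fluctuations.
import numpy as np

rng = np.random.default_rng(0)
for n in [10, 100, 1000, 10000]:
    norms = np.linalg.norm(rng.standard_normal((2000, n)), axis=1)
    print(f"n={n:6d}  mean ||g|| = {norms.mean():8.2f}  "
          f"sqrt(n) = {np.sqrt(n):8.2f}  std = {norms.std():5.2f}")
```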

elvis (@omarsar0)'s Twitter Profile Photo

LLMs as Optimizers This is a really neat idea. This new paper from Google DeepMind proposes an approach where the optimization problem is described in natural language. An LLM is then instructed to iteratively generate new solutions based on the defined problem and previously
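
A rough sketch of the loop the tweet describes (not the paper's code): the problem statement and the scored solutions found so far are packed into a meta-prompt, and the LLM is asked for a better candidate. Here `call_llm` and `score` are hypothetical stand-ins for an LLM API call and a task-specific evaluation function.

```python
# Sketch of "LLM as optimizer": iteratively ask an LLM for new solutions,
# conditioning on the natural-language problem and previously scored attempts.
def optimize_with_llm(problem_description, score, call_llm, n_steps=10):
    history = []                                     # (solution, score) pairs
    for _ in range(n_steps):
        # Meta-prompt: problem statement plus solutions tried so far,
        # ordered so the best-scoring ones appear last.
        prompt = problem_description + "\n\nPrevious solutions (solution -> score):\n"
        for sol, s in sorted(history, key=lambda p: p[1]):
            prompt += f"{sol} -> {s}\n"
        prompt += "\nPropose a new solution that scores higher than all of the above."
        candidate = call_llm(prompt)                 # hypothetical LLM call
        history.append((candidate, score(candidate)))
    return max(history, key=lambda p: p[1])          # best solution found
```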

AI Coffee Break with Letitia (@aicoffeebreak)'s Twitter Profile Photo

How does LoRA work? Low-Rank Adaptation for Parameter-Efficient LLM Finetuning explained. 👇 📺 youtu.be/KEv-F5UkhxU Great work by Edward Hu, Yelong Shen and collaborators! 👏 arxiv.org/abs/2106.09685
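
A minimal sketch of the LoRA idea from the linked paper (not the official implementation): the pretrained weight W stays frozen, and only a low-rank update B @ A is trained, scaled by alpha/r. With B initialized to zero, training starts exactly from the original model.

```python
# LoRA forward pass in miniature: frozen W plus a trainable low-rank update.
import numpy as np

d_out, d_in, r, alpha = 768, 768, 8, 16
rng = np.random.default_rng(0)

W = rng.standard_normal((d_out, d_in)) * 0.02   # frozen pretrained weight
A = rng.standard_normal((r, d_in)) * 0.01       # trainable, r x d_in
B = np.zeros((d_out, r))                        # trainable, d_out x r (starts at 0)

def lora_linear(x):
    # Full finetuning would update all d_out*d_in entries of W;
    # LoRA only trains the r*(d_in + d_out) entries of A and B.
    return W @ x + (alpha / r) * (B @ (A @ x))

x = rng.standard_normal(d_in)
print(lora_linear(x).shape)                     # (768,)
print("trainable params:", A.size + B.size, "vs full:", W.size)
```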

Ben Grimmer (@prof_grimmer)'s Twitter Profile Photo

The new strangest results of my career (with Kevin Shu and Alex Wang). Gradient descent can accelerate (in big-O!) by just periodically taking longer steps. No momentum needed to beat O(1/T) in smooth convex opt! Paper: arxiv.org/abs/2309.09961 [1/3]

MIT CSAIL (@mit_csail)'s Twitter Profile Photo

“The best learners are the people who push through the discomfort of being objectively bad at something.” — Tommy Collison

Shubhajit Roy (@royshubhajit)'s Twitter Profile Photo

Motivation to work on #connectivity by the “Ethernet Man” @BobMetcalfe12. Also awesome talk by Prof. Saket Saurabh, Faculty #IMSc #Chennai on Bad algorithms. Again, thanks to ACM India, Association for Computing Machinery and Infosys

Prof. Anima Anandkumar (@animaanandkumar)'s Twitter Profile Photo

For the first time, we show that the Llama 7B LLM can be trained on a single consumer-grade GPU (RTX 4090) with only 24GB memory. This represents more than 82.5% reduction in memory for storing optimizer states during training. Training LLMs from scratch currently requires huge
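
The tweet is cut off before it names the mechanism, but the saving it quotes is specifically in optimizer states. Assuming this refers to projecting gradients onto a low-rank subspace (the GaLore idea) so that Adam's moment buffers live in that smaller space, the sketch below is a rough illustration of where the memory reduction comes from; the real method also refreshes the projection periodically via SVD.

```python
# Rough illustration (not the paper's code): Adam keeps two moment buffers the
# same shape as each gradient. Storing them in a rank-r projected space of an
# (m x n) weight costs 2*r*n floats instead of 2*m*n.
import numpy as np

m, n, r = 1024, 1024, 64
rng = np.random.default_rng(0)

grad = rng.standard_normal((m, n))
U, _, _ = np.linalg.svd(grad, full_matrices=False)
P = U[:, :r]                               # projection onto top-r gradient subspace

g_low = P.T @ grad                         # (r, n) projected gradient
m1, m2 = np.zeros_like(g_low), np.zeros_like(g_low)   # Adam moments, kept low-rank
m1 = 0.9 * m1 + 0.1 * g_low
m2 = 0.999 * m2 + 0.001 * g_low**2
update = P @ (m1 / (np.sqrt(m2) + 1e-8))   # project the step back to (m, n)

full_state = 2 * m * n                     # floats Adam would normally store
low_state = 2 * r * n
print(f"optimizer-state reduction: {1 - low_state / full_state:.1%}")  # 93.8% here
```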

Quanta Magazine (@quantamagazine)'s Twitter Profile Photo

In two recent papers, researchers have improved upon the best-known speed for matrix multiplication. Steve Nadis reports: quantamagazine.org/new-breakthrou…
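
The new papers improve the theoretical exponent with techniques far beyond anything shown here. As a basic, classical illustration that matrix multiplication can beat the naive O(n^3) operation count at all (unrelated to the methods in those papers), here is Strassen's recursion:

```python
# Strassen's algorithm: 7 recursive block products per level instead of 8,
# giving an O(n^2.81) multiplication. Shown only as a classical example of
# sub-cubic matrix multiplication.
import numpy as np

def strassen(A, B):
    n = A.shape[0]
    if n <= 64:                       # fall back to the naive product for small blocks
        return A @ B
    h = n // 2
    A11, A12, A21, A22 = A[:h, :h], A[:h, h:], A[h:, :h], A[h:, h:]
    B11, B12, B21, B22 = B[:h, :h], B[:h, h:], B[h:, :h], B[h:, h:]
    M1 = strassen(A11 + A22, B11 + B22)
    M2 = strassen(A21 + A22, B11)
    M3 = strassen(A11, B12 - B22)
    M4 = strassen(A22, B21 - B11)
    M5 = strassen(A11 + A12, B22)
    M6 = strassen(A21 - A11, B11 + B12)
    M7 = strassen(A12 - A22, B21 + B22)
    C = np.empty_like(A)
    C[:h, :h] = M1 + M4 - M5 + M7
    C[:h, h:] = M3 + M5
    C[h:, :h] = M2 + M4
    C[h:, h:] = M1 - M2 + M3 + M6
    return C

A = np.random.default_rng(0).standard_normal((256, 256))
B = np.random.default_rng(1).standard_normal((256, 256))
print(np.allclose(strassen(A, B), A @ B))   # True, up to floating-point error
```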