Pradeep Shenoy (@doktorshenoy) Twitter Tweets • TwiCopy

Gate.io

5 hours ago

🔥The 9th Round of Easy Loan, Earn $40 Reward is in progress❗️ ⏰ Promotion Period: January 15th - Feburary 15th, 2025 👉 Register now and check more details at gate.io/campaigns/358

thumb_up_off_alt34

chat_bubble_outline39

repeat6

shareShare

Prateek Jain

@jainprateek_

2 years ago

Amazing work by Aniket and MLO team at GRI. Aniket will present 2 really nice results at COLT and he presented one very strong paper at Neurips last year....1/2

thumb_up_off_alt49

chat_bubble_outline2

repeat1

shareShare

In the age of large-scale models, how does interpretability scale with data and model size? We investigate the effect of scale on mechanistic interpretability by collecting >120’000 human responses and find no improvement! 1/6 🌐brendel-group.github.io/imi 📃arxiv.org/abs/2307.05471

thumb_up_off_alt83

chat_bubble_outline1

repeat27

shareShare

Shek Azizi

@azizishekoofeh

2 years ago

Thrilled to announce Med-PaLM, our medical large language model is published in nature, today. Extremely excited for the possibilities this unlocks! rdcu.be/dgGfJ

Thrilled to announce Med-PaLM, our medical large language model is published in <a href="/Nature/">nature</a>, today.

Extremely excited for the possibilities this unlocks!

rdcu.be/dgGfJ

thumb_up_off_alt858

chat_bubble_outline25

repeat153

shareShare

hardmaru

@hardmaru

2 years ago

Excellent article by Sander Dieleman about diffusion models! My favorite part is about the link to RNNs: “Diffusion models present a way to train deep RNNs without backpropagating through the recurrence at all, yielding a much more scalable training procedure.” sander.ai/2023/07/20/per…

thumb_up_off_alt338

chat_bubble_outline3

repeat58

shareShare

Pratyush Maini

@pratyushmaini

2 years ago

1/Can Neural Network Memorization be Localized?🔍Learn about "Example-tied Dropout" that directs memorization to fixed neurons that can be thrown away at test time! #323@10:30 today #ICML w/Mike Mozer Hanie Sedghi @ZacharyLipton @ZicoKolter Chiyuan Zhang 🌐tinyurl.com/mem-drop

thumb_up_off_alt135

chat_bubble_outline3

repeat29

shareShare

Pradeep Shenoy

@doktorshenoy

2 years ago

Happy to share our recent #ICML2023 work -- we hope & believe this opens up interesting new directions in supervised learning. Please share!

thumb_up_off_alt12

chat_bubble_outline0

repeat0

shareShare

Peyman Milanfar

@docmilanfar

2 years ago

when your first paper gets demolished but you keep submitting

thumb_up_off_alt473

chat_bubble_outline5

repeat38

shareShare

Gamaleldin Elsayed

@gamaleldinfe

2 years ago

Nature Comms paper: Subtle adversarial image manipulations influence both human and machine perception! We show that adversarial attacks against computer vision models also transfer (weakly) to humans, even when the attack magnitude is small. nature.com/articles/s4146…

thumb_up_off_alt389

chat_bubble_outline12

repeat89

shareShare

Prateek Jain

@jainprateek_

2 years ago

Exciting work by Ramnath Kumar, Dheeraj Nagaraj, Arun Suggala. Just two lines of change leads to significantly more robust learning, and non-trivial gains in a variety of domains including for GLUE benchmarks, Imagenet, OOD datasets, Tabular datasets etc . [1 out of 2]

thumb_up_off_alt113

chat_bubble_outline5

repeat12

shareShare

Prateek Jain

@jainprateek_

2 years ago

Super excited about Matformers! It provides ability to train one large model and read-off multiple smaller models without additional training! Slides from our talk at ICCV Workshop on Resource Constrained Deep Vision: prateekjain.org/publications/s… prateekjain.org/publications/s…

thumb_up_off_alt92

chat_bubble_outline1

repeat15

shareShare

Neil Renic

@nc_renic

2 years ago

Handing over the “big improvements” to your co-author

thumb_up_off_alt561

chat_bubble_outline3

repeat41

shareShare

François Chollet

@fchollet

2 years ago

Legend has it that Americans have 50 words for "breakfast cereal", including one to refer to the transient state of corn flakes where they've soaked up enough milk that they're not too dry, but not yet too soggy, just right

thumb_up_off_alt122

chat_bubble_outline11

repeat7

shareShare

IIT Madras

@iitmadras

a year ago

“The establishment of the School of AI at #IITM will help India become a global leader in AI" says Mr. Sunil Wadhwani, an IIT Madras alumni who donated Rs. 110 Cr. to establish a School of Data Science & AI. Watch NDTV youtube.com/watch?v=10Uk9w… J Sam Daniel Stalin Balaraman Ravindran WISH Foundation India

thumb_up_off_alt45

chat_bubble_outline1

repeat11

shareShare

Pradeep Shenoy

@doktorshenoy

a year ago

Check out our blog post on early readouts as a means of identifying and mitigating spurious features in neural networks!

thumb_up_off_alt8

chat_bubble_outline0

repeat0

shareShare

Jascha Sohl-Dickstein

@jaschasd

a year ago

Have you ever done a dense grid search over neural network hyperparameters? Like a *really dense* grid search? It looks like this (!!). Blueish colors correspond to hyperparameters for which training converges, redish colors to hyperparameters for which training diverges.

thumb_up_off_alt9,9K

chat_bubble_outline275

repeat1,1K

shareShare

Google AI

@googleai

a year ago

Lumiere is a space-time diffusion research model that generates video from various inputs, including image-to-video. The model generates videos that start with the desired first frame & exhibit intricate coherent motion across the entire video duration → goo.gle/47WX6C2

thumb_up_off_alt597

chat_bubble_outline47

repeat157

shareShare

Prateek Jain

@jainprateek_

a year ago

Excited to share Tandem Transformers: a simple but effective technique to significantly reduce latency of generative LLMs (in some cases ~2.74x). arxiv.org/abs/2402.08644 [1/n]

thumb_up_off_alt171

chat_bubble_outline1

repeat23

shareShare

Pradeep Shenoy

@doktorshenoy

a year ago

Check out our blog post on learning under concept drift! We learn reweightings of training data to model information decay and maximize future performance.

thumb_up_off_alt13

chat_bubble_outline0

repeat2

shareShare

SP Arun

@sparuniisc

a year ago

Come attend the awesome Bangalore Cognition Workshop at IISc from June 15-21 2024! Apply at forms.gle/kfA1obX8CkPgCP… Deadline: Feb 29 2024 Please circulate widely!

thumb_up_off_alt125

chat_bubble_outline5

repeat58

shareShare

Divy Thakkar

@divy93t

a year ago

Part of my top secret mission to excite more people to work on enabling Human-Human collaboration through AI !

thumb_up_off_alt67

chat_bubble_outline0

repeat2

shareShare

Pradeep Shenoy

Gate.io

Prateek Jain

Wieland Brendel

Shek Azizi

hardmaru

Pratyush Maini

Pradeep Shenoy

Peyman Milanfar

Gamaleldin Elsayed

Prateek Jain

Prateek Jain

Neil Renic

François Chollet

IIT Madras

Pradeep Shenoy

Jascha Sohl-Dickstein

Google AI

Prateek Jain

Pradeep Shenoy

SP Arun

Divy Thakkar