
Eshaan Nichani
@eshaannichani
PhD student @ Princeton University · Theoretical Machine Learning · Previously Math & CS @ MIT · he/him
ID: 1382509387755978752
http://eshaannichani.com 15-04-2021 01:41:51
54 Tweet
598 Followers
240 Following

Come hear about how transformers perform factual recall using associative memories, and how this emerges in phases during training! #ICLR2025 poster #602 at 3pm today. Lead by Eshaan Nichani Link: iclr.cc/virtual/2025/p… Paper: arxiv.org/abs/2412.06538




New work arxiv.org/abs/2506.05500 on learning multi-index models with Alex Damian and Joan Bruna. Multi-index are of the form y= g(Ux), where U=r by d maps from d dimension to r dimension and d>>r. g is an arbitrary function. Examples of multi-index models are any neural net