
Kshitij Gupta
@kshitijkgupta
Passionate about AGI | Interested in Scaling Laws, Multimodal Foundation Models, Memory & Reasoning! @Mila_Quebec | prev @DeepMind, @Microsoft
ID: 1524900686042996736
12-05-2022 23:53:53
19 Tweets
525 Followers
203 Following

We are thrilled to release the list of invited speakers at CoLLAs 2022: Yoshua Bengio, Rich Caruana, Claudia Clopath, Abhinav Gupta, Hugo Larochelle, Hanie Sedghi, Tinne Tuytelaars. Our registrations are also now open: lifelong-ml.cc/registration


Excited to be here! Quick intro: Student at Mila - Institut québécois d'IA, advised by Sarath Chandar and Irina Rish! Passionate about building AI agents! Currently working in Sequential Decision Making, Scaling Laws, Reasoning, Memory, and Planning! Love exploring and learning new things!



A new paper from my student Ethan Caballero, Kshitij Gupta, Irina Rish and yours truly! I'm really impressed with the empirical results. The TL;DR is that we replace "linear on a log-log plot" with "piecewise linear on a log-log plot".


Very excited to share Broken Neural Scaling Laws! We decompose scaling trends and model them with smoothly broken power laws. This gives SotA extrapolation results on a wide set of tasks! Work done with amazing collaborators - Ethan Caballero, Irina Rish, and David Krueger



Excited to share a sneak peek of what I have been exploring lately: How LLMs can use external tools and memory to iteratively design, implement, and debug code. Even more exciting results, features, and analyses coming out soon! kshitijkg.github.io/blog/jekyll/up… #LLMs #ChatGPT #code

