Kshitij Gupta (@kshitijkgupta) Twitter Tweets • TwiCopy

Kshitij Gupta

@kshitijkgupta


Passionate about AGI | Interested in Scaling Laws, Multimodal Foundation Models, Memory & Reasoning! @Mila_Quebec | prev @DeepMind, @Microsoft

ID: 1524900686042996736

12-05-2022 23:53:53

19 Tweets

525 Followers

203 Following

We are thrilled to release the list of invited speakers at CoLLAs 2022 (@CoLLAs_Conf): Yoshua Bengio, Rich Caruana, Claudia Clopath, Abhinav Gupta, Hugo Larochelle, Hanie Sedghi, Tinne Tuytelaars. Our registrations are also now open: lifelong-ml.cc/registration

Likes: 51 · Replies: 0 · Retweets: 13

Kshitij Gupta

@kshitijkgupta

3 years ago

Excited to be here! Quick intro: Student at Mila - Institut québécois d'IA, advised by Sarath Chandar and Irina Rish! Passionate about building AI agents! Currently working in Sequential Decision Making, Scaling Laws, Reasoning, Memory, and Planning! Love exploring and learning new things!

Likes: 46 · Replies: 0 · Retweets: 2

Kshitij Gupta

@kshitijkgupta

3 years ago

This is super exciting work by Google AI! Chain of thought prompting and step-by-step reasoning can help LLMs break down complex multi-step problems and iteratively reuse their knowledge to solve each sub-problem, solving problems beyond what was seen during the pretraining stage!

Likes: 2 · Replies: 0 · Retweets: 0

Kshitij Gupta

@kshitijkgupta

3 years ago

What I find most exciting about this work:
- Efficiently captures long contexts without an O(T^2) complexity.
- Encourages capturing only relevant information from the past.
- Top-Down information introduces Feedback and Recurrence into Transformers helping model sequences better!

Likes: 4 · Replies: 0 · Retweets: 1

David Krueger

@davidskrueger

3 years ago

A new paper from my student Ethan Caballero, Kshitij Gupta, Irina Rish and yours truly! I'm really impressed with the empirical results. The TL;DR is that we replace "linear on a log-log plot" with "piecewise linear on a log-log plot".

Likes: 42 · Replies: 4 · Retweets: 4

Kshitij Gupta

@kshitijkgupta

3 years ago

Very excited to share Broken Neural Scaling Laws! We decompose scaling trends and model them with smoothly broken power laws. This gives SotA extrapolation results on a wide set of tasks! Work done with amazing collaborators - Ethan Caballero, Irina Rish, and David Krueger

Likes: 16 · Replies: 0 · Retweets: 2
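For readers unfamiliar with the idea, here is a minimal sketch of what "piecewise linear on a log-log plot" means for a smoothly broken power law. It assumes a single break and no constant offset; the general multi-break form is in the paper (arxiv.org/abs/2210.14891). The function names and all parameter values below are made up for illustration and are not taken from the authors' code or results.

```python
import math

def power_law(x, b, c0):
    """Plain power law: a straight line with slope -c0 on a log-log plot."""
    return b * x ** (-c0)

def smoothly_broken_power_law(x, b, c0, c1, d1, f1):
    """Power law with one smooth break near x = d1 (illustrative form).

    On a log-log plot the slope is about -c0 for x << d1 and about
    -(c0 + c1) for x >> d1; f1 controls how sharp the bend is.
    """
    return b * x ** (-c0) * (1.0 + (x / d1) ** (1.0 / f1)) ** (-c1 * f1)

def loglog_slope(f, x, eps=1e-3):
    """Numerical slope d(log f)/d(log x) at x, via a central difference."""
    lo, hi = x * math.exp(-eps), x * math.exp(eps)
    return (math.log(f(hi)) - math.log(f(lo))) / (2 * eps)

# Hypothetical parameters, chosen only to make the break easy to see.
params = dict(b=1.0, c0=0.2, c1=0.5, d1=1e4, f1=0.1)

def curve(x):
    return smoothly_broken_power_law(x, **params)

for x in (1e1, 1e4, 1e7):
    print(f"x={x:.0e}  local log-log slope={loglog_slope(curve, x):.3f}")
```

Far below the break the local slope stays close to -c0, and far above it close to -(c0 + c1), so the curve looks like two straight segments joined by a smooth bend. A plain power law has the same slope everywhere.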

Kshitij Gupta

@kshitijkgupta

3 years ago

Happy new year everyone! 2022 was a wild ride, but I can’t wait to see what 2023 has in store as we work toward AGI. Looking forward to building multi-modal agents that can interact with external worlds and tools, and reason and solve new tasks! Here’s to an even wilder new year!

Likes: 17 · Replies: 1 · Retweets: 2

Kshitij Gupta

@kshitijkgupta

2 years ago

Excited to share a sneak peek of what I have been exploring lately: How LLMs can use external tools and memory to iteratively design, implement, and debug code. Even more exciting results, features, and analyses coming out soon! kshitijkg.github.io/blog/jekyll/up… #LLMs #ChatGPT #code

Likes: 212 · Replies: 3 · Retweets: 27

Ethan Caballero is busy

@ethancaballero

2 years ago

New version of Broken Neural Scaling Laws (BNSL) is out with accurate extrapolation results for the scaling behaviors listed in this attached picture: arxiv.org/abs/2210.14891 arxiv.org/pdf/2210.14891… Plots of all extrapolations are in this 🧵. Any other extrapolations you want?

Likes: 114 · Replies: 6 · Retweets: 24

Ethan Caballero is busy

@ethancaballero

2 years ago

Want to Superforecast AGI? Stop by "Broken Neural Scaling Laws" (arxiv.org/abs/2210.14891) poster at ICLR Conference poster session at MH1-2-3-4 #27 at 11:30AM - 1:30PM on Monday (iclr.cc/virtual/2023/p…) & at ICLR Me-FoMo Workshop poster session at AD10 at 1PM - 2PM on Thursday

Likes: 59 · Replies: 1 · Retweets: 12