Nishanth Dikkala (@nishanthdikkala) 's Twitter Profile
Nishanth Dikkala

@nishanthdikkala

Senior Research Scientist @ Google Research, Ph.D. Computer Science, MIT.

ID: 1583679714

linkhttp://people.csail.mit.edu/nishanthd/ calendar_today10-07-2013 18:13:59

127 Tweet

366 Followers

269 Following

Nishanth Dikkala (@nishanthdikkala) 's Twitter Profile Photo

Check out the blog post on our work on scaling up embedding dimension in Transformer models efficiently! (NeurIPS 2023 Spotlight paper) (Joint work with Cenk Baykal, Dylan Cutler, Nikhil Ghosh, Rina Panigrahy and Xin Wang)

Nishanth Dikkala (@nishanthdikkala) 's Twitter Profile Photo

Check out our new multi-image reasoning benchmark where a model needs to reason using information spread across text and multiple images! (An interesting insight we discover: Even the mightiest models struggle to tell time!)

@levelsio (@levelsio) 's Twitter Profile Photo

✨ I made a new site called 🧳 💨 LuggageLosers.com It's a live ranking of airlines by how much luggage they are losing right now So you can avoid flying with them (and hopefully they can improve) Airlines losing most luggage rn: 🇮🇳 Air India 🇮🇪 Aer Lingus 🇬🇧 British

✨ I made a new site called

🧳 💨 LuggageLosers.com

It's a live ranking of airlines by how much luggage they are losing right now

So you can avoid flying with them (and hopefully they can improve)

Airlines losing most luggage rn:
🇮🇳 Air India
🇮🇪 Aer Lingus
🇬🇧 British
Nishanth Dikkala (@nishanthdikkala) 's Twitter Profile Photo

Playing around with giving o1-preview a slight modification of a question from AIME 2024. Almost gets it right. One marked improvement over previous models is the reasoning chain stays consistent throughout the solution. chatgpt.com/share/66e4cf93…

Playing around with giving o1-preview a slight modification of a question from AIME 2024. Almost gets it right. One marked improvement over previous models is the reasoning chain stays consistent throughout the solution.
chatgpt.com/share/66e4cf93…
Kiran Vodrahalli (kiranvodrahalli@mathstodon.xyz) (@kiranvodrahalli) 's Twitter Profile Photo

Happy to share Michelangelo (arxiv.org/abs/2409.12640), a long-context reasoning benchmark which measures performance beyond needle tasks up to arbitrary context lengths and remains challenging for frontier models. Stay tuned for more Michelangelo evals to come!

Nishanth Dikkala (@nishanthdikkala) 's Twitter Profile Photo

Check out our new work showing that causal language modeling alone is sufficient for a Transformer model to learn to solve Sudokus and other constraint satisfaction problems like Zebra puzzles! Lead by Kulin Shah, to appear at NeurIPS Conference 2024!

Mehran Kazemi (@kazemi_sm) 's Twitter Profile Photo

[1] ReMI: A Dataset for Reasoning with Multiple Images Work done with Nishanth Dikkala, Ankit Anand, Fangyu Liu, Bahare Fatemi and several other great colleagues. Fri 13 Dec, 11 am - 2 pm PST, West Ballroom A-D #5104 Details in sub-tweet: x.com/kazemi_sm/stat…

Kai E. Iliev (@jdeposicion) 's Twitter Profile Photo

After apparently the entirety of the Indian population, I've come to also learn that: *Y'all are wild with names *Y'all have Lenin's *Y'all have a dude called Napoleon Einstein *At some point I'll literally come to India just to investigate whatever the hell is going on

James Bedford (@james_s_bedford) 's Twitter Profile Photo

Found out students are using this website to have an AI generated Lebron James summarise their study material. You cannot make this stuff up. 3blue1bron.com

Nishanth Dikkala (@nishanthdikkala) 's Twitter Profile Photo

Presenting this work @ ICLR tomorrow! Come talk to us about looped transformers and their inductive bias for reasoning tasks. Poster #272: Hall 3 + 2B. iclr.cc/virtual/2025/p…

Google DeepMind (@googledeepmind) 's Twitter Profile Photo

We’re fully releasing Gemma 3n, which brings powerful multimodal AI capabilities to edge devices. 🛠️ Here’s a snapshot of its innovations 🧵

We’re fully releasing Gemma 3n, which brings powerful multimodal AI capabilities to edge devices. 🛠️

Here’s a snapshot of its innovations 🧵