
Nishanth Dikkala
@nishanthdikkala
Senior Research Scientist @ Google Research, Ph.D. Computer Science, MIT.
ID: 1583679714
http://people.csail.mit.edu/nishanthd/ 10-07-2013 18:13:59
127 Tweet
366 Followers
269 Following


Check out the blog post on our work on scaling up embedding dimension in Transformer models efficiently! (NeurIPS 2023 Spotlight paper) (Joint work with Cenk Baykal, Dylan Cutler, Nikhil Ghosh, Rina Panigrahy and Xin Wang)






Check out our new work showing that causal language modeling alone is sufficient for a Transformer model to learn to solve Sudokus and other constraint satisfaction problems like Zebra puzzles! Lead by Kulin Shah, to appear at NeurIPS Conference 2024!


[1] ReMI: A Dataset for Reasoning with Multiple Images Work done with Nishanth Dikkala, Ankit Anand, Fangyu Liu, Bahare Fatemi and several other great colleagues. Fri 13 Dec, 11 am - 2 pm PST, West Ballroom A-D #5104 Details in sub-tweet: x.com/kazemi_sm/stat…








