Vinam Arora (@vinam_arora) 's Twitter Profile
Vinam Arora

@vinam_arora

Curr: PhD in ML at Georgia Tech // Deep Learning, Math, Chip Design, and Music

ID: 826713221423718400

linkhttp://vinam.dev calendar_today01-02-2017 08:46:06

129 Tweet

156 Followers

244 Following

Google AI (@googleai) 's Twitter Profile Photo

Introducing SANPO, a multi-attribute video dataset for outdoor human egocentric scene understanding composed of both real-world and synthetic data, including depth maps and video panoptic masks with a wide variety of semantic class labels. Read more → goo.gle/3ZISInU

pi (@pratiksha_pai) 's Twitter Profile Photo

its so crazy that we learn to simulate the nature around us. we make the sand think, see, listen, speak, render. beautiful!

its so crazy that we learn to simulate the nature around us. we make the sand think, see, listen, speak, render. beautiful!
Ekdeep Singh Lubana (@ekdeepl) 's Twitter Profile Photo

So how exactly does fine-tuning change a model's pretraining capabilities? Find out with 73 pages of detailed investigation on well-defined procedural tasks! From instruction fine-tuning to jailbreaking, there's something for everyone! :D Collab with Samyak & Robert Kirk!

Vinam Arora (@vinam_arora) 's Twitter Profile Photo

Sometimes, during mid-project code optimization, you find yourself in a groove, spotting potential efficiency everywhere. It's easy to forget the main goal - a functioning project, not one that's fully optimized.

Jascha Sohl-Dickstein (@jaschasd) 's Twitter Profile Photo

Have you ever done a dense grid search over neural network hyperparameters? Like a *really dense* grid search? It looks like this (!!). Blueish colors correspond to hyperparameters for which training converges, redish colors to hyperparameters for which training diverges.

Vinam Arora (@vinam_arora) 's Twitter Profile Photo

Check out our new preprint! GraphFM - A framework for learning from multiple graph datasets simultaneously. Led by the awesome Divyansha

jack morris (@jxmnop) 's Twitter Profile Photo

language modeling is not unsupervised learning. it is not (and please stop saying this) self-supervised learning. next-token prediction is textbook _supervised_ learning, and i will die on this hill