
Armen Aghajanyan
@armenagha
Co-founder & CEO @perceptroninc; ex-RS FAIR/MSFT
ID: 1515424688
14-06-2013 05:43:07
648 Tweet
15,15K Followers
271 Following







Yann LeCun Raw Daron Acemoglu you were/are the Chief Scientist of Meta, and a FAIR Lead -- where both Zetta and Llama were located; I think characterizing any team within your direct influence in a bad light in public is not nice. yea the Llama folks were great. praise them. What if Zetta was allowed to run


Surely they added Susan Zhang as an author to LLaMa right?


Susan Zhang Armen Aghajanyan Susan Zhang and Stephen Roller should have been authors. There was a lot of discussion about it and imo, not including them was a mistake made by a people under a lot of pressure and frustration.

fun debugging journey w/Akshat Shrivastava: be careful around FP8 w. activation checkpointing activation checkpointing works under the assumptions that different calls of forward give similar results which we move away from the more we quantize. when you re-quantize in activation



Maksymilian Wojnar and I have been playing around with tensor alignments in neural networks. here’s a summary of our exploration. we go into neural net parameterizations, measuring tensor alignments, and we develop a dynamic maximal learning rate scheduler which factors in alignment




