
Soham De
@sohamde_
Research Scientist at DeepMind. Previously PhD at the University of Maryland.
ID: 306050621
https://sohamde.github.io/ 27-05-2011 05:59:44
187 Tweets
2.2K Followers
1.1K Following


Introducing our 9B language model, trained on 2 trillion tokens! Based on Griffin (arxiv.org/abs/2402.19427) and delivers: 💪 Powerful performance ⚡️ Lightning-fast inference. Pretrained and instruction-tuned models now available on HF & Kaggle! Start building today!

Our tutorial on diffusion & flows is out! We made every effort to simplify the math, while still being correct. Hope you enjoy! (Link below -- it's long but is split into 5 mostly-self-contained chapters). Lots of fun working with Arwen Bradley, Hattie Zhou, and Madhu Advani on this.




It was fun to moderate this discussion with a great group of panelists. Lots of interesting points made on how to approach the next gen of seq modelling architectures. Thanks for the invite, Caglar Gulcehre, Antonio Orvieto, Razvan, and others!







Excited to share that our paper "Bridging the human–AI knowledge gap through concept discovery and transfer in AlphaZero" is now out in PNAS! With Nenad Tomasev, Tom McGrath, Demis Hassabis, Ulrich Paquet, Been Kim. doi.org/10.1073/pnas.2…



We have a new SSM theory paper, just accepted to COLT, revisiting recall properties of linear RNNs. It's surprising how much one can delve into, and how beautiful it can become. With (and only thanks to) the amazing Alexandre and Francis Bach: arxiv.org/pdf/2502.09287

