
Dan Fu
@realdanfu
Incoming assistant professor at UCSD CSE in MLSys. Currently recruiting students! Also running the kernels team @togethercompute.
ID: 1173687463790829568
http://danfu.org 16-09-2019 19:58:03
710 Tweet
5,5K Followers
205 Following




Great work from Infini-AI-Lab! Congrats Beidi Chen!




1/ Model architectures have been mostly treated as fixed post-training. 🌱 Introducing Grafting: A new way to edit pretrained diffusion transformers, allowing us to customize architectural designs on a small compute budget. 🌎 grafting.stanford.edu Co-led with Michael Poli

And to close out a trio of diffusion papers… Super excited to announce Grafting - a method for distilling pretrained diffusion transformers into *new architectures*, led by Keshigeyan Chandrasegaran! Swap attention for new primitives for 2% pretraining cost, exciting for modeling research!

Scale alone is not enough for AI data. Quality and complexity are equally critical. Excited to support all of these for LLM developers with Snorkel AI Data-as-a-Service, and to share our new leaderboard! — Our decade-plus of research and work in AI data has a simple point:

Never a better time to work with Snorkel AI :)



What a throwback to weak supervision! Great work Jon Saad-Falcon Kelly Buchanan Mayee Chen!


Day zero support for Flux kontext dev on Chipmunk! Great work Austin Silveria!

