
Tristan Thrush
@tristanthrush
PhD-ing @StanfordAILab @stanfordnlp. Interested in data, multimodality, scaling, and many more things.
ID: 1388198782924255232
http://www.tristanthrush.com 30-04-2021 18:29:34
570 Tweets
3.3K Followers
892 Following

Stanford NLP Group will be in @Singapore with lots of ICLR 2025 papers. Tristan Thrush, Christopher Potts & Tatsunori Hashimoto will show how to select good pretraining data: LLM losses on texts correlate with downstream benchmarks, so select high-correlation docs. arxiv.org/abs/2409.05816

1/ Model architectures have been mostly treated as fixed post-training. 🌱 Introducing Grafting: A new way to edit pretrained diffusion transformers, allowing us to customize architectural designs on a small compute budget. 🌎 grafting.stanford.edu Co-led with Michael Poli
