
Sonia
@soniajoseph_
AI researcher @AIatMeta. getting ML PhD @Mila_Quebec. prev @Princeton.
ID: 3409885648
09-08-2015 06:09:07
1,1K Tweet
14,14K Followers
1,1K Following


Diffusion Steering Lens is a much more semantically rich logit lens for vision models (though I'm curious to see it applied to any type of model). You can decode transformer submodules with rich visuals. Looking forward to seeing Ryota's poster today at the Mechanistic Interpretability for Vision @ CVPR2025 workshop at CVPR!



Excited to share the results of my internship research with AI at Meta, as part of a larger world modeling release! What subtle shortcuts are VideoLLMs taking on spatio-temporal questions? And how can we instead curate shortcut-robust examples at large scale? Details 👇🔬





It was fun collaborating on this short paper by Constantin Venhoff, with Ashkan Khakzar, philip, and Neel Nanda, on modality alignment in VLMs. I especially liked using frozen SAEs as an analytic probe to measure cross-modal alignment.


The vision mechanistic interpretability workshop, Mechanistic Interpretability for Vision @ CVPR2025, earlier this month at CVPR was very informative and fun! Looking forward to seeing this community grow. Thank you to the speakers and organizers trevordarrell, David Bau, Tamar Rott Shaham, Yossi Gandelsman, Joanna

