
Stephanie Fu
@xkungfu
PhD student @berkeley_ai | studying computer vision and intelligence | Previous: CS + Music @MIT
ID: 4870958294
https://stephanie-fu.github.io/ 07-02-2016 00:06:00
330 Tweet
576 Followers
228 Following


Happy to share our new work on Navigation World Models! ๐ฅ๐ฅ Navigation is a fundamental skill of agents with visual-motor capabilities. We train a single World Model across multiple environments and diverse agent data. w/ Gaoyue Zhou, Danny Tran, trevordarrell and Yann LeCun.



I'll be presenting "Images that Sound" today at #NeurIPS2024! East Exhibit Hall A-C #2710. Come say hi to me and Andrew Owens :) (Ziyang Chen sadly could not make it, but will be there in spirit :') )

Mitigating racial bias from LLMs is a lot easier than removing it from humans! Canโt believe this happened at the best AI conference NeurIPS Conference We have ethical reviews for authors, but missed it for invited speakers? ๐ก







Thrilled to share the first-ever search leaderboard with lmarena.ai! It's so fun to see how models behave differently โ OpenAI loves news (but not YT), Perplexity favors YouTube, and Gemini (Google DeepMind) leans on blogs/forums. More insights: blog.lmarena.ai/blog/2025/searโฆ



๐ Call for Papers! ๐ Excited to help organize the 4th Workshop on What is Next in Multimodal Foundation Models? at ICCV in Honolulu, Hawai'i ๐บ Submit work on vision, language, audio & more! ๐๏ธ Deadline: July 1, 2025 ๐ sites.google.com/view/mmfm4thwoโฆ #MMFM4 #ICCV2025 #AI #multimodal

Artifacts in your attention maps? Forgot to train with registers? Use ๐ฉ๐๐จ๐ฉ-๐ฉ๐๐ข๐ ๐ง๐๐๐๐จ๐ฉ๐๐ง๐จ! We find a sparse set of activations set artifact positions. We can shift them anywhere ("Shifted") โ even outside the image into an untrained token. Clean maps, no retrain.


