Zirui "Colin" Wang
@zwcolin
Incoming CS PhD @Berkeley_EECS; MSCS @princeton_nlp; '25 @siebelscholars; prev @HDSIUCSD; I work on multimodal foundation models; He/Him.
ID: 2986434572
http://ziruiw.net 17-01-2015 04:18:40
122 Tweets
1.1K Followers
528 Following
I'm recruiting multiple fully funded MSc/PhD students at the University of Alberta for Fall 2025! Join my lab working on NLP, especially reasoning and interpretability (see my website for more details about my research). Apply by December 15!
I've just arrived in Vancouver and am excited to join the final stretch of #NeurIPS2024! This morning, we are presenting 3 papers 11am-2pm: - Edge pruning for finding Transformer circuits (#3111, spotlight) Adithya Bhaskar - SimPO (#3410) Yu Meng @ ICLR'25 Mengzhou Xia - CharXiv (#5303)
While DeepSeek R1 has been flexing 💪🏻, how are VLMs progressing in reasoning? ⚠️ Major Shift: the latest open-weight Qwen2.5-VL has beaten the first GPT-4o and is now on par with the latest ChatGPT-4o! 😲 But what about o1-like models? Can they enhance
It seems that models can figure out the correct rules with RL. I created a synthetic game to run GRPO on VLMs over the weekend and I didn't realize I wrote down the wrong rule for the instruction 🤦🏻‍♂️. With ~200 steps the model learns the corner cases where the wrong rule can