
Xuehai He
@xuehaih
Vision and language
ID: 1318704802973376514
http://sheehan1230.github.io 21-10-2020 00:05:04
92 Tweet
182 Followers
304 Following


๐๐๐๐ญ๐ข๐ง๐ ๐๐ฉ๐๐ง๐๐ ๐ข๐ฌ ๐ง๐จ๐ญ ๐๐ฌ ๐ก๐๐ซ๐ ๐๐ฌ ๐ฒ๐จ๐ฎ ๐ญ๐ก๐ข๐ง๐ค. If you don't believe you can compete, you've already lost. Winning starts with mindset. ๐Introducing ๐จ๐๐๐๐ ๐บ2, ๐ญ๐ก๐ ๐ฐ๐จ๐ซ๐ฅ๐'๐ฌ ๐๐๐ฌ๐ญ ๐๐จ๐ฆ๐ฉ๐ฎ๐ญ๐๐ซ-๐ฎ๐ฌ๐ ๐๐ ๐๐ง๐ญ, and the second




๐ Excited to announce our 4th Workshop on Computer Vision in the Wild (CVinW) at #CVPR2025 2025! ๐ computer-vision-in-the-wild.github.io/cvpr-2025/ โญWe have invinted a great lineup of speakers: Prof. Kaiming He, Prof. Boqing Gong, Prof. Cordelia Schmid, Prof. Ranjay Krishna, Prof. Saining Xie, Prof.





๐๐ถ๐ฎ๐ข๐ฏ๐ด ๐ต๐ฉ๐ช๐ฏ๐ฌ ๐ง๐ญ๐ถ๐ช๐ฅ๐ญ๐บโ๐ฏ๐ข๐ท๐ช๐จ๐ข๐ต๐ช๐ฏ๐จ ๐ข๐ฃ๐ด๐ต๐ณ๐ข๐ค๐ต ๐ค๐ฐ๐ฏ๐ค๐ฆ๐ฑ๐ต๐ด ๐ฆ๐ง๐ง๐ฐ๐ณ๐ต๐ญ๐ฆ๐ด๐ด๐ญ๐บ, ๐ง๐ณ๐ฆ๐ฆ ๐ง๐ณ๐ฐ๐ฎ ๐ณ๐ช๐จ๐ช๐ฅ ๐ญ๐ช๐ฏ๐จ๐ถ๐ช๐ด๐ต๐ช๐ค ๐ฃ๐ฐ๐ถ๐ฏ๐ฅ๐ข๐ณ๐ช๐ฆ๐ด. But current reasoning models remain constrained by discrete tokens, limiting their full




๐ New paper out! โHidden in Plain Sight: Probing Implicit Reasoning in Multimodal Language Modelsโ Real life is messy: ๐น โMake a cup of apple juiceโ - but no apples are in sight ๐น โSay hi to my friendโ - yet two people are in the frame ๐น โTell me the brand of the lipstickโ -

