
James Y. Huang
@jamesyhuang36
CS Ph.D. student @USC | Intern @MSFTResearch | Prev @AdobeResearch @AmazonScience @UCLA @UofT
ID: 1733497235294519296
https://jyhuang36.github.io/ 09-12-2023 14:43:06
13 Tweets
55 Followers
39 Following

My awesome student James Y. Huang just received an outstanding paper award at #EMNLP2023! He is looking for a summer research internship. Please interview him.




Great news to wrap up this school year with my very first first-author paper: CoIN got accepted at #ACL2024 Findings! (github.com/luka-group/CoIN) Many thanks to my collaborators Fei Wang, James Y. Huang, Wenxuan Zhou, Fan Yin, Aram Galstyan and advisor Muhao Chen

Can Text-to-Image models understand common sense? Can they generate images that fit everyday common sense? tl;dr: NO, they are far less intelligent than us. Introducing Commonsense-T2I (zeyofu.github.io/CommonsenseT2I/), a novel evaluation and benchmark designed to measure



Multimodal DPO: DPO over-prioritizes language-only preference. Introducing mDPO: optimizes image-conditioned preference. Best 3B MLLM with reduced hallucination, beats LLaVA 7/13B with DPO. Collaboration with Microsoft Research. huggingface.co/papers/2406.11…


DPO works well on LLMs, but what happens when we apply it to VLMs/MLLMs? Excited to introduce our #EMNLP2024 work on DPO for Multimodal LLMs! We identify the unconditional preference issues (where the model overlooks visual context), leading to severe

Excited to share MetaScale, our latest work advancing LLM reasoning capabilities! MetaScale empowers GPT-4o to match or even surpass frontier reasoning models like o1, Claude-3.5-Sonnet, and o1-mini on the challenging Arena-Hard benchmark (lmarena.ai). Additionally, MetaScale

