James Y. Huang (@jamesyhuang36) Twitter Tweets • TwiCopy

James Y. Huang

@jamesyhuang36


CS Ph.D. student @USC | Intern @MSFTResearch | Prev @AdobeResearch @AmazonScience @UCLA @UofT

ID: 1733497235294519296

Link: https://jyhuang36.github.io/
Joined: 09-12-2023 14:43:06

13 Tweets

55 Followers

39 Following

🌴Muhao Chen🌴

@muhao_chen

2 years ago

My awesome student James Y. Huang just received an outstanding paper award at #EMNLP2023! He is looking for summer research intern. Please interview him.


97 likes · 4 replies · 7 retweets

AK

DeAL: Decoding-time Alignment for Large Language Models. Paper page: huggingface.co/papers/2402.06… Large Language Models (LLMs) are nowadays expected to generate content aligned with human preferences. Current work focuses on alignment at model training time, through techniques such

132 likes · 2 replies · 30 retweets

Tianyi Lorena Yan

@lorenayannnnn

2 years ago

🤨 Frustrated when LLMs can give different or even wrong answers because you phrased your instruction differently? 🚀 Introducing Contrastive Instruction Tuning (CoIN) - our solution to make models less sensitive to the exact phrasing of instructions! (arxiv.org/abs/2402.11138)

37 likes · 1 reply · 9 retweets

Tianyi Lorena Yan

@lorenayannnnn

a year ago

Great news to wrap up this school year with my very first first-author paper: CoIN got accepted at #ACL2024 Findings! (github.com/luka-group/CoIN) 🌟Many thanks to my collaborators Fei Wang, James Y. Huang, Wenxuan Zhou, Fan Yin, Aram Galstyan and advisor 🌴Muhao Chen🌴

46 likes · 1 reply · 8 retweets

Xingyu Fu

@xingyufu2

a year ago

Can Text-to-Image models understand common sense? 🤔 Can they generate images that fit everyday common sense? 🤔 tldr; NO, they are far less intelligent than us 💁🏻‍♀️ Introducing Commonsense-T2I 💡 zeyofu.github.io/CommonsenseT2I/, a novel evaluation and benchmark designed to measure

132 likes · 7 replies · 39 retweets

Sheng Zhang

@sheng_zh

a year ago

With the fast-growing interest in multi-image understanding, where can we find a high-quality, comprehensive & robust benchmark? A group of talented students have created ᴍᴜɪʀʙᴇɴᴄʜ, a long-awaited multi-image benchmark, with 2.6k multi-choice Qs and over 10k images. (1/6)

51 likes · 3 replies · 27 retweets

Fei Wang

@fwang_nlp

a year ago

🌟 𝐌𝐮𝐥𝐭𝐢𝐦𝐨𝐝𝐚𝐥 𝐃𝐏𝐎🌟 🔍 DPO over-prioritizes language-only preference 🚀 Introducing mDPO: optimizes image-conditioned preference 🏆 Best 3B MLLM with reduced hallucination, beats LLaVA 7/13B with DPO Collaboration with Microsoft Research huggingface.co/papers/2406.11…

89 likes · 3 replies · 38 retweets

Fei Wang

@fwang_nlp

a year ago

Can GPT-4o and Gemini-Pro handle 𝐦𝐮𝐥𝐭𝐢𝐩𝐥𝐞 𝐢𝐦𝐚𝐠𝐞𝐬? Introducing MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding. 🌐 Explore here: muirbench.github.io 📄 Paper: arxiv.org/abs/2406.09411 📊 Data: huggingface.co/datasets/MUIRB…

99 likes · 2 replies · 44 retweets

Fei Wang

@fwang_nlp

10 months ago

🚨 DPO works well on LLMs, but what happens when we apply it to VLMs/MLLMs? 🚀 Excited to introduce our #EMNLP2024 work on DPO for Multimodal LLMs! We identify the unconditional preference issues (where the model 𝐨𝐯𝐞𝐫𝐥𝐨𝐨𝐤𝐬 𝐯𝐢𝐬𝐮𝐚𝐥 𝐜𝐨𝐧𝐭𝐞𝐱𝐭), leading to severe

132 likes · 1 reply · 29 retweets

Sheng Zhang

@sheng_zh

6 months ago

🚀 Excited to share MetaScale, our latest work advancing LLM reasoning capabilities! MetaScale empowers GPT-4o to match or even surpass frontier reasoning models like o1, Claude-3.5-Sonnet, and o1-mini on the challenging Arena-Hard benchmark (lmarena.ai). Additionally, MetaScale

105 likes · 0 replies · 28 retweets

Fei Wang

@fwang_nlp

4 months ago

🎉 Excited to share that our paper, "MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding", will be presented at #ICLR2025! 📅 Date: April 24 🕒 Time: 3:00 PM 📍 Location: Hall 3 + Hall 2B #11 MuirBench challenges multimodal LLMs with diverse multi-image

53 likes · 0 replies · 17 retweets
