🌴Muhao Chen🌴 (@muhao_chen) 's Twitter Profile
🌴Muhao Chen🌴

@muhao_chen

🐹Assistant Professor of Computer Science @UCDavis🐹 | 💙PhD @UCLAComSci 2019💛 | 🌴加州boy🌴 | 🎸@GALNERYUSOFFIC2#1🎧! |♛Collecting⌚♛

ID: 4786381520

linkhttps://luka-group.github.io/ calendar_today20-01-2016 01:25:42

407 Tweet

2,2K Followers

620 Following

🌴Muhao Chen🌴 (@muhao_chen) 's Twitter Profile Photo

Please consider submitting to the AAAI spring Symposium on AI for Engineering and Scientific Discoveries; we will accept regular AAAI sized papers (maximum 8 pages including references) or short 2-page abstracts: sites.google.com/view/aaai-ss25…. • Paper/abstract Submission Deadline:

Fei Wang (@fwang_nlp) 's Twitter Profile Photo

𝗠𝘂𝗶𝗿𝗕𝗲𝗻𝗰𝗵 is officially accepted at #ICLR2025! 🎉 Recent VLMs/MLLMs such as LLaVA-OneVision, MM1.5, and MAmmoTH-VL have demonstrated significant progress on MuirBench.🚀 Excited to see how MuirBench continues to drive the innovation of VLMs! #AI #MachineLearning #VLM

Cognitive Computation Group (@cogcomp) 's Twitter Profile Photo

New interview with 🌴Muhao Chen🌴, former CCG postdoc, who talks with us about cats and hamsters, LLM safety, and far-flung national parks! ccgblog.seas.upenn.edu/2025/01/interv…

New interview with <a href="/muhao_chen/">🌴Muhao Chen🌴</a>, former CCG postdoc, who talks with us about cats and hamsters, LLM safety, and far-flung national parks! 
ccgblog.seas.upenn.edu/2025/01/interv…
Sheng Zhang (@sheng_zh) 's Twitter Profile Photo

🚀 Excited to share MetaScale, our latest work advancing LLM reasoning capabilities! MetaScale empowers GPT-4o to match or even surpass frontier reasoning models like o1, Claude-3.5-Sonnet, and o1-mini on the challenging Arena-Hard benchmark (lmarena.ai). Additionally, MetaScale

🚀 Excited to share MetaScale, our latest work advancing LLM reasoning capabilities! MetaScale empowers GPT-4o to match or even surpass frontier reasoning models like o1, Claude-3.5-Sonnet, and o1-mini on the challenging Arena-Hard benchmark (<a href="/lmarena_ai/">lmarena.ai</a>). Additionally, MetaScale
Wenjie Jacky Mo (@wenjie_jacky_mo) 's Twitter Profile Photo

Worried about backdoors in LLMs? 🌟 Check out our #NAACL2025 work on test-time backdoor mitigation! ✅ Black-box 📦 ✅ Plug-and-play 🛡️ We explore: → Defensive Demonstrations 🧪 → Self-generated Prefixes 🧩 → Self-refinement ✍️ 📄 arxiv.org/abs/2311.09763 🧵[1/n]

Worried about backdoors in LLMs?

🌟 Check out our #NAACL2025 work on test-time backdoor mitigation!

✅ Black-box 📦
✅ Plug-and-play 🛡️

We explore:
→ Defensive Demonstrations 🧪
→ Self-generated Prefixes 🧩
→ Self-refinement ✍️

📄 arxiv.org/abs/2311.09763

🧵[1/n]
🌴Muhao Chen🌴 (@muhao_chen) 's Twitter Profile Photo

🚨 Call for Papers! ACL 2025 🚨 LLM Security Workshop @ ACL 2025 (the first workshop of ACL SIGSEC) 🔐 Topics: Adversarial attacks, defenses, vulnerabilities, ethical & legal aspects, safe deployment of LLMs and more 📅 Submission Deadline: April 15, 2025 📍 August 1, 2025 in

Cognitive Computation Group (@cogcomp) 's Twitter Profile Photo

Excited to share our papers at #ICLR2025 in Singapore! Check out the summaries on our blog (ccgblog.seas.upenn.edu/2025/04/ccg-pa…), and then check out the papers at oral session 1B (BIRD) and poster session 2 (for all three)! Yu Feng, Xingyu Fu, Ben Zhou, 🌴Muhao Chen🌴, Dan Roth

Excited to share our papers at #ICLR2025 in Singapore!  Check out the summaries on our blog (ccgblog.seas.upenn.edu/2025/04/ccg-pa…), and then check out the papers at oral session 1B (BIRD) and poster session 2 (for all three)!
<a href="/AnnieFeng6/">Yu Feng</a>, <a href="/XingyuFu2/">Xingyu Fu</a>, <a href="/BenZhou96/">Ben Zhou</a>, <a href="/muhao_chen/">🌴Muhao Chen🌴</a>, <a href="/DanRothNLP/">Dan Roth</a>
Fei Wang (@fwang_nlp) 's Twitter Profile Photo

🎉 Excited to share that our paper, "MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding", will be presented at #ICLR2025!​ 📅 Date: April 24 🕒 Time: 3:00 PM 📍 Location: Hall 3 + Hall 2B #11 MuirBench challenges multimodal LLMs with diverse multi-image

🎉 Excited to share that our paper, "MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding", will be presented at #ICLR2025!​
📅 Date: April 24
🕒 Time: 3:00 PM
📍 Location: Hall 3 + Hall 2B #11
MuirBench challenges multimodal LLMs with diverse multi-image
Weidi(Eddy) Luo (@luoweidi84) 's Twitter Profile Photo

🚀Check our latest work accepted by ACL 2025 Main. AGrail: A Lifelong Agent Guardrail with Effective and Adaptive Safety Detection In this work: 🔍 We introduce Safe-OS, an online benchmark for OS agents that includes prompt injection attacks, environment-based attacks, and

Tinghui Zhu (@darthzhu_) 's Twitter Profile Photo

😴 Extending modality based on an LLM has been a common practice when we are talking about multimodal LLMs. ❓ Can it generalize to omni-modality? We study the effects of extending modality and ask three questions: arxiv.org/abs/2506.01872 #LLM #MLLM #OmniModality

DailyPapers (@huggingpapers) 's Twitter Profile Photo

Are we heading down the right path towards omni-modality? 🤔 This new paper explores the effects of extending modality in language models.

Are we heading down the right path towards omni-modality? 🤔

This new paper explores the effects of extending modality in language models.
jakedineenasu (@jakedineenasu) 's Twitter Profile Photo

🔍 Introducing QA-LIGN: A reflective alignment approach using a draft→reflection→revision pipeline. We create symbolic reward models that serve as both natural language critics & general reward models, bridging rule-based rewards and RLAIF. 📄 Paper: arxiv.org/pdf/2506.08123

🔍 Introducing QA-LIGN: A reflective alignment approach using a draft→reflection→revision pipeline. We create symbolic reward models that serve as both natural language critics &amp; general reward models, bridging rule-based rewards and RLAIF.

📄 Paper: arxiv.org/pdf/2506.08123