Sean Xuefeng Du (@xuefeng_du) 's Twitter Profile
Sean Xuefeng Du

@xuefeng_du

Ph.D. student @WisconsinCS, fellow @JaneStreetGroup, spending time @GoogleAI | reliable machine learning 🤖 ⛑️

ID: 1095271887465111552

Link: http://d12306.github.io

Joined: 12-02-2019 10:42:21

208 Tweets

1.1K Followers

2.2K Following

Wendy (@wendyweeww) 's Twitter Profile Photo

Hallucination is baked into LLMs. Can't be eliminated, it's how they work. Dario Amodei says LLMs hallucinate less than humans. But it's not about less or more. It's the differing & dangerous nature of the hallucination, making it unlikely LLMs will cause mass unemployment (1/n)

Sharon Y. Li (@sharonyixuanli) 's Twitter Profile Photo

Excited to share our #NeurIPS2025 paper: Visual Instruction Bottleneck Tuning (Vittle). Multimodal LLMs do great in-distribution, but often break in the wild. Scaling data or models helps, but it’s costly. 💡 Our work is inspired by the Information Bottleneck (IB) principle,

Sharon Y. Li (@sharonyixuanli) 's Twitter Profile Photo

Collecting large human preference data is expensive, the biggest bottleneck in reward modeling. In our #NeurIPS2025 paper, we introduce latent-space synthesis for preference data, which is 18× faster and uses a network that’s 16,000× smaller (0.5M vs 8B parameters) than

Sharon Y. Li (@sharonyixuanli) 's Twitter Profile Photo

Your LVLM says: “There’s a cat on the table.” But… there’s no cat in the image. Not even a whisker. This is object hallucination, one of the most persistent reliability failures in multi-modal language models. Our new #NeurIPS2025 paper introduces GLSim, a simple but

Sharon Y. Li (@sharonyixuanli) 's Twitter Profile Photo

Deception is one of the most concerning behaviors that advanced AI systems can display. If you are not concerned yet, this paper might change your view. We built a multi-agent framework to study: 👉 How deceptive behaviors can emerge and evolve in LLM agents during realistic

Sean Xuefeng Du (@xuefeng_du) 's Twitter Profile Photo

If you do research on responsible and trustworthy foundation models, please consider attending and submitting your work to our NeurIPS Conference workshop! Thanks to Canyu Chen and the amazing team for inviting and organizing 🥳

Rob Nowak (@rdnowak) 's Twitter Profile Photo

Please share widely. The UW–Madison ECE Department is recruiting at the assistant, associate, or full professor level in foundational AI and machine learning tinyurl.com/4kwh8mvm

Sharon Y. Li (@sharonyixuanli) 's Twitter Profile Photo

📢 We are excited to release the call for papers for #ICML2026, held in Seoul, South Korea next year! 📅 Key Dates: Abstract deadline: Jan 23, 2026 AOE; Full paper deadline: Jan 28, 2026 AOE. Main Track ➜ icml.cc/Conferences/20… Position Papers ➜ icml.cc/Conferences/20…

Schmidt Sciences (@schmidtsciences) 's Twitter Profile Photo

We're excited to welcome 28 new AI2050 Fellows! This 4th cohort of researchers are pursuing projects that include building AI scientists, designing trustworthy models, and improving biological and medical research, among other areas. buff.ly/riGLyyj

Sean Xuefeng Du (@xuefeng_du) 's Twitter Profile Photo

📣 If you perform interdisciplinary research and missed the AI for X fellowship, please consider applying for the Eric & Wendy Schmidt AI in Science Postdoc Fellowship (Deadline: 12/31/2025). 🚀 NTU is part of this fellowship network! Email me if interested. schmidtsciences.org/ai-in-science/

Sean Xuefeng Du (@xuefeng_du) 's Twitter Profile Photo

🎉 Honored to be selected for the AAAI 26 New Faculty Highlights program! I’ll showcase research on 🤖 AI reliability: OOD detection, LLM hallucination & alignment in person. See you at #AAAI26 in Singapore in January next year!

Jenn Wortman Vaughan (@jennwvaughan) 's Twitter Profile Photo

Spread the word! 📢 The FATE (Fairness, Accountability, Transparency, and Ethics) group at Microsoft Research in NYC is hiring interns and postdocs to start in summer 2026! 🎉 Apply by *December 15* for full consideration.

Karoline Leavitt (@presssec) 's Twitter Profile Photo

We know that Americans are still hurting from the 40-year high inflation caused by Joe Biden and the Democrats, but President Trump is making significant progress to fix it, and he won’t stop working until he solves it: ✅ The latest jobs report showed the American economy

Saining Xie (@sainingxie) 's Twitter Profile Photo

it may seem like an ordinary day, but it could become the strangest moment in peer review and open science. please please please treat our community with care. it’s already so fragile. don’t let it die.

Canyu Chen (@canyuchen3) 's Twitter Profile Photo

🔥 Welcome to our #NeurIPS2025 ResponsibleFM Workshop today! 🗓️ Nov 30th 1pm-8pm CST 📍 Hilton Mexico City Reforma (Room: Don Alberto 1) 🌐 responsible-fm.github.io 🥳 Looking forward to learning fresh insights on Socially Responsible and Trustworthy Foundation Models from

Yiyou Sun (@yiyousun) 's Twitter Profile Photo

I’ll be in San Diego for #NeurIPS2025 (Dec 2–8)! Come check out our work, and feel free to DM if you’d like to meet up or chat ☕️ 🥳 I’m currently on the job market. If you know of opportunities that might be a good fit or can offer a referral, please DM me! My recent research

Boyang "Albert" Li (@albertboyangli) 's Twitter Profile Photo

The College of Computing and Data Science, Nanyang Technological University, Singapore is hiring for faculty positions at all levels. Please feel free to repost.
