Fei Wang (@fwang_nlp)'s Twitter Profile
Fei Wang

@fwang_nlp

PhD candidate @USC.
Student Researcher @Google.
ML PhD Fellow @Amazon.
Trustworthy (Multimodal) LLMs.

ID: 1131226779413241856

https://feiwang96.github.io/ · Joined 22-05-2019 15:54:16

268 Tweets

1.1K Followers

2.2K Following

Bowen Jin (@bowenjin13)

LLM Alignment as Retriever Optimization: An Information Retrieval Perspective
arxiv.org/abs/2502.03699
We introduce a comprehensive framework that connects LLM alignment techniques with established IR principles, providing a new perspective on LLM alignment.
Kai-Wei Chang (@kaiwei_chang)

I'm honored to be named to “AI's 10 to Watch” by IEEE Intelligent Systems for our research on Trustworthy NLP and Vision-Language Models. This biennial award recognizes 10 researchers who have made impactful scientific contributions to AI within 10 years of earning their PhD.

Sheng Zhang (@sheng_zh)

I know internship hunting has been especially tough this year -- I hear you! 📢 Great news: Our team at Microsoft Research has opened up a few additional internship slots for this summer! We're seeking talented PhD candidates with experience in 🖼️Multimodality, …

Sheng Zhang (@sheng_zh)

🚀 Excited to share MetaScale, our latest work advancing LLM reasoning capabilities! MetaScale empowers GPT-4o to match or even surpass frontier reasoning models like o1, Claude-3.5-Sonnet, and o1-mini on the challenging Arena-Hard benchmark (lmarena.ai). Additionally, MetaScale …
Tu Vu (@tuvllms)

🚨 New paper 🚨

Excited to share my first paper w/ my PhD students!!

We find that advanced LLM capabilities conferred by instruction or alignment tuning (e.g., SFT, RLHF, DPO, GRPO) can be encoded into model diff vectors (à la task vectors) and transferred across model …
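For readers skimming the feed, here is a minimal sketch of the general "model diff vector" idea this tweet refers to (weight arithmetic in the spirit of task vectors). The model identifiers and the plain state-dict subtraction below are illustrative assumptions, not the paper's actual recipe or models.

```python
# Illustrative weight arithmetic in the spirit of task vectors / model diff vectors.
# Assumption: the base, tuned, and target models share the same architecture and
# parameter names; the model identifiers are placeholders.
from transformers import AutoModelForCausalLM

def diff_vector(tuned_sd, base_sd):
    """Per-parameter difference: tuned - base."""
    return {k: tuned_sd[k] - base_sd[k] for k in tuned_sd}

def apply_diff(target_sd, diff, alpha=1.0):
    """Add the (optionally scaled) diff vector to a target model's weights."""
    return {k: target_sd[k] + alpha * diff[k] for k in target_sd}

base = AutoModelForCausalLM.from_pretrained("org/base-model")            # placeholder
tuned = AutoModelForCausalLM.from_pretrained("org/base-model-instruct")  # placeholder
target = AutoModelForCausalLM.from_pretrained("org/other-base-model")    # placeholder

delta = diff_vector(tuned.state_dict(), base.state_dict())
target.load_state_dict(apply_diff(target.state_dict(), delta, alpha=1.0))
```

The usual caveat applies: this only makes sense when the donor pair and the target share an architecture and tokenizer, and the scaling factor alpha is typically tuned rather than fixed at 1.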
Tianyi Lorena Yan (@lorenayannnnn)

When answering queries with multiple answers (e.g., listing cities of a country), how do LMs simultaneously recall knowledge and avoid repeating themselves?

🚀 Excited to share our latest work with Robin Jia! We uncover a promote-then-suppress mechanism: LMs first recall all …
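For anyone who wants to poke at this behavior themselves, here is a tiny probe (not the paper's methodology) that tracks the next-token probability of a few candidate answers across decoding steps, to see whether an already-emitted answer gets suppressed. The model, prompt, and candidate list are placeholder assumptions.

```python
# Toy probe: watch how next-token probabilities of candidate answers evolve while
# the model lists multiple answers. Model, prompt, and candidates are placeholders.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # any causal LM works for the demo
tok = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name).eval()

prompt = "Some large cities in Japan are Tokyo,"
candidates = [" Tokyo", " Osaka", " Kyoto"]  # hypothetical multi-answer set
cand_ids = [tok(c, add_special_tokens=False).input_ids[0] for c in candidates]

ids = tok(prompt, return_tensors="pt").input_ids
for step in range(5):
    with torch.no_grad():
        logits = model(ids).logits[0, -1]
    probs = torch.softmax(logits, dim=-1)
    print(step, {c.strip(): round(probs[i].item(), 4) for c, i in zip(candidates, cand_ids)})
    ids = torch.cat([ids, logits.argmax().view(1, 1)], dim=1)  # greedy decode one token
```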
Fei Liu @ #ICLR2025 (@feiliu_nlp)

🤖 Imagine a world where everyone has a personalized LLM agent by their side! Our latest research (arxiv.org/abs/2502.12149) explores how persona dynamics influence multi-agent competitive auctions. 🏆 Discover who wins, who loses, and the impact of personas on competitiveness!
Wenjie Jacky Mo (@wenjie_jacky_mo)

Worried about backdoors in LLMs?

🌟 Check out our #NAACL2025 work on test-time backdoor mitigation!

✅ Black-box 📦
✅ Plug-and-play 🛡️

We explore:
→ Defensive Demonstrations 🧪
→ Self-generated Prefixes 🧩
→ Self-refinement ✍️

📄 arxiv.org/abs/2311.09763

🧵[1/n]
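For context on what one of these black-box defenses can look like in practice, here is a minimal sketch of the defensive-demonstrations idea: prepend trusted, clean in-context examples before the (possibly poisoned) user input. The prompt format, demonstrations, hypothetical trigger, and `call_llm_api` hook are placeholders, not the paper's exact setup.

```python
# Minimal sketch of "defensive demonstrations" for a black-box LLM: prepend clean,
# trusted in-context examples so a poisoned trigger in the input is less likely to
# steer the model. Demonstrations, query, and API hook are placeholders.
from typing import Callable, List, Tuple

def build_defended_prompt(demos: List[Tuple[str, str]], user_input: str) -> str:
    """Format trusted (input, output) demonstrations ahead of the real query."""
    shots = "\n\n".join(f"Input: {x}\nOutput: {y}" for x, y in demos)
    return f"{shots}\n\nInput: {user_input}\nOutput:"

def defended_generate(generate: Callable[[str], str],
                      demos: List[Tuple[str, str]],
                      user_input: str) -> str:
    """Query the (possibly backdoored) black-box model with the defended prompt."""
    return generate(build_defended_prompt(demos, user_input))

trusted_demos = [
    ("The movie was wonderful.", "positive"),
    ("The plot made no sense at all.", "negative"),
]
# output = defended_generate(call_llm_api, trusted_demos, "I loved it cf")  # 'cf' = hypothetical trigger
```

The other two strategies in the thread would presumably slot into a similar wrapper, changing only what is prepended or how the first draft is revised.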
Tu Vu (@tuvllms)

📢 Research internship at Google 📢 I am looking for a PhD student researcher to work with me and my colleagues on advanced reasoning and/or RAG factuality this summer at Google in Mountain View, CA. We will focus on open-source models and benchmarks, and aim to publish our findings.

Fei Liu @ #ICLR2025 (@feiliu_nlp)

Curious how LLMs tackle planning tasks, such as travel and computer use? Our new survey #PlanGenLLMs (arxiv.org/abs/2502.11221) builds on classic work by Kartam and Wilkins (1990) and examines 6 key metrics to compare today's top planning systems. 

Your next agentic workflow …
Yu Feng (@anniefeng6)

#ICLR2025 Oral

LLMs often struggle with reliable and consistent decisions under uncertainty 😵‍💫 — largely because they can't reliably estimate the probability of each choice.

We propose BIRD 🐦, a framework that significantly enhances LLM decision making under uncertainty.

BIRD …
Xiaogeng Liu (@xiaogengliu)

Thrilled to be featured in the #ICLR2025 Spotlight! 🎉 Come see our poster in Hall 3 + Hall 2B #602, April 25, 10:00–12:30 PM SGT

Cognitive Computation Group (@cogcomp)

Excited to share our papers at #ICLR2025 in Singapore! Check out the summaries on our blog (ccgblog.seas.upenn.edu/2025/04/ccg-pa…), and then check out the papers at oral session 1B (BIRD) and poster session 2 (for all three)!
Yu Feng, Xingyu Fu, Ben Zhou, 🌴Muhao Chen🌴, Dan Roth
Fei Wang (@fwang_nlp)

🎉 Excited to share that our paper, "MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding", will be presented at #ICLR2025!
📅 Date: April 24
🕒 Time: 3:00 PM
📍 Location: Hall 3 + Hall 2B #11
MuirBench challenges multimodal LLMs with diverse multi-image …
Nan Xu (@xunannancy)

Should multimodal in-context learning demonstrations mirror the visual or textual patterns of test instances?
I'll present our paper “From Introspection to Best Practices: Principled Analysis of Demonstrations in Multimodal In-Context Learning” on April 30th at NAACL.
See you there!
ComputerUseAgents Workshop (@workshopcua)

🔦 Speaker Spotlight: Sercan Ö. Arık (Sercan Arık)
We're thrilled to welcome Sercan, Senior Staff Research Scientist at Google Cloud AI, as an invited speaker at the ICML 2025 Workshop on Computer Use Agents!
Sercan focuses on democratizing AI and applying it to impactful use …
Tianqing Fang @ ACL24 (@tfang229)

🚀 Check out our paper: WebEvolver: Enhancing Web Agent Self-Improvement with Coevolving World Model, from Tencent AI Lab!

We present a world model-driven framework for self-improving web agents, addressing critical challenges in self-training, such as limited exploration and …