dilek hakkani-tur (@dilekhakkanitur)'s Twitter Profile
Professor of Computer Science @UofIllinois, @convai_uiuc, @uiuc_nlp, @IllinoisCDS

ID: 1007767226

13-12-2012 01:54:31

98 Tweets

520 Followers

245 Following

Mercan Topkara (@mercantopkara)

All of us—teens and parents—could use a little yoga time this weekend! Check out this quick guide (created by AI, improved by yours truly). Let’s stretch, breathe, and recharge together. 🧘🏻‍♀️☮️🫀🧠 hashtagirl.com/share/NjBlMjky…

dilek hakkani-tur (@dilekhakkanitur)

While persuasive models are promising for social good, they can also be misused towards harmful behavior. Recent work by Beyza Bozdag and Shuhaib Mehri aims to assess LLM persuasiveness and susceptibility towards persuasion.

Stanford NLP Group (@stanfordnlp)

For this week’s NLP Seminar, we are thrilled to host Yun-Nung Vivian Chen to talk about Optimizing Interaction and Intelligence — Multi-Agent Simulation and Collaboration for Personalized Marketing and Advanced Reasoning!

When: 3/6 Thurs 11am PT
Non-Stanford affiliates registration form

Vardhan Dongre (@vardhan_dongre)

New Blog Alert: The Future of Human-Robot Conversation! We explore the evolution of embodied conversational agents beyond simple command followers. How will robots develop theory of mind, natural turn-taking, and truly understand human intentions? 🤖💬 #EmbodiedAI #HRI (1/2)

Siva Reddy (@sivareddyg)

LLM alignment doesn't transfer to Web Agents. SafeArena is a simple web environment and testbed to test the safety of agents, built on WebArena. A huge team effort that was highly self-driven 💪 safearena.github.io

SIGdial (@sigdial)

Our paper submission deadline is approaching fast! ✍️

Abstract deadline: 21st April
Paper deadline: 28th April

Come and join us in beautiful Avignon, France, to discuss discourse and dialogue 🎉

We invite submissions of original research (long papers, short papers, and demos)

Gokhan Tur (@tur_gokhan)

This is an important milestone for enabling LLM-based agents. Reward is all you need for Tool Learning! GRPO achieves significant improvements over base and SFT models for BFCL v3, API-Bank and Bamboogle agentic benchmark tasks. Congratulations Emre Can Acikgoz and Cheng Qian

Emre Can Acikgoz (@emrecanacikgoz)

What are the capabilities of current Conversational Agents?

What challenges persist and what actually we should expect from these agents as a next step?

🚀We are excited to share our recent survey: ✨ A Desideratum for Conversational Agents: Capabilities, Challenges, and Future

Siva Reddy (@sivareddyg)

Incredibly proud of my students Ada Tur and Gaurav Kamath for winning a SAC award at #NAACL2025 for their work on assessing how LLMs model constituent shifts. Humans have a tendency to move heavier constituents towards the end of the sentence. While LLMs unsurprisingly show

Emre Can Acikgoz (@emrecanacikgoz)

🚀Excited to share our new evaluation paper "TD-Eval: Revisiting Task-Oriented Dialogue Evaluation by Combining Turn-Level Precision with Dialogue-Level Comparisons"! 🔄

🤖⚔️TOD systems have rapidly evolved thanks to LLMs, but traditional metrics remain insufficient in

Sagnik Mukherjee (@saagnikkk)

🚀Our ICML 2025 paper introduces "Premise-Augmented Reasoning Chains" - a structured approach to induce explicit dependencies in reasoning chains.

By revealing the dependencies within chains, we significantly improve how LLM reasoning can be verified.

🧵[1/n]

SIGdial (@sigdial)

The regular submission deadline for SIGdial 2025 has now passed...

But we are still welcoming submissions through ACLRollingReview 🎉

ARR Commitment Deadline: June 6th

Acceptance notifications will be on June 20th, and then SIGdial will be held in Avignon, France: August 25th - 27th

Samuel Schapiro (@samschapiro)

A huge thanks to our co-authors Sumuk 🤗, dilek hakkani-tur, Lav Varshney, and Jonah Black for their contributions and guidance on this work! Very excited to continue integrating our understanding of creativity into modern AI theory and practice

Sagnik Mukherjee (@saagnikkk)

🚨 Paper Alert: “RL Finetunes Small Subnetworks in Large Language Models”

From DeepSeek V3 Base to DeepSeek R1 Zero, a whopping 86% of parameters were NOT updated during RL training 😮😮
And this isn’t a one-off. The pattern holds across RL algorithms and models.
🧵A Deep Dive

Ishika Agarwal (@wonderingishika)

Would models know more about Indian food in Hindi and Turkey’s history in Turkish? Does the language of a question affect an LLM’s answer?

✨Yes!✨

Beyza Bozdag and I are excited to announce our newest preprint in which we explore “Language Specific Knowledge (LSK)”.

Emre Can Acikgoz (@emrecanacikgoz)

Happy to share that 🔄TD-Eval is accepted to #SIGDIAL2025 as an oral presentation! Hope to see you there at SIGdial and chat more on TOD Agent evaluation, which will definitely play a big role in next generation conversational agents.