dilek hakkani-tur (@dilekhakkanitur) Twitter Tweets • TwiCopy

Mercan Topkara

6 months ago

All of us—teens and parents—could use a little yoga time this weekend! Check out this quick guide (created by AI, improved by yours truly). Let’s stretch, breathe, and recharge together. 🧘🏻‍♀️☮️🫀🧠 hashtagirl.com/share/NjBlMjky…

thumb_up_off_alt4

chat_bubble_outline1

repeat2

shareShare

dilek hakkani-tur

@dilekhakkanitur

6 months ago

While persuasive models are promising for social good, they can also be misused towards harmful behavior. Recent work by Beyza Bozdag and Shuhaib Mehri aims to assess LLM persuasiveness and susceptibility towards persuasion.

thumb_up_off_alt13

chat_bubble_outline0

repeat3

shareShare

Stanford NLP Group

@stanfordnlp

6 months ago

For this week’s NLP Seminar, we are thrilled to host Yun-Nung Vivian Chen to talk about Optimizing Interaction and Intelligence — Multi-Agent Simulation and Collaboration for Personalized Marketing and Advanced Reasoning! When: 3/6 Thurs 11am PT Non-Stanford affiliates registration form

For this week’s NLP Seminar, we are thrilled to host <a href="/YunNungChen/">Yun-Nung Vivian Chen</a> to talk about Optimizing Interaction and Intelligence — Multi-Agent Simulation and Collaboration for Personalized Marketing and Advanced Reasoning!

When: 3/6 Thurs 11am PT
Non-Stanford affiliates registration form

thumb_up_off_alt62

chat_bubble_outline2

repeat11

shareShare

Vardhan Dongre

@vardhan_dongre

6 months ago

New Blog Alert: The Future of Human-Robot Conversation! We explore the evolution of embodied conversational agents beyond simple command followers. How will robots develop theory of mind, natural turn-taking, and truly understand human intentions? 🤖💬 #EmbodiedAI #HRI (1/2)

thumb_up_off_alt11

chat_bubble_outline1

repeat2

shareShare

Siva Reddy

@sivareddyg

6 months ago

LLM alignment doesn't transfer to Web Agents. SafeArena is a simple web environment and testbed to test the safety of agents, built on WebArena. A huge team effort that was highly self-driven 💪 safearena.github.io

thumb_up_off_alt44

chat_bubble_outline1

repeat14

shareShare

SIGdial

@sigdial

6 months ago

Our paper submission deadline is approaching fast! ✍️ Abstract deadline: 21st April Paper deadline 28th April Come and join us in beautiful Avignon, France, to discuss discourse and dialogue 🎉 We invite submissions of original research (long papers, short papers, and demos)

thumb_up_off_alt7

chat_bubble_outline1

repeat4

shareShare

dilek hakkani-tur

@dilekhakkanitur

5 months ago

Fantastic work by Sumuk 🤗 and all. Great collaboration between Hugging Face and ConvAI@UIUC !

thumb_up_off_alt8

chat_bubble_outline0

repeat1

shareShare

Sumuk

@sumukx

5 months ago

You can read about our entire methodology (+ other cool results!) here: arxiv.org/abs/2504.01833

thumb_up_off_alt10

chat_bubble_outline0

repeat3

shareShare

Gokhan Tur

@tur_gokhan

5 months ago

This is an important milestone for enabling LLM-based agents. Reward is all you need for Tool Learning! GRPO achieves significant improvements over base and SFT models for BFCL v3, API-Bank and Bamboogle agentic benchmark tasks. Congratulations Emre Can Acikgoz and Cheng Qian

thumb_up_off_alt11

chat_bubble_outline0

repeat2

shareShare

Emre Can Acikgoz

@emrecanacikgoz

4 months ago

What are the capabilities of current Conversational Agents? What challenges persist and what actually we should expect from these agents as a next step? 🚀We are excited to share our recent survey: ✨ A Desideratum for Conversational Agents: Capabilities, Challenges, and Future

thumb_up_off_alt23

chat_bubble_outline2

repeat11

shareShare

Siva Reddy

@sivareddyg

4 months ago

Incredibly proud of my students Ada Tur and Gaurav Kamath for winning a SAC award at #NAACL2025 for their work on assessing how LLMs model constituent shifts. Humans have a tendency to move heavier constituents towards the end of the sentence. While LLMs unsurprisingly show

thumb_up_off_alt65

chat_bubble_outline1

repeat10

shareShare

Emre Can Acikgoz

@emrecanacikgoz

4 months ago

🚀Excited to share our new evaluation paper "TD-Eval: Revisiting Task-Oriented Dialogue Evaluation by Combining Turn-Level Precision with Dialogue-Level Comparisons"! 🔄 🤖⚔️TOD systems have rapidly evolved thanks to LLMs, but traditional metrics remain insufficient in

thumb_up_off_alt10

chat_bubble_outline1

repeat2

shareShare

Sagnik Mukherjee

@saagnikkk

4 months ago

🚀Our ICML 2025 paper introduces "Premise-Augmented Reasoning Chains" - a structured approach to induce explicit dependencies in reasoning chains. By revealing the dependencies within chains, we significantly improve how LLM reasoning can be verified. 🧵[1/n]

thumb_up_off_alt69

chat_bubble_outline1

repeat22

shareShare

dilek hakkani-tur

@dilekhakkanitur

4 months ago

A must read paper by Beyza Bozdag and others! 😀 ConvAI@UIUC

thumb_up_off_alt23

chat_bubble_outline0

repeat3

shareShare

SIGdial

@sigdial

4 months ago

The regular submission deadline for SIGdial 2025 has now passed... But we are still welcoming submissions through ACLRollingReview 🎉 ARR Commitment Deadline: June 6th Acceptance notifications will be on June 20th, and then SIGdial will be held in Avignon, France: August 25th - 27th

The regular submission deadline for SIGdial 2025 has now passed...

But we are still welcoming submissions through <a href="/ReviewAcl/">ACLRollingReview</a> 🎉

ARR Commitment Deadline: June 6th

Acceptance notifications will be on June 20th, and then SIGdial will be held in Avignon, France: August 25th - 27th

thumb_up_off_alt8

chat_bubble_outline0

repeat2

shareShare

Samuel Schapiro

@samschapiro

4 months ago

A huge thanks to our co-authors Sumuk 🤗, dilek hakkani-tur, Lav Varshney, and Jonah Black for their contributions and guidance on this work! Very excited to continue integrating our understanding of creativity into modern AI theory and practice

thumb_up_off_alt9

chat_bubble_outline0

repeat1

shareShare

Sagnik Mukherjee

@saagnikkk

4 months ago

🚨 Paper Alert: “RL Finetunes Small Subnetworks in Large Language Models” From DeepSeek V3 Base to DeepSeek R1 Zero, a whopping 86% of parameters were NOT updated during RL training 😮😮 And this isn’t a one-off. The pattern holds across RL algorithms and models. 🧵A Deep Dive

thumb_up_off_alt844

chat_bubble_outline17

repeat125

shareShare

Ishika Agarwal

@wonderingishika

3 months ago

Would models know more about Indian food in Hindi and Turkey’s history in Turkish? Does the language of a question affect an LLM’s answer? ✨Yes!✨ Beyza Bozdag and I are excited to announce our newest preprint in which we explore “Language Specific Knowledge (LSK)”.

Would models know more about Indian food in Hindi and Turkey’s history in Turkish? Does the language of a question affect an LLM’s answer?

✨Yes!✨

<a href="/nbbozdag/">Beyza Bozdag</a> and I are excited to announce our newest preprint in which we explore “Language Specific Knowledge (LSK)”.

thumb_up_off_alt142

chat_bubble_outline8

repeat22

shareShare

Emre Can Acikgoz

@emrecanacikgoz

2 months ago

Happy to share that 🔄TD-Eval is accepted to #SIGDIAL2025 as an oral presentation! Hope to see you there at SIGdial and chat more on TOD Agent evaluation, which will definitely play a big role in next generation conversational agents.

thumb_up_off_alt18

chat_bubble_outline0

repeat2

shareShare