ConvAI@UIUC (@convai_uiuc) 's Twitter Profile
ConvAI@UIUC

@convai_uiuc

Conversational AI | NLP | @dilekhakkanitur @tur_gokhan | @IllinoisCDS @uiuc_nlp

ID: 1837416522572050432

linkhttps://uiuc-conversational-ai-lab.github.io/ calendar_today21-09-2024 09:00:59

72 Tweet

319 Followers

831 Following

Emre Can Acikgoz (@emrecanacikgoz) 's Twitter Profile Photo

Given its promising results in reasoning-oriented tasks, how can we leverage RL with tool-specific reward functions for improved tool learning? How is the performance of smart rewards compared to SFT? Does it also lead to better thinking skills in tool-calling tasks? Introducing

Gokhan Tur (@tur_gokhan) 's Twitter Profile Photo

This is an important milestone for enabling LLM-based agents. Reward is all you need for Tool Learning! GRPO achieves significant improvements over base and SFT models for BFCL v3, API-Bank and Bamboogle agentic benchmark tasks. Congratulations Emre Can Acikgoz and Cheng Qian

Gokhan Tur (@tur_gokhan) 's Twitter Profile Photo

What do we want from "Conversational Agents" on top of language agents? What is missing in the current Conversational Agent systems? Here is our desideratum with a comprehensive survey of the recent advances in the field. Bonus is a live github collection of new papers organized

ConvAI@UIUC (@convai_uiuc) 's Twitter Profile Photo

🚀🤖 Check out Emre's cool new survey on conversational agents with live github collection of new papers organized in an easy to follow taxonomy!

Sagnik Mukherjee (@saagnikkk) 's Twitter Profile Photo

🚀Our ICML 2025 paper introduces "Premise-Augmented Reasoning Chains" - a structured approach to induce explicit dependencies in reasoning chains. By revealing the dependencies within chains, we significantly improve how LLM reasoning can be verified. 🧵[1/n]

🚀Our ICML 2025 paper introduces "Premise-Augmented Reasoning Chains" - a structured approach to induce explicit dependencies in reasoning chains. 

By revealing the dependencies within chains, we significantly improve how LLM reasoning can be verified.

🧵[1/n]
Sumuk (@sumukx) 's Twitter Profile Photo

super excited to see yourbench converge to be the default generative benchmarking / synthetic data creation solution for llms 💛

ConvAI@UIUC (@convai_uiuc) 's Twitter Profile Photo

Check out Beyza's awesome survey on LLM persuasion 🤖 with clear definitions of each persuasive aspect and key challenges and future directions 🎉

Shuhaib Mehri (@shuhaibmehri) 's Twitter Profile Photo

Excited to share our survey on computational persuasion - check it out to learn more about 🤖 AI as Persuader, 🎯AI as Persuadee, and ⚖️AI as Persuasion Judge!

Sagnik Mukherjee (@saagnikkk) 's Twitter Profile Photo

🚨 Paper Alert: “RL Finetunes Small Subnetworks in Large Language Models” From DeepSeek V3 Base to DeepSeek R1 Zero, a whopping 86% of parameters were NOT updated during RL training 😮😮 And this isn’t a one-off. The pattern holds across RL algorithms and models. 🧵A Deep Dive

🚨 Paper Alert: “RL Finetunes Small Subnetworks in Large Language Models”

From DeepSeek V3 Base to DeepSeek R1 Zero, a whopping 86% of parameters were NOT updated during RL training 😮😮
And this isn’t a one-off. The pattern holds across RL algorithms and models.
🧵A Deep Dive
Beyza Bozdag @ NAACL’25 (@nbbozdag) 's Twitter Profile Photo

Would models know more about Indian food in Hindi and Turkey’s history in Turkish? Does the language of a question affect an LLM’s answer? ✨Yes!✨ Ishika Agarwal and I are excited to announce our newest preprint in which we explore “Language Specific Knowledge (LSK)”.

Would models know more about Indian food in Hindi and Turkey’s history in Turkish? Does the language of a question affect an LLM’s answer?

✨Yes!✨

<a href="/wonderingishika/">Ishika Agarwal</a> and I are excited to announce our newest preprint in which we explore “Language Specific Knowledge (LSK)”.
Ishika Agarwal (@wonderingishika) 's Twitter Profile Photo

Would models know more about Indian food in Hindi and Turkey’s history in Turkish? Does the language of a question affect an LLM’s answer? ✨Yes!✨ Beyza Bozdag and I are excited to announce our newest preprint in which we explore “Language Specific Knowledge (LSK)”.

Would models know more about Indian food in Hindi and Turkey’s history in Turkish? Does the language of a question affect an LLM’s answer?

✨Yes!✨

<a href="/nbbozdag/">Beyza Bozdag</a> and I are excited to announce our newest preprint in which we explore “Language Specific Knowledge (LSK)”.
Ishika Agarwal (@wonderingishika) 's Twitter Profile Photo

[7/7] We are grateful to our advisor dilek hakkani-tur, special shoutout to Sagnik Mukherjee and Priyanka Kargupta, and other ConvAI@UIUC lab members 🎉 📄Paper: arxiv.org/pdf/2505.14990 💻Code: github.com/agarwalishika/…

The AI Timeline (@theaitimeline) 's Twitter Profile Photo

Reinforcement Learning Finetunes Small Subnetworks in Large Language Models Author's Explanation: x.com/saagnikkk/stat… Overview: Reinforcement learning substantially improves LLMs through an intrinsic phenomenon of parameter update sparsity, where only 5-30% of parameters

Reinforcement Learning Finetunes Small Subnetworks in Large Language Models

Author's Explanation:
x.com/saagnikkk/stat…

Overview:
Reinforcement learning substantially improves LLMs through an intrinsic phenomenon of parameter update sparsity, where only 5-30% of parameters
ACL 2025 (@aclmeeting) 's Twitter Profile Photo

Did you know that papers from the journal Computational Linguistics will also be featured at ACL 2025? 🤩 This means even more cutting-edge NLP research to explore in Vienna! Don't miss out! #ACL2025NLP #ComputationalLinguistics #NLProc 2025.aclweb.org/program/cl_pap…