ConvAI@UIUC (@convai_uiuc) Twitter Tweets • TwiCopy

Gate.io

5 hours ago

🔥The 9th Round of Easy Loan, Earn $40 Reward is in progress❗️ ⏰ Promotion Period: January 15th - Feburary 15th, 2025 👉 Register now and check more details at gate.io/campaigns/358

thumb_up_off_alt34

chat_bubble_outline39

repeat6

shareShare

Given its promising results in reasoning-oriented tasks, how can we leverage RL with tool-specific reward functions for improved tool learning? How is the performance of smart rewards compared to SFT? Does it also lead to better thinking skills in tool-calling tasks? Introducing

thumb_up_off_alt14

chat_bubble_outline0

repeat2

shareShare

Gokhan Tur

@tur_gokhan

3 months ago

This is an important milestone for enabling LLM-based agents. Reward is all you need for Tool Learning! GRPO achieves significant improvements over base and SFT models for BFCL v3, API-Bank and Bamboogle agentic benchmark tasks. Congratulations Emre Can Acikgoz and Cheng Qian

thumb_up_off_alt11

chat_bubble_outline0

repeat2

shareShare

Gokhan Tur

@tur_gokhan

3 months ago

What do we want from "Conversational Agents" on top of language agents? What is missing in the current Conversational Agent systems? Here is our desideratum with a comprehensive survey of the recent advances in the field. Bonus is a live github collection of new papers organized

thumb_up_off_alt12

chat_bubble_outline0

repeat3

shareShare

ConvAI@UIUC

@convai_uiuc

3 months ago

🚀🤖 Check out Emre's cool new survey on conversational agents with live github collection of new papers organized in an easy to follow taxonomy!

thumb_up_off_alt5

chat_bubble_outline0

repeat0

shareShare

Sagnik Mukherjee

@saagnikkk

3 months ago

🚀Our ICML 2025 paper introduces "Premise-Augmented Reasoning Chains" - a structured approach to induce explicit dependencies in reasoning chains. By revealing the dependencies within chains, we significantly improve how LLM reasoning can be verified. 🧵[1/n]

thumb_up_off_alt69

chat_bubble_outline1

repeat22

shareShare

ConvAI@UIUC

@convai_uiuc

3 months ago

Checkout this new work by Sagnik Mukherjee on premise augmented reasoning chains!

thumb_up_off_alt4

chat_bubble_outline0

repeat0

shareShare

Sumuk

@sumukx

2 months ago

super excited to see yourbench converge to be the default generative benchmarking / synthetic data creation solution for llms 💛

thumb_up_off_alt10

chat_bubble_outline0

repeat3

shareShare

ConvAI@UIUC

@convai_uiuc

2 months ago

Check out Beyza's awesome survey on LLM persuasion 🤖 with clear definitions of each persuasive aspect and key challenges and future directions 🎉

thumb_up_off_alt7

chat_bubble_outline0

repeat0

shareShare

Shuhaib Mehri

@shuhaibmehri

2 months ago

Excited to share our survey on computational persuasion - check it out to learn more about 🤖 AI as Persuader, 🎯AI as Persuadee, and ⚖️AI as Persuasion Judge!

thumb_up_off_alt9

chat_bubble_outline0

repeat1

shareShare

dilek hakkani-tur

@dilekhakkanitur

2 months ago

A must read paper by Beyza Bozdag and others! 😀 ConvAI@UIUC

thumb_up_off_alt23

chat_bubble_outline0

repeat3

shareShare

ConvAI@UIUC

@convai_uiuc

2 months ago

Congratulations Emre 🎉🎉 See you in Vienna 🇦🇹

thumb_up_off_alt8

chat_bubble_outline0

repeat0

shareShare

Sagnik Mukherjee

@saagnikkk

2 months ago

🚨 Paper Alert: “RL Finetunes Small Subnetworks in Large Language Models” From DeepSeek V3 Base to DeepSeek R1 Zero, a whopping 86% of parameters were NOT updated during RL training 😮😮 And this isn’t a one-off. The pattern holds across RL algorithms and models. 🧵A Deep Dive

thumb_up_off_alt844

chat_bubble_outline17

repeat125

shareShare

Sagnik Mukherjee

@saagnikkk

2 months ago

Paper - arxiv.org/abs/2505.11711 Work done with amazing collaborator Lifan Yuan and advised by our amazing advisors dilek hakkani-tur and Hao Peng

thumb_up_off_alt46

chat_bubble_outline0

repeat3

shareShare

Beyza Bozdag @ NAACL’25

@nbbozdag

2 months ago

Would models know more about Indian food in Hindi and Turkey’s history in Turkish? Does the language of a question affect an LLM’s answer? ✨Yes!✨ Ishika Agarwal and I are excited to announce our newest preprint in which we explore “Language Specific Knowledge (LSK)”.

Would models know more about Indian food in Hindi and Turkey’s history in Turkish? Does the language of a question affect an LLM’s answer?

✨Yes!✨

<a href="/wonderingishika/">Ishika Agarwal</a> and I are excited to announce our newest preprint in which we explore “Language Specific Knowledge (LSK)”.

thumb_up_off_alt42

chat_bubble_outline2

repeat4

shareShare

Ishika Agarwal

@wonderingishika

2 months ago

Would models know more about Indian food in Hindi and Turkey’s history in Turkish? Does the language of a question affect an LLM’s answer? ✨Yes!✨ Beyza Bozdag and I are excited to announce our newest preprint in which we explore “Language Specific Knowledge (LSK)”.

Would models know more about Indian food in Hindi and Turkey’s history in Turkish? Does the language of a question affect an LLM’s answer?

✨Yes!✨

<a href="/nbbozdag/">Beyza Bozdag</a> and I are excited to announce our newest preprint in which we explore “Language Specific Knowledge (LSK)”.

thumb_up_off_alt142

chat_bubble_outline8

repeat22

shareShare

Ishika Agarwal

@wonderingishika

2 months ago

[7/7] We are grateful to our advisor dilek hakkani-tur, special shoutout to Sagnik Mukherjee and Priyanka Kargupta, and other ConvAI@UIUC lab members 🎉 📄Paper: arxiv.org/pdf/2505.14990 💻Code: github.com/agarwalishika/…

thumb_up_off_alt8

chat_bubble_outline0

repeat1

shareShare

Beyza Bozdag @ NAACL’25

@nbbozdag

2 months ago

[7/7] We are grateful to our advisor dilek hakkani-tur, special shoutout to Sagnik Mukherjee and Priyanka Kargupta, and other ConvAI@UIUC lab members 🎉 📄Paper: arxiv.org/pdf/2505.14990 💻Code: github.com/agarwalishika/…

thumb_up_off_alt5

chat_bubble_outline0

repeat1

shareShare

The AI Timeline

@theaitimeline

2 months ago

Reinforcement Learning Finetunes Small Subnetworks in Large Language Models Author's Explanation: x.com/saagnikkk/stat… Overview: Reinforcement learning substantially improves LLMs through an intrinsic phenomenon of parameter update sparsity, where only 5-30% of parameters

thumb_up_off_alt11

chat_bubble_outline1

repeat2

shareShare

Sagnik Mukherjee

@saagnikkk

2 months ago

Thanks for covering our work !

thumb_up_off_alt10

chat_bubble_outline1

repeat1

shareShare

ACL 2025

@aclmeeting

a month ago

Did you know that papers from the journal Computational Linguistics will also be featured at ACL 2025? 🤩 This means even more cutting-edge NLP research to explore in Vienna! Don't miss out! #ACL2025NLP #ComputationalLinguistics #NLProc 2025.aclweb.org/program/cl_pap…

thumb_up_off_alt8

chat_bubble_outline0

repeat4

shareShare