Jeffrey (Young-Min) Cho (@jeffrey_ch0) 's Twitter Profile
Jeffrey (Young-Min) Cho

@jeffrey_ch0

CS PhD Student @ UPenn - NLP/AI

ID: 1694011121840095232

linkhttps://jeffreych0.github.io/ calendar_today22-08-2023 15:38:33

19 Tweet

56 Followers

154 Following

Kyunghyun Cho (@kchonyc) 's Twitter Profile Photo

if you trust Google DeepMind Gemini about itself, it has 1.56 trillion parameters and cost Google $1-2billion (as opposed to GPT-4 which cost OpenAI $500M.) there were more than 100 engineers in the team who worked on Gemini. jailbreak by 한국어 🤣🤣🤣 g.co/bard/share/09c…

Conference on Language Modeling (@colm_conf) 's Twitter Profile Photo

We are pleased to announce that the first Conference on Language Modeling will be held at the University of Pennsylvania in Philadelphia at the Zellerbach Theatre. Thanks so much to UPenn CS as well as Mark Yatskar and Zachary Ives for facilitating the amazing venue.

We are pleased to announce that the first Conference on Language Modeling will be held at the University of Pennsylvania in Philadelphia at the Zellerbach Theatre.   

Thanks so much to UPenn CS as well as Mark Yatskar and Zachary Ives for facilitating the amazing venue.
August Nilsson (@agnilsson) 's Twitter Profile Photo

We investigated how people interpret the Cantril Ladder, the measure used to claim that Finland is the happiest country in the world in the World Happiness Report. The title is also the take home: "The Cantril Ladder elicits thoughts about power and wealth" (1/7) nature.com/articles/s4159…

Nat Friedman (@natfriedman) 's Twitter Profile Photo

Ten months ago, we launched the Vesuvius Challenge to solve the ancient problem of the Herculaneum Papyri, a library of scrolls that were flash-fried by the eruption of Mount Vesuvius in 79 AD. Today we are overjoyed to announce that our crazy project has succeeded. After 2000

Ten months ago, we launched the Vesuvius Challenge to solve the ancient problem of the Herculaneum Papyri, a library of scrolls that were flash-fried by the eruption of Mount Vesuvius in 79 AD.

Today we are overjoyed to announce that our crazy project has succeeded. After 2000
H. Andrew Schwartz (@handyschwartz) 's Twitter Profile Photo

🚨 Announcement: DLATK is now available as a colab notebook! github.com/dlatk/dlatk/bl… DLATK is a suite for end-to-end human language analysis, used in 100+ AI/NLP/Psych papers. This was lead by Shashanka Subrahmanya, joint with @sal_giorgi, Adithya V Ganesan, johannes Eichstaedt, and World Well-Being Prj 🧵1/4

Jeffrey (Young-Min) Cho (@jeffrey_ch0) 's Twitter Profile Photo

We had a Q&A for our work recently: • How do people from different races discuss their depression? 🗣️ PNAS Paper: pnas.org/doi/full/10.10… • How do CS and Med work on mental health chatbots? 🤖 EMNLP Paper: aclanthology.org/2023.emnlp-mai…

Xingyu Fu (@xingyufu2) 's Twitter Profile Photo

Can Text-to-Image models understand common sense? 🤔 Can they generate images that fit everyday common sense? 🤔 tldr; NO, they are far less intelligent than us 💁🏻‍♀️ Introducing Commonsense-T2I 💡 zeyofu.github.io/CommonsenseT2I/, a novel evaluation and benchmark designed to measure

Can Text-to-Image models understand common sense? 🤔

Can they generate images that fit everyday common sense? 🤔

tldr; NO, they are far less intelligent than us 💁🏻‍♀️

Introducing Commonsense-T2I 💡 zeyofu.github.io/CommonsenseT2I/, a novel evaluation and benchmark designed to measure
Shreya Havaldar (@shreyahavaldar) 's Twitter Profile Photo

🚨 LLMs must grasp implied language to reason about emotions, social cues, etc. Our Google DeepMind paper presents the Implied NLI dataset. Targeting social norms 🌎 and conversational dynamics 💬, we enhance LLM understanding of real-world implication! arxiv.org/abs/2501.07719

Sunny Rai (@snyrai_) 's Twitter Profile Photo

Does #shame manifest differently across #cultures? Yes. Can LLMs identify #norms behind shame? Yes. Are women shamed more than men? Yes!!! Can #LLMs identify when someone is shamed? arxiv.org/abs/2402.11333 #NAACL2025

Bowen Jiang (Lauren) @ Penn (@laurenbjiang) 's Twitter Profile Photo

🚀 How well can LLMs know you and personalize your response? Turns out, not so much! Introducing the PersonaMem Benchmark -- 👩🏻‍💻Evaluate LLM's ability to understand evolving persona from 180+ multi-session user-chatbot conversation history 🎯Latest models (GPT-4.1, GPT-4.5,

🚀 How well can LLMs know you and personalize your response? Turns out, not so much!

Introducing the PersonaMem Benchmark --
👩🏻‍💻Evaluate LLM's ability to understand evolving persona from 180+ multi-session user-chatbot conversation history
🎯Latest models (GPT-4.1, GPT-4.5,
Sunny Rai (@snyrai_) 's Twitter Profile Photo

#NAACL25 #C3NLP How do cultures differ in their mental health struggles? Do they seek professional help or emotional comfort? Check out our study on cross-cultural differences in mental health expressions on Reddit. aclanthology.org/2025.c3nlp-1.1… #mentalhealth #depression #llm

Sharath Chandra Guntuku (@sharathguntuku) 's Twitter Profile Photo

Thrilled to share that our work has received the Outstanding Paper Award C3NLP #NAACL2025! 🎉 Huge congratulations to Sunny Rai & Khushi Shelat for leading this collaborative effort, and thanks to Jeffrey (Young-Min) Cho for delivering an amazing talk on behalf of his co-authors! 🙌🏽

Thrilled to share that our work has received the Outstanding Paper Award <a href="/c3_nlp/">C3NLP</a> #NAACL2025! 🎉

Huge congratulations to <a href="/snyrai_/">Sunny Rai</a> &amp; <a href="/khushi_shelat/">Khushi Shelat</a> for leading this collaborative effort, and thanks to <a href="/jeffrey_ch0/">Jeffrey (Young-Min) Cho</a> for delivering an amazing talk on behalf of his co-authors! 🙌🏽
Jeffrey (Young-Min) Cho (@jeffrey_ch0) 's Twitter Profile Photo

#NAACL2025 How to compare cultural differences with social media data in scale? Our work uses lexica to annotate X 🇺🇸 & Weibo 🇨🇳 posts with valence (😄☹️) & arousal (🔥❄️) scores, revealing cross-cultural differences in emotional expression. aclanthology.org/2025.findings-…

Yu Feng (@anniefeng6) 's Twitter Profile Photo

🚨COLM 2025 Workshop on AI Agents: Capabilities and Safety Conference on Language Modeling This workshop explores AI agents’ capabilities—including reasoning and planning, interaction and embodiment, and real-world applications—as well as critical safety challenges related to reliability, ethics,

🚨COLM 2025 Workshop on AI Agents: Capabilities and Safety <a href="/COLM_conf/">Conference on Language Modeling</a> 

This workshop explores AI agents’ capabilities—including reasoning and planning, interaction and embodiment, and real-world applications—as well as critical safety challenges related to reliability, ethics,
Jeffrey (Young-Min) Cho (@jeffrey_ch0) 's Twitter Profile Photo

🤖💬 Herding instincts… in AIs? Yes, even LLMs can follow the crowd! • 📉 Conformity ↑ when agents lack confidence but trust peers • 🧠 Presentation format shapes peer influence • 🎯 Controlled herding can boost collaboration outcomes 👉 Read more: arxiv.org/abs/2505.21588

🤖💬 Herding instincts… in AIs? Yes, even LLMs can follow the crowd!

• 📉 Conformity ↑ when agents lack confidence but trust peers
• 🧠 Presentation format shapes peer influence
• 🎯 Controlled herding can boost collaboration outcomes

👉 Read more: arxiv.org/abs/2505.21588
Billy Xuanming Zhang (@xuanmingzhang07) 's Twitter Profile Photo

😵‍💫 Long-context human-AI planning with LLMs struggles when users have to manually manage all the context in messy chats (e.g. with ChatGPT). Meet 💡JumpStarter: task-structured context curation for better, collaborative planning with LLMs on complex tasks. 🧵 (1/n)

😵‍💫 Long-context human-AI planning with LLMs struggles when users have to manually manage all the context in messy chats (e.g. with ChatGPT). 
Meet 💡JumpStarter: task-structured context curation for better, collaborative planning with LLMs on complex tasks. 🧵 (1/n)