Yejin Choi (@yejinchoinka)'s Twitter Profile
Yejin Choi

@yejinchoinka

professor at Stanford, researcher at NVIDIA, adventurer at heart

ID: 893882282175471616

Link: https://homes.cs.washington.edu/~yejin/ · Joined: 05-08-2017 17:11:58

1.1K Tweets

22.22K Followers

387 Following

David Bau (@davidbau)

Dear MAGA friends, I have been worrying about STEM in the US a lot, because right now the Senate is writing new laws that cut 75% of the STEM budget in the US. Sorry for the long post, but the issue is really important, and I want to share what I know about it. The entire

Christopher Manning (@chrmanning)

If the US wants to be less dependent on foreign-born scientists and engineers, then you’d think we’d be wanting to increase the production of US-born scientists and engineers. But, apparently not. 🤔

Jaehun Jung (@jaehunjung_com)

Data curation is crucial for LLM reasoning, but how do we know whether a dataset is overfit to one benchmark or actually generalizes to unseen distributions? 🤔

𝐃𝐚𝐭𝐚 𝐝𝐢𝐯𝐞𝐫𝐬𝐢𝐭𝐲 is key; when measured correctly, it strongly predicts model generalization in reasoning tasks! 🧵
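
The tweet doesn't specify the paper's diversity metric. As a rough illustration of how one might quantify data diversity, the sketch below computes mean pairwise cosine distance over instruction embeddings; the embedding model and the metric itself are assumptions for illustration, not the paper's method.

# Illustrative sketch only: mean pairwise cosine distance as a data-diversity
# proxy. The embedding model and this metric are assumptions, not the paper's.
import numpy as np
from sentence_transformers import SentenceTransformer  # pip install sentence-transformers

def embedding_diversity(texts: list[str]) -> float:
    model = SentenceTransformer("all-MiniLM-L6-v2")
    emb = model.encode(texts, normalize_embeddings=True)  # rows are unit vectors
    sims = emb @ emb.T                                    # cosine similarity matrix
    n = len(texts)
    # Mean pairwise cosine distance, excluding the all-ones diagonal.
    return float((n * n - sims.sum()) / (n * (n - 1)))

print(embedding_diversity([
    "What is 2+2?",
    "Prove that sqrt(2) is irrational.",
    "Sort a list in Python.",
]))
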
Ximing Lu (@gximing)

What happens when you ✨scale up RL✨? In our new work, Prolonged RL, we significantly scale RL training to >2k steps and >130k problems—and observe exciting, non-saturating gains as we spend more compute 🚀.

Etash Guha @ ICLR (@etash_guha)

OpenThinker3-7B is the SOTA open-data 7B reasoning model, powered by our new OpenThoughts3-1.2M dataset! We beat DeepSeek-R1-Distill-7B on our benchmarks by 33% on average 🚀🚀. Our new paper 📝 offers unique insights into data curation. Filtering questions works, but

Sydney Levine (@sydneymlevine)

🔆 I'm hiring! 🔆 There are two open positions:
1. Summer research position (best for a master's or graduate student); focus on computational social cognition.
2. Postdoc (currently interviewing!); focus on computational social cognition and AI safety.
sites.google.com/corp/site/sydn…

Natasha Jaques (@natashajaques)

Currently, reinforcement learning from human feedback (RLHF) is the predominant method for ensuring LLMs are safe and aligned. And yet it provides no guarantees that they won’t say something harmful, copyrighted, or inappropriate. In our latest paper, we use online adversarial

Jae Sung Park (@jjaesungpark)

🔥We are excited to present our work Synthetic Visual Genome (SVG) at #CVPR25 tomorrow!
🕸️ Dense scene graph with diverse relationship types.
🎯 Generate scene graphs with SAM segmentation masks!
🔗 Project link: bit.ly/4e1uMDm
📍 Poster: #32689, Fri 2-4 PM 👇🧵

Eunsol Choi (@eunsolc)

Knowledge propagation in LLMs is notoriously challenging. Check out our paper that improves it substantially by training a hypernetwork to target knowledge propagation!
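
For readers unfamiliar with the idea: in the MEND-style family of knowledge editors, a small hypernetwork maps the gradient of an edit loss into a parameter update for the base model. Below is a toy PyTorch sketch of that shape only; the dimensions, names, and low-rank design are illustrative assumptions, not the paper's architecture.

# Toy sketch of a hypernetwork-based editor: a small net maps the gradient of
# an edit loss on one weight matrix to a low-rank update for that matrix.
# Shapes and architecture are illustrative assumptions, not the paper's.
import torch
import torch.nn as nn

d = 64
target = nn.Linear(d, d)  # layer whose stored knowledge we want to edit

class EditHyperNet(nn.Module):
    def __init__(self, d: int, rank: int = 4):
        super().__init__()
        self.u = nn.Linear(d, rank, bias=False)  # compress gradient rows
        self.v = nn.Linear(rank, d, bias=False)  # expand back to weight shape

    def forward(self, grad: torch.Tensor) -> torch.Tensor:
        return self.v(self.u(grad))  # low-rank edit, same shape as the weight

hyper = EditHyperNet(d)
x, y_new = torch.randn(1, d), torch.randn(1, d)   # (input, desired new output)
loss = (target(x) - y_new).pow(2).mean()
(g,) = torch.autograd.grad(loss, target.weight)   # gradient carries the edit signal
with torch.no_grad():
    target.weight -= hyper(g)                     # apply the generated update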

Hao Xu (@xuhaoxh)

Wanna 🔎 inside Internet-scale LLM training data w/o spending 💰💰💰?
Introducing infini-gram mini, an exact-match search engine with 14x less storage req than the OG infini-gram 😎
We make 45.6 TB of text searchable. Read on to find our Web Interface, API, and more.
(1/n) ⬇️
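
For intuition about exact-match search over a static corpus (the original infini-gram builds on suffix arrays): the toy sketch below counts occurrences of a query string by binary search over a suffix array. It illustrates the general mechanism only, not infini-gram mini's implementation, whose storage savings come from a more compact index. Requires Python 3.10+ for bisect's key argument.

# Toy exact-match counting with a suffix array; all suffixes that start with
# the query form one contiguous range in the sorted array.
from bisect import bisect_left, bisect_right

def build_suffix_array(text: str) -> list[int]:
    # O(n^2 log n) toy construction; real systems use linear-time algorithms.
    return sorted(range(len(text)), key=lambda i: text[i:])

def count_occurrences(text: str, sa: list[int], query: str) -> int:
    # Compare the query against each suffix truncated to the query's length.
    lo = bisect_left(sa, query, key=lambda i: text[i:i + len(query)])
    hi = bisect_right(sa, query, key=lambda i: text[i:i + len(query)])
    return hi - lo

corpus = "the cat sat on the mat and the cat ran"
sa = build_suffix_array(corpus)
print(count_occurrences(corpus, sa, "the cat"))  # -> 2
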
Andy Konwinski (@andykonwinski)

Today, I’m launching a deeply personal project. I’m betting $100M that we can help computer scientists create more upside impact for humanity.
Built for and by researchers, including Jeff Dean & Joelle Pineau on the board, Laude Institute catalyzes research with real-world impact.

Prithviraj (Raj) Ammanabrolu (@rajammanabrolu)

My next professional move is to go to the Source of the Compute. Soon™ I'll be hanging out with the incredible researchers at NVIDIA as an RS working on open source/science post-training, esp. reasoning VLA models for embodied agents! There is no ASGI without embodiment!

Yong Lin (@yong18850571)

(1/4)🚨 Introducing Goedel-Prover V2 🚨
🔥🔥🔥 The strongest open-source theorem prover to date.
🥇 #1 on PutnamBench: Solves 64 problems—with far less compute.
🧠 New SOTA on MiniF2F:
* 32B model hits 90.4% at Pass@32, beating DeepSeek-Prover-V2-671B’s 82.4%.
* 8B > 671B: Our 8B
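
A quick gloss on the metric, since Pass@32 does the heavy lifting here: it estimates the probability that at least one of 32 sampled proofs verifies. Given n samples per problem with c verified successes, the standard unbiased estimator from the code-generation literature is pass@k = 1 - C(n-c, k)/C(n, k); this is my gloss, not detail from the tweet, and the numbers below are hypothetical.

# Unbiased pass@k estimator: with n sampled proofs on a problem and c verified
# successes, estimate P(at least one of k random samples passes).
from math import comb

def pass_at_k(n: int, c: int, k: int) -> float:
    if n - c < k:  # fewer than k failures exist, so any k draws contain a success
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)

# Hypothetical numbers: 64 samples on one problem, 4 verified proofs.
print(round(pass_at_k(64, 4, 32), 4))
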
Seungju Han (@seungjuhan3)

life update: I'll be starting my PhD in CS at Stanford this September! I'm very excited to continue my research on reasoning in language models and to make new friends in the Bay Area! I'm deeply grateful to everyone who supported me and made this milestone possible.

Etash Guha @ ICLR (@etash_guha)

Quick Update: I’ve officially started my PhD at Stanford (go trees i think?!?)! After an amazing year at UW (go huskies!), I’m super happy to continue my CS PhD with my amazing advisors Ludwig Schmidt and Yejin Choi! If you see me on campus, please say hi and listen to me rant

Abhilasha Ravichander (@lasha_nlp)

Life update: I’m excited to share that I’ll be starting as faculty at the Max Planck Institute for Software Systems this Fall! 🎉

I’ll be recruiting PhD students in the upcoming cycle, as well as research interns throughout the year: lasharavichander.github.io/contact.html

Abhilasha Ravichander (@lasha_nlp)

Super thrilled that HALoGEN, our study of LLM hallucinations and their potential origins in training data, received an Outstanding Paper Award at ACL! Joint work w/ Shrusti Ghela*, David Wadden, and Yejin Choi 💫

Shrusti Ghela (@shrusti_ghela)

HALoGEN: Fantastic LLM Hallucinations and Where to Find Them won an Outstanding Paper Award at #ACL2025! ✨🥹

The initial hallucination was thinking this would be a quick project XD. Loved every minute of working with Abhilasha Ravichander*, David Wadden, and Yejin Choi ☀️

Shizhe Diao (@shizhediao)

🚀 How far can RL scaling take LLMs?
Drop ProRLv2! 🔥 With ProRLv2, we keep expanding LLMs’ reasoning boundaries through 3,000+ RL steps over 5 domains and set a new state-of-the-art 🌟 among 1.5B reasoning models.

🔗 Full blog: research.nvidia.com/labs/lpr/prorl…
🤗 Open model: