Sewon Min (@sewon__min) 's Twitter Profile
Sewon Min

@sewon__min

Incoming faculty @Berkeley_EECS @berkeley_ai || Research scientist at @allen_ai || PhD from @uwcse @uwnlp

ID: 928289778960707584

Link: http://sewonmin.com · Joined: 08-11-2017 15:55:04

972 Tweets

11.11K Followers

739 Following

Berkeley AI Research (@berkeley_ai) 's Twitter Profile Photo

BAIR faculty Stuart Russell, Dan Klein, Alane Suhr, Ken Goldberg, and Sewon Min weigh in on the future of LLMs, synthetic data, and the road ahead ⬇️ alumni.berkeley.edu/california-mag…

Rulin Shao (@rulinshao) 's Twitter Profile Photo

Meet ReasonIR-8B✨the first retriever specifically trained for reasoning tasks! Our challenging synthetic training data unlocks SOTA scores on reasoning IR and RAG benchmarks. ReasonIR-8B ranks 1st on BRIGHT and outperforms search engine and retriever baselines on MMLU and GPQA🔥
Ai2 (@allen_ai) 's Twitter Profile Photo

We’re live on Reddit! Ask us Anything about our OLMo family of models. We have six of our researchers on hand to answer all your questions.
Muhammad Khalifa (@mkhalifaaaa) 's Twitter Profile Photo

🚨Announcing SCALR @ COLM 2025 — Call for Papers!🚨

The 1st Workshop on Test-Time Scaling and Reasoning Models (SCALR) is coming to the Conference on Language Modeling in Montreal this October!

This is the first workshop dedicated to this growing research area.

🌐 scalr-workshop.github.io
Stella Li (@stellalisy) 's Twitter Profile Photo

🤯 We cracked RLVR with... Random Rewards?!
Training Qwen2.5-Math-7B with our Spurious Rewards improved MATH-500 by:
- Random rewards: +21%
- Incorrect rewards: +25%
- (FYI) Ground-truth rewards: +28.8%
How could this even work⁉️ Here's why: 🧵
Blogpost: tinyurl.com/spurious-rewar…
Omar Khattab (@lateinteraction) 's Twitter Profile Photo

Tensor Templar: The important breakthrough is that a lot of the "RL just works" noise has little to do with RL and more to do with "Qwen mid-training for math and coding makes the model very receptive to the same jumps on math and coding over and over."

Yizhong Wang (@yizhongwyz) 's Twitter Profile Photo

Thrilled to announce that I will be joining UT Austin Computer Science as an assistant professor in fall 2026!

I will continue working on language models, data challenges, learning paradigms, & AI for innovation. Looking forward to teaming up with new students & colleagues! 🤠🤘
Allen School (@uwcse) 's Twitter Profile Photo

Congratulations to University of Washington #UWAllen Ph.D. grads Ashish Sharma & Sewon Min, Association for Computing Machinery Doctoral Dissertation Award honorees! Sharma won for #AI tools for mental health; Min received honorable mention for efficient, flexible language models. #ThisIsUW news.cs.washington.edu/2025/06/04/all…

Association for Computing Machinery (@theofficialacm) 's Twitter Profile Photo

🎓 Congrats to Ashish Sharma, University of Washington, on receiving the ACM Doctoral Dissertation Award for his dissertation, "Human-AI Collaboration to Support Mental Health and Well Being."

👏 Honorable Mentions:
Alexander Kelley, University of Illinois
Sewon Min, UC Berkeley
EleutherAI (@aieleuther) 's Twitter Profile Photo

Can you train a performant language model without using unlicensed text?

We are thrilled to announce the Common Pile v0.1, an 8TB dataset of openly licensed and public domain text. We train 7B models for 1T and 2T tokens and match the performance of similar models like LLaMA 1 & 2.
Sergey Levine (@svlevine) 's Twitter Profile Photo

I always found it puzzling how language models learn so much from next-token prediction, while video models learn so little from next frame prediction. Maybe it's because LLMs are actually brain scanners in disguise. Idle musings in my new blog post: sergeylevine.substack.com/p/language-mod…

Ary (@aryg18) 's Twitter Profile Photo

fwiw, I think Prof. Percy Liang and the CS336 team nailed this: Sutton’s Bitter Lesson is often misinterpreted as “scale is all that matters” and/or “algorithms don’t matter.” The more accurate – and useful – interpretation is: what matters are the algorithms that scale.

Rulin Shao (@rulinshao) 's Twitter Profile Photo

🚀 Last year: MassiveDS-1.4T showed great scaling gains with a web-scale datastore but was too heavy for online production. ✨ Now: CompactDS is here! Better performance, compact size, ready for agentic apps & Deep Research RL training. Kudos to Xinxi Lyu and Michael Duan for leading this!

Weijia Shi (@weijiashi2) 's Twitter Profile Photo

Can data owners & LM developers collaborate to build a strong shared model while each retaining data control? Introducing FlexOlmo💪, a mixture-of-experts LM enabling: • Flexible training on your local data without sharing it • Flexible inference to opt in/out your data

Akari Asai (@akariasai) 's Twitter Profile Photo

Some updates 🚨
I finished my Ph.D. at the Allen School in June 2025!
After a year at AI2 as a Research Scientist, I am joining the CMU Language Technologies Institute & Machine Learning Department at Carnegie Mellon (courtesy) as an Assistant Professor in Fall 2026.
The journey, acknowledgments & recruiting in 🧵