Rulin Shao (@rulinshao)'s Twitter Profile
Rulin Shao

@rulinshao

PhD @UWNLP, visiting researcher @Meta.

https://rulinshao.github.io/ · Joined 08-04-2022 19:21:35

200 Tweets · 1.1K Followers · 662 Following

jack morris (@jxmnop)

new paper from our work at Meta!

**GPT-style language models memorize 3.6 bits per param**

we compute capacity by measuring total bits memorized, using some theory from Shannon (1953)

shockingly, the memorization-datasize curves look like this:
      ___________
     /
    /

(🧵)
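
A rough back-of-the-envelope reading of the headline number (a sketch, not the paper's measurement procedure): at roughly 3.6 bits per parameter, a model's total memorization budget can be estimated and compared with the raw size of the training data, which is why the curves flatten once the data holds more bits than the model can store. The parameter count and bits-per-token figure below are illustrative assumptions.

```python
# Back-of-the-envelope sketch (not the paper's method): estimate total
# memorization capacity from the ~3.6 bits/param figure and compare it with a
# dataset's raw size to see where the memorization curve should flatten.
BITS_PER_PARAM = 3.6  # headline estimate from the thread


def capacity_bits(num_params: float) -> float:
    """Approximate total memorization capacity of a model, in bits."""
    return BITS_PER_PARAM * num_params


def dataset_bits(num_tokens: float, bits_per_token: float = 16.0) -> float:
    """Raw dataset size in bits (16 bits/token is an illustrative assumption)."""
    return num_tokens * bits_per_token


if __name__ == "__main__":
    params = 124e6  # GPT-2-small scale, chosen only for illustration
    print(f"capacity  ≈ {capacity_bits(params) / 8 / 1e6:.0f} MB")
    print(f"1B tokens ≈ {dataset_bits(1e9) / 8 / 1e6:.0f} MB raw")
    # Once the dataset holds far more bits than the model can store, per-sample
    # memorization has to stop growing: the flat part of the curve above.
```
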
Akari Asai (@akariasai)

‘Bold,’ ‘positive’ and ‘unparalleled’: Allen School Ph.D. graduates Ashish Sharma and Sewon Min recognized with ACM Doctoral Dissertation Awards
news.cs.washington.edu/2025/06/04/all…

Massive congrats to Ashish Sharma and Sewon Min - huge win for UW NLP and the broader NLP community! 🙌

Ludwig Schmidt (@lschmidt3)

Very excited to finally release our paper for OpenThoughts!

After DataComp and DCLM, this is the third large open dataset my group has been building in collaboration with the DataComp community. This time, the focus is on post-training, specifically reasoning data.
Han Guo (@hanguo97)

We know Attention and its linear-time variants, such as linear attention and State Space Models. But what lies in between?

Introducing Log-Linear Attention with:

- Log-linear time training
- Log-time inference (in both time and memory)
- Hardware-efficient Triton kernels
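
To make the "between attention and linear attention" idea concrete, below is a toy sketch of one way a logarithmic number of summaries can cover the whole past: power-of-two chunks that merge like carries in a binary counter, so inference only touches O(log t) states. The class name, the rank-1 chunk states, and the unweighted read-out are illustrative assumptions, not the paper's algorithm or its Triton kernels.

```python
import torch


class LogStateMemory:
    """Toy linear-attention-style memory over power-of-two chunks of the past.

    Equal-sized chunk states merge like carries in a binary counter, so after
    t tokens at most ~log2(t) + 1 (chunk_size, state) pairs remain.
    """

    def __init__(self, d: int):
        self.levels = []  # list of (chunk_size, state), state has shape [d, d]

    def append(self, k: torch.Tensor, v: torch.Tensor) -> None:
        size, state = 1, torch.outer(k, v)  # rank-1 summary of one token
        # Merge equal-sized chunks, exactly like carrying in binary addition.
        while self.levels and self.levels[-1][0] == size:
            prev_size, prev_state = self.levels.pop()
            size, state = prev_size + size, prev_state + state
        self.levels.append((size, state))

    def read(self, q: torch.Tensor) -> torch.Tensor:
        # Query each of the O(log t) summaries; a real model would combine them
        # with learned, data-dependent weights rather than a plain sum.
        return sum(q @ state for _, state in self.levels)


d, T = 16, 1000
mem = LogStateMemory(d)
for _ in range(T):
    k, v, q = torch.randn(d), torch.randn(d), torch.randn(d)
    mem.append(k, v)
out = mem.read(q)
print(f"tokens seen: {T}, summaries kept: {len(mem.levels)}")  # well under T
```
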
Zirui Liu (@ziruirayliu)

🔥Excited to share our new work on reproducibility challenges in reasoning models caused by numerical precision. Ever run the same prompt twice and get completely different answers from your LLM under greedy decoding? You're not alone. Most LLMs today default to BF16 precision,
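
A minimal, self-contained illustration of the mechanism behind this (an example constructed here, not taken from the paper): floating-point addition is not associative, and at BF16 precision a change in summation order, which GPU kernels routinely make across batch sizes and hardware, can shift a value by a full rounding step and flip a near-tie between logits even under greedy decoding.

```python
import torch

a = torch.tensor(1.0, dtype=torch.bfloat16)
b = torch.tensor(2.0 ** -8, dtype=torch.bfloat16)  # half a bf16 ulp at 1.0

left = (a + b) + b   # each add rounds back down to 1.0
right = a + (b + b)  # the small terms combine first and survive rounding

print(left.item(), right.item())  # 1.0 vs 1.0078125 in bf16

# The same sums in float32 agree exactly, which is why higher precision makes
# greedy decoding far more reproducible from run to run.
a32, b32 = a.float(), b.float()
print(((a32 + b32) + b32).item(), (a32 + (b32 + b32)).item())  # both 1.0078125
```
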

Han Guo (@hanguo97)

One key takeaway from recent work on test-time compute: even a small weight update can make a big difference. So, what happens if we meta-learn those updates (and not necessarily at test time)? Excited to share this new work led by Adam Zweiger and Jyo Pari!

Rulin Shao (@rulinshao)

Honored to be part of organizing the LM4Sci workshop at #COLM2025! 🔬🤖 We invite submissions that demonstrate innovative approaches to scientific reasoning and discovery. Submit by June 23! 🚀

Rulin Shao (@rulinshao)

It reminds me of the cognitive behaviors that have been found to help reasoning—backtracking, subgoal setting, verifications, etc.—they all seem to fit this parallel generation pattern better than linearly chaining them. Looking forward to trying it out!

Thao Nguyen (@thao_nguyen26)

Web data, the “fossil fuel of AI”, is being exhausted. What’s next?🤔
We propose Recycling the Web to break the data wall of pretraining via grounded synthetic data. It is more effective than standard data filtering methods, even with multi-epoch repeats!

arxiv.org/abs/2506.04689
CLS (@chengleisi)

Are AI scientists already better than human researchers?

We recruited 43 PhD students to spend 3 months executing research ideas proposed by an LLM agent vs human experts.

Main finding: LLM ideas result in worse projects than human ideas.
Bo Liu (Benjamin Liu) (@benjamin_eecs)

We've always been excited about self-play unlocking continuously improving agents. Our insight: RL selects generalizable CoT patterns from pretrained LLMs. Games provide perfect testing grounds with cheap, verifiable rewards. Self-play automatically discovers and reinforces
Peng Qi (@qi2peng2)

Seven years ago, I co-led a paper called 𝗛𝗼𝘁𝗽𝗼𝘁𝗤𝗔 that has motivated and facilitated many #AI #Agents research works since. Today, I'm asking that you stop using HotpotQA blindly for agents research in 2025 and beyond. In my new blog post, I revisit the brief history of

Victoria Graf (@victoriawgraf)

Worried about overfitting to IFEval? 🤔 Use ✨IFBench✨, our new, challenging instruction-following benchmark! Loved working w/ Valentina Pyatkin! Personal highlight: our multi-turn eval setting makes it possible to isolate constraint-following from the rest of the instruction 🔍

Rulin Shao (@rulinshao)

🚀 Last year: MassiveDS-1.4T showed great scaling gains with a web-scale datastore but was too heavy for online production
✨ Now: CompactDS is here! Better performance, compact size, ready for agentic apps & Deep Research RL training
Kudos to Xinxi Lyu and Michael Duan for leading this!

Rulin Shao (@rulinshao)

Happy to share that ReasonIR has been accepted to the Conference on Language Modeling! Synthetic data & test-time scaling are powerful tools to enable new capabilities for challenging tasks. I’m impressed by how quickly smaller retrievers and better rerankers have been developed with ReasonIR data! #COLM2025