Justus Mattern (@matternjustus)'s Twitter Profile
Justus Mattern

@matternjustus

Research Engineer @PrimeIntellect | prev. co-founder re.video (YC S23), research @MPI_IS, physics @RWTH

ID: 1371519617550663687

https://www.justusmattern.com · Joined 15-03-2021 17:52:22

570 Tweets

2.2K Followers

300 Following

Justus Mattern (@matternjustus) 's Twitter Profile Photo

great thread, summarizes well why we're particularly excited about RL: higher inference-to-training compute ratio -> less inter-node communication -> better suited for globally distributed training infra with slow connection speeds
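A rough sketch (my own illustration, not from the thread) of why a high inference-to-training compute ratio helps here: in an RL loop, rollout generation is pure inference and can run entirely on isolated nodes, while only small payloads, rewards and updated weights, have to cross the slow inter-node links.

```python
# Toy sketch of a decentralized RL step; every name here is illustrative.
# Rollout generation (the inference-heavy part) stays local to each node;
# only rewards and the updated weights would cross the slow connection.
import random

def generate_rollouts(weights, prompts, samples_per_prompt=8):
    """Inference-heavy stage: runs entirely on the local node."""
    return [(p, [random.random() for _ in range(samples_per_prompt)]) for p in prompts]

def score(rollouts):
    """Reward computation: local, and its output is a handful of scalars."""
    return [sum(samples) / len(samples) for _, samples in rollouts]

def update_policy(weights, rollouts, rewards, lr=1e-2):
    """Training stage: the only step whose output (new weights) needs to be
    broadcast across the slow inter-node connection."""
    return [w + lr * (sum(rewards) / len(rewards)) for w in weights]

weights = [0.0] * 4          # stand-in for model parameters
prompts = ["p1", "p2"]       # stand-in for this node's prompt shard
for _ in range(3):           # a few decentralized RL steps
    rollouts = generate_rollouts(weights, prompts)
    weights = update_policy(weights, rollouts, score(rollouts))
```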

Justus Mattern (@matternjustus) 's Twitter Profile Photo

SYNTHETIC-2 datasets are now on Hugging Face! We’re releasing an SFT dataset collected from the new R1-0528 as well as an RL dataset with difficulty annotations from various smaller models. Go train some models 🫡
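If you want to poke at the data, a minimal loading sketch with the `datasets` library; the repo IDs below are placeholders I made up, check the PrimeIntellect organization on Hugging Face for the actual SYNTHETIC-2 dataset names.

```python
# Minimal loading sketch; the dataset IDs are hypothetical placeholders,
# not confirmed names from the announcement.
from datasets import load_dataset

sft = load_dataset("PrimeIntellect/SYNTHETIC-2-SFT", split="train")  # hypothetical ID
rl = load_dataset("PrimeIntellect/SYNTHETIC-2-RL", split="train")    # hypothetical ID

print(sft[0])  # inspect one SFT example (traces collected from R1-0528)
print(rl[0])   # RL examples carry difficulty annotations per the announcement
```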

Jackmin (@jackminong) 's Twitter Profile Photo

Toploc poster session tomorrow (Wed) at 4:30 PM, East Hall E-1106. I’ll be around through Saturday; if you’re into decentralized training & inference, let's chat!

Simon Guo 🦝 (@simonguozirui) 's Twitter Profile Photo

At #ICML2025 in Vancouver 🇨🇦 this week, presenting some work from my first year at Stanford! Come find me at the posters or just around the conference!
Thursday: KernelBench: Can LLMs Write Efficient GPU Kernels? 11 AM, East E-2010
Saturday: Kevin: Multi-Turn RL for Generating

Mario Sieg (@_mario_neo_) 's Twitter Profile Photo

my piquant quantization kernels are almost 50 times faster than pytorch's on the CPU. pytorch’s sub‑byte quantization (torch.quint4x2, torch.quint2x4) is quite slow. 1/2

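For context on what "sub-byte" means here, a small packing sketch: 4-bit values stored two per byte, the layout that dtypes like torch.quint4x2 describe. This is my own plain-NumPy illustration for clarity, not piquant's optimized CPU kernels.

```python
# Illustration of 4-bit (sub-byte) packing: two quantized values per byte.
# Plain NumPy for readability; not the piquant kernels themselves.
import numpy as np

def pack_uint4(values: np.ndarray) -> np.ndarray:
    """Pack pairs of 4-bit values (0..15) into single bytes."""
    v = values.astype(np.uint8)
    assert v.max() < 16, "values must fit in 4 bits"
    if v.size % 2:                 # pad to an even count
        v = np.append(v, np.uint8(0))
    lo, hi = v[0::2], v[1::2]
    return (hi << 4) | lo          # high nibble holds the second value

def unpack_uint4(packed: np.ndarray) -> np.ndarray:
    """Recover the 4-bit values from their packed byte representation."""
    lo = packed & 0x0F
    hi = packed >> 4
    return np.stack([lo, hi], axis=1).reshape(-1)

x = np.array([1, 15, 7, 3])
assert np.array_equal(unpack_uint4(pack_uint4(x)), x)
```
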
Justus Mattern (@matternjustus) 's Twitter Profile Photo

RL with predefined tools does not matter in the long term; the most bitter-lesson-pilled approach is giving the model a single universal tool (a computer)
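A minimal sketch of what "a single universal tool" could look like in an agent loop: the model emits shell commands and sees their output. Everything here (the function names, the lack of a real sandbox) is illustrative, not a specific Prime Intellect implementation.

```python
# Illustrative agent loop with one universal tool: a shell on a (sandboxed!) computer.
# run_model() is a stand-in for whatever LLM backend you would actually use.
import subprocess

def run_command(cmd: str, timeout: int = 10) -> str:
    """The single tool: execute a command and return its combined output."""
    result = subprocess.run(cmd, shell=True, capture_output=True, text=True, timeout=timeout)
    return result.stdout + result.stderr

def agent_loop(task: str, run_model, max_turns: int = 8) -> str:
    transcript = f"Task: {task}\n"
    for _ in range(max_turns):
        action = run_model(transcript)       # model proposes the next shell command
        if action.startswith("DONE:"):       # or signals a final answer
            return action[len("DONE:"):].strip()
        observation = run_command(action)
        transcript += f"$ {action}\n{observation}\n"
    return transcript
```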

Minn (@minney_cat) 's Twitter Profile Photo

this year alone, I've met hundreds of the world's elite AI researchers + engineers steering the future of intelligence, and they ultimately want to do it here, in the US 🇺🇸. If a visa or green card is holding you back, join us on 7/31 in SF and hear real stories from

Justus Mattern (@matternjustus) 's Twitter Profile Photo

While LLMs are good at generating functionally correct frontend code, it’s stunning how bad AI-generated UIs are; I’m certain that this can become better with appropriate evals and reward signals. Really excited about this leaderboard and the very hard-working team behind it!

Justus Mattern (@matternjustus) 's Twitter Profile Photo

Just landed in SF; I'm now stranded here without a desk while the rest of my team is still in Europe. If anyone can host me at their office for (one of) the next few days, please let me know (DMs open 🥹👉👈)

Chujie Zheng (@chujiezheng) 's Twitter Profile Photo

Noticed some curiosity about the specific score comparison between GSPO and GRPO. From our perspective, we’re more focused on scalability — can we achieve better performance by increasing compute (e.g., training with more steps, extending generation length, regularly updating
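For readers missing the context (the tweet is cut off above): as I understand the GSPO formulation, the key difference from GRPO is that the importance ratio and clipping are defined at the sequence level, via a length-normalized ratio per response, rather than per token. The sketch below is my reading of the paper, simplified; it is not something stated in this tweet.

```python
# Rough sketch of token-level (GRPO-style) vs sequence-level (GSPO-style)
# importance ratios; clipping, advantages, and the full objective are omitted.
import torch

def token_level_ratios(logp_new: torch.Tensor, logp_old: torch.Tensor) -> torch.Tensor:
    """GRPO-style: one importance ratio per generated token."""
    return torch.exp(logp_new - logp_old)               # shape: (seq_len,)

def sequence_level_ratio(logp_new: torch.Tensor, logp_old: torch.Tensor) -> torch.Tensor:
    """GSPO-style: a single length-normalized ratio for the whole response."""
    return torch.exp((logp_new - logp_old).mean())      # scalar

logp_old = torch.randn(32)                   # per-token log-probs, old policy
logp_new = logp_old + 0.01 * torch.randn(32)
print(token_level_ratios(logp_new, logp_old).shape)     # torch.Size([32])
print(sequence_level_ratio(logp_new, logp_old))         # single tensor value
```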