Vincent Weisser (@vincentweisser)'s Twitter Profile
Vincent Weisser

@vincentweisser

ceo @primeintellect / decentralized agi & science

ID: 427766081

Website: http://vincentweisser.com · Joined: 03-12-2011 23:18:48

9.9K Tweets · 19.19K Followers · 2.2K Following

moons (@moonsandhues)'s Twitter Profile Photo

⢀⣀⣀⣀⣀⠀⠀⠀⠀⠀⠀⠀⠀⢀⣀⡀⠀⠀ ⠘⢿⣿⡿⠿⢿⣦⡐⠈⡀⢂⡖⠿⠻⢽⣿⣿⠃ ⠀⠈⣿⣿⣮⠂⡝⡻⣆⡰⣻⡣⠜⡧⢾⣞⠁⠀ ⠀⠀⠙⠛⠿⢶⡤⢉⢮⠝⠢⢠⡵⠾⠛⠁⠀⠀ ⠀⠀⠀⣠⣾⣷⡰⠥⣪⡞⡬⡔⡸⣳⡄⠀⠀⠀ ⠀⠀⠀⠈⢿⣿⣾⢷⡋⠙⣶⣷⣟⣯⠃⠀⠀⠀ ⠀⠀⠀⠀⠈⠙⠛⠋⠀⠀⠉⠛⠋⠁

Vincent Weisser (@vincentweisser)'s Twitter Profile Photo

Awesome work by Mario Sieg to accelerate quantization of pseudo-gradients in decentralized training settings like DiLoCo - already integrated in pccl (prime collective communication library)
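For context, here is a rough sketch of what quantizing pseudo-gradients in a DiLoCo-style setup can look like: after the local inner steps, each worker communicates the difference between the synced and local weights, and compressing that difference (int8 below) cuts all-reduce bandwidth. This is illustrative only and not the pccl API; the function names and the int8 scheme are assumptions.

```python
import torch

def compute_pseudo_gradient(global_params, local_params):
    # In DiLoCo-style training, the pseudo-gradient is the difference between
    # the globally synced parameters and the locally updated ones after the
    # inner optimization steps.
    return [g - l for g, l in zip(global_params, local_params)]

def quantize_int8(tensor):
    # Per-tensor symmetric int8 quantization: an int8 payload plus a float
    # scale, roughly 4x less communication than fp32.
    scale = tensor.abs().max().clamp(min=1e-12) / 127.0
    q = torch.clamp((tensor / scale).round(), -127, 127).to(torch.int8)
    return q, scale

def dequantize_int8(q, scale):
    return q.to(torch.float32) * scale

# Example: quantize a pseudo-gradient before communicating it.
global_p = [torch.randn(1024)]
local_p = [p - 0.01 * torch.randn_like(p) for p in global_p]  # after local steps
for pg in compute_pseudo_gradient(global_p, local_p):
    q, s = quantize_int8(pg)
    restored = dequantize_int8(q, s)
    print("max quantization error:", (pg - restored).abs().max().item())
```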

Casper Hansen (@casper_hansen_)'s Twitter Profile Photo

will brown cooking up some amazing async RL things! dev branch reveals async off-policy steps, which can easily give you a 2-3x speedup because of higher GPU utilization 👀

<a href="/willccbb/">will brown</a> cooking up some amazing async RL things! dev branch reveals async off-policy steps, which can easily give you 2-3x because of higher GPU utilization👀
will brown (@willccbb)'s Twitter Profile Photo


i'm teaming up with Kyle Corbitt from openpipe to teach a class about agents + RL :)

we'll be teaching the class on Maven 🏛 starting june 16. as far as we know, this is the first course of its kind anywhere to bridge RL + LLM agents, and we’re really excited to share some of our
will brown (@willccbb)'s Twitter Profile Photo

fun fact: @primeintellect has around 20 employees total. everyone is exceptional at what they do. you have a lot of autonomy, and that comes with a lot of responsibility. we're hiring, but not rapidly. we want someone really, really good for this role. sound like fun?

Riva (@rivatez)'s Twitter Profile Photo


co-led a session with the awesome avwtr at Edge Esmeralda yesterday on what we claim to be the ultimate scientific bottleneck: our own miscalibrated conceptions of what science even is

an example of this is how we've reduced words/concepts to very narrow meanings. people can
Justus Mattern (@matternjustus)'s Twitter Profile Photo


another easy-to-generate verifiable task: ask an LLM to generate JSON that adheres to a very complex (LLM-generated) pydantic model

Coming soon to the most diverse RL dataset out there, along with many other tasks that are not math and coding
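A minimal sketch of how such a verifiable reward could look, assuming pydantic v2; the schema and reward function below are made up for illustration and are not from the actual dataset.

```python
from typing import List
from pydantic import BaseModel, Field, ValidationError

# A stand-in for a "complex" schema; in the setup described above it would
# itself be LLM-generated.
class Employee(BaseModel):
    name: str
    age: int = Field(ge=18, le=99)
    skills: List[str]

class Company(BaseModel):
    name: str
    founded: int
    employees: List[Employee]

def reward(llm_output: str) -> float:
    # Binary verifiable reward: 1.0 if the generated JSON parses and validates
    # against the schema, 0.0 otherwise.
    try:
        Company.model_validate_json(llm_output)
        return 1.0
    except ValidationError:
        return 0.0

print(reward('{"name": "Acme", "founded": 2020, "employees": '
             '[{"name": "Ada", "age": 30, "skills": ["rust"]}]}'))  # 1.0
print(reward('{"name": "Acme"}'))  # 0.0 (missing required fields)
```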
mike64_t (@mike64_t)'s Twitter Profile Photo


A 256x256 matrix memory at 2048 tokens of sequence length can achieve 98.6% retrieval accuracy.
With random embeddings, accuracy would saturate at 67% and with optimized embeddings at 80%.
Combined with a 4-layer LSTM decoder and a 1-layer store gate, the accuracy increases to
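For intuition on the setup, here is a rough sketch of an outer-product matrix memory of that shape: key/value pairs are stored as a sum of outer products in a 256x256 matrix and retrieved by a matrix-vector product. With 2048 stored pairs the raw readout is noisy from crosstalk, which is why a learned decoder on top helps; the dimensions and the store/retrieve rule are illustrative assumptions, not the exact architecture from the post.

```python
import torch
import torch.nn.functional as F

d, seq_len = 256, 2048

# Random unit-norm key and value embeddings.
keys = F.normalize(torch.randn(seq_len, d), dim=-1)
values = F.normalize(torch.randn(seq_len, d), dim=-1)

# Store: M accumulates sum_i v_i k_i^T, i.e. a 256x256 matrix memory.
M = values.T @ keys

# Retrieve: querying with key k_i returns an approximation of v_i,
# corrupted by interference from the other stored pairs.
retrieved = (M @ keys.T).T
cos = F.cosine_similarity(retrieved, values, dim=-1)
print("mean cosine similarity of retrieved values:", cos.mean().item())
```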
Grad (@grad62304977)'s Twitter Profile Photo

Seems like no one saw this either; scraping arxiv manually seems to be the way. Pretty cool paper on RL for creative writing on Qwen3 32B base, and most interestingly it's by one author from the Star Writing Team (haven't heard of them). They seem to have access to the 32B base tho

Johannes Hagemann (@johannes_hage)'s Twitter Profile Photo


ai tpot is missing lots of banger RL arxiv releases every week but Grad isn't.

we gotta post the weekly docs he creates for our paper reading group somewhere
samsja (@samsja19)'s Twitter Profile Photo


async RL is faster than its synchronous counterpart.
 
this might be the first time in ML history where an algorithm is naturally async at scale.

we realized two things at prime 6 months ago:

* RL will be as compute intensive as pretraining, pushing frontier capability

* for
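For anyone wondering what "naturally async" means in practice, here is a toy sketch of the structure: rollout workers keep generating while the learner consumes whatever is ready, instead of generation and training blocking each other every step. The names, timings, and queue-based design are illustrative assumptions, not prime intellect's actual implementation.

```python
import queue
import threading
import time

rollout_queue = queue.Queue(maxsize=64)

def rollout_worker(worker_id: int, num_rollouts: int):
    # Stand-in for LLM generation / environment interaction.
    for i in range(num_rollouts):
        time.sleep(0.01)
        rollout_queue.put({"worker": worker_id, "rollout": i})

def trainer(total_updates: int):
    for _ in range(total_updates):
        batch = rollout_queue.get()  # may come from a slightly stale policy
        time.sleep(0.005)            # stand-in for the gradient step
        # an off-policy correction (e.g. importance weighting) would go here

# 4 generators and 1 learner run concurrently; neither side idles waiting
# for the other to finish its phase.
workers = [threading.Thread(target=rollout_worker, args=(w, 25)) for w in range(4)]
learner = threading.Thread(target=trainer, args=(100,))
for t in workers + [learner]:
    t.start()
for t in workers + [learner]:
    t.join()
print("done: generation and training overlapped instead of alternating")
```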