Rick Lamers (@ricklamers)'s Twitter Profile
Rick Lamers

@ricklamers

πŸ‘¨β€πŸ’» AI Research & Engineering @GroqInc. Occasional angel investor. I publish technical resources about LLMs every week. Opinions are my own.

ID: 57274933

Link: https://codingwithintelligence.com/ · Joined: 16-07-2009 07:48:12

2.2K Tweets

5.5K Followers

668 Following


This incredible work reveals that mitigating a hidden bias can improve long-context performance, potentially opening a path to a formulation that directs attention equally to tokens deeper in the sequence. 👏 Francesco Pappone


Come chat with Aarush Sah about OpenBench! 🔥 Ask him about: eval saturation, harness influence, LLM-as-a-judge, up-and-coming evals, which evals are useful, open source vs. proprietary models on various evals, and much, much more!


Core takeaway from open-sci-ref 0.01: data quality > scale. A 1.7B model trained on Nemotron-CC-HQ for 1T tokens matches SmolLM2-1.7B trained on ~11T tokens. Further confirms the value of curating high-quality (open) datasets.
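A quick sanity check on the token budgets quoted above (the 1T and ~11T figures are from the tweet; the variable names are just illustrative):

```python
# Reported training-token budgets for the two models.
opensci_tokens = 1e12    # open-sci-ref 1.7B on Nemotron-CC-HQ: 1T tokens
smollm2_tokens = 11e12   # SmolLM2-1.7B: ~11T tokens

# Roughly how many times more data SmolLM2 consumed for similar performance,
# i.e. the data-efficiency gap attributable to dataset quality.
ratio = smollm2_tokens / opensci_tokens
print(f"~{ratio:.0f}x more training tokens for comparable results")
```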


RL finds a way; intelligence is finding the path of least resistance. We just need to make sure the path of least resistance is a useful one.