Alex Cheema - e/acc (@alexocheema)'s Twitter Profile
Alex Cheema - e/acc

@alexocheema

Building @exolabs | prev @UniOfOxford | We're hiring: exolabs.net

ID: 915614943797551104

Link: https://github.com/exo-explore/exo · Joined: 04-10-2017 16:29:48

4.4K Tweets

36.36K Followers

2.2K Following

Matt Beton (@mattbeton):

are there reasoning models that are better at big-picture thinking/planning, versus lower-level implementation? i’m wondering whether a hybrid strategy of models would be optimal; my workflow at the moment is to plan and theorise with o3, then cursor+claude 4 for implementation
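This plan-then-implement split is easy to prototype. Below is a minimal sketch of such a hybrid pipeline; `call_model` is a hypothetical stand-in for whatever provider SDK you actually use, and the model identifiers are simply the ones named in the tweet.

```python
# Sketch of a hybrid "planner + implementer" model workflow.
# `call_model` is a placeholder: wire it to your provider's SDK
# (OpenAI, Anthropic, etc.). Model names below are illustrative.

def call_model(model: str, prompt: str) -> str:
    """Hypothetical helper: send `prompt` to `model`, return its text reply."""
    raise NotImplementedError("connect this to your provider's API client")

def plan_then_implement(task: str) -> str:
    # Stage 1: a big-picture reasoning model produces a design/plan.
    plan = call_model(
        "o3",
        f"Produce a step-by-step implementation plan for:\n{task}",
    )
    # Stage 2: an implementation-focused model turns the plan into code.
    return call_model(
        "claude-4",  # assumed identifier; substitute your provider's real one
        f"Implement the following plan. Output only code.\n\nPlan:\n{plan}",
    )
```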

Eric Buess (@ericbuess):

I remember Alex Cheema - e/acc's viral video of a small Mac Mini cluster: "Nemotron 70B at 8 tok/sec and scales to Llama 405B". I requested benchmarks, discovered how awesome Alex is via Zoom, and built the initial stages of benchmarks.exolabs.net.

Now Exo v2 launch incoming!

Matt Beton (@mattbeton):

it was a pleasure to be asked to give a talk on our paper 'SPARTA' at ICLR 2025. distributed training isn't a fantasy any more; with algorithmic improvements like this, training models over low-bandwidth environments becomes a reality. read the paper here: openreview.net/forum?id=stFPf…
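The core idea behind communication-efficient schemes in this vein is that workers exchange only a small fraction of their parameters at a time. Here is a toy numpy sketch of sparse parameter averaging; it illustrates the general idea only, not the actual SPARTA algorithm, so see the paper for the real method.

```python
import numpy as np

def sparse_average_step(local_params, peer_params, fraction=0.005, rng=None):
    """Average a small random fraction of a flat parameter vector with a peer.

    Toy illustration of sparse parameter averaging: exchanging only
    `fraction` of the weights per step keeps communication cheap enough
    for low-bandwidth links. Not the algorithm from the SPARTA paper.
    """
    rng = rng or np.random.default_rng(0)  # in practice both peers must pick
    n = local_params.size                  # the same indices, e.g. shared seed
    idx = rng.choice(n, size=max(1, int(fraction * n)), replace=False)
    local_params[idx] = 0.5 * (local_params[idx] + peer_params[idx])
    return local_params
```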

Alex Cheema - e/acc (@alexocheema):

great things come from cold dms
Naval cold dm'd me and invested in EXO Labs
hired Matt Beton after i cold dm'd him
our first customer came from a cold dm
you can just do things

Tycho van der Ouderaa (@tychovdo):

Thrilled to share that I’ve started my new role as a Senior Engineer at Qualcomm Research in Amsterdam. I’ll be joining the Model Efficiency team, where I’ll continue research on quantization and compression techniques for machine learning and AI. Qualcomm Qualcomm Research & Technologies

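Affine quantization, one of the compression techniques mentioned, is compact enough to show inline. Below is a textbook int8-style (scale + zero-point) quantizer in plain numpy; it is a generic illustration of the technique, not anything specific to Qualcomm's research.

```python
import numpy as np

def quantize_uint8(x: np.ndarray):
    """Affine quantization: map floats to uint8 via a scale and zero-point."""
    lo, hi = float(x.min()), float(x.max())
    scale = (hi - lo) / 255.0 if hi > lo else 1.0
    zero_point = round(-lo / scale)
    q = np.clip(np.round(x / scale) + zero_point, 0, 255).astype(np.uint8)
    return q, scale, zero_point

def dequantize(q, scale, zero_point):
    """Recover approximate floats; error is bounded by scale/2 per element."""
    return (q.astype(np.float32) - zero_point) * scale
```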
Alex Cheema - e/acc (@alexocheema):

We’re already doing this with EXO Labs

Last month was the first trial: we provided free M-chip public cloud access to developers at a hackathon. These were M3 Max/Ultra Mac Studios with up to 512GB unified memory.

Awni Hannun gave a talk at the hackathon on how to leverage MLX

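For readers who haven't tried it, here is a minimal taste of MLX, assuming it is installed (`pip install mlx`; Apple Silicon only). Arrays are evaluated lazily and live in unified memory, so there are no explicit host/device transfers.

```python
import mlx.core as mx

# Two large matrices; with unified memory the CPU and GPU see the same buffer.
a = mx.random.normal((4096, 4096))
b = mx.random.normal((4096, 4096))

c = a @ b    # operations are recorded lazily
mx.eval(c)   # force evaluation (runs on the GPU by default)
print(c.shape)
```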
Alex Cheema - e/acc (@alexocheema):

pump is one of the fastest growing startups ever. 0 to $1B ARR in 9 months. 25% of revenue going to $PUMP buybacks is insane; i'm predicting this ends up in the top 10.

Josh Lavorini (@jrlavorini):

if they ever tell my story, let them say I walked with giants; men rise and fall like the winter wheat, but these names will never die.

Alex Cheema - e/acc (@alexocheema):


A new approach to efficient large scale distributed training on Apple Silicon.

Most AI research today is focused on traditional GPUs. These GPUs have a LOT of FLOPS but not much memory, i.e. a low memory:FLOPS ratio. Apple Silicon has a lot more memory available to the GPU
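A quick back-of-the-envelope comparison makes the ratio concrete. The figures below are rough public numbers used purely for illustration (~1000 dense FP16 TFLOPS and 80GB for an H100; ~28 FP16 TFLOPS and 512GB of unified memory for an M3 Ultra Mac Studio):

```python
# GB of memory per TFLOP/s of compute, using rough public spec numbers.
specs = {
    "NVIDIA H100 (80GB)": (80, 1000),
    "M3 Ultra Mac Studio (512GB)": (512, 28),
}
for name, (mem_gb, tflops) in specs.items():
    print(f"{name}: {mem_gb / tflops:.2f} GB per TFLOP/s")
# H100 comes out around 0.08 GB/TFLOP; the Mac Studio around 18 GB/TFLOP.
```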