Marco Mascorro (@mascobot) 's Twitter Profile
Marco Mascorro

@mascobot

Partner @a16z (investor in @cursor_ai, @bfl_ml, @WaveFormsAI & more) | Roboticist | Cofounder @Fellow_AI | prev @BMW | @MIT 35 under 35 | Opinions my own.

ID: 80466845

linkhttps://www.mascobot.com calendar_today07-10-2009 02:11:33

5,5K Tweet

13,13K Followers

2,2K Following

Marco Mascorro (@mascobot) 's Twitter Profile Photo

Open AI Gym, a single-agent open source RL environment released by Open AI in 2016 (now called Gymnasium) was way ahead of its time:

Open AI Gym, a single-agent open source RL environment released by Open AI in 2016 (now called Gymnasium) was way ahead of its time:
Marco Mascorro (@mascobot) 's Twitter Profile Photo

Grok 4 achieves 66.6% on ARC AGI 1. So many folks ignored this benchmark for a long time and in the early days. Congrats to the xAI team

Marco Mascorro (@mascobot) 's Twitter Profile Photo

Seeing these results on ARC-AGI from Grok 4, I am so tempted to spin up different RL environments in the form of similar adjacent games and see how far the (new) small <7B models can get with ARC-AGI. Last November got to the top 1% of the ARC AGI submissions with no RL and very

Seeing these results on ARC-AGI from Grok 4, I am so tempted to spin up different RL environments in the form of similar adjacent games and see how far the (new) small &lt;7B models can get with ARC-AGI.

Last November got to the top 1% of the ARC AGI submissions with no RL and very
Marco Mascorro (@mascobot) 's Twitter Profile Photo

This is a good benchmark, TritonBench. Most LLMs aren’t great at writing Triton code for GPU kernels (there isn’t much Triton code publicly available yet, but it’s definitely increasing):

This is a good benchmark, TritonBench. Most LLMs aren’t great at writing Triton code for GPU kernels (there isn’t much Triton code publicly available yet, but it’s definitely increasing):
Trung Phan (@trungtphan) 's Twitter Profile Photo

Lee Kuan Yew: “Air conditioning was a most important invention for us, perhaps one of the signal inventions of history. It changed the nature of civilization by making development possible in the tropics. Without air conditioning you can work only in the cool early-morning

Lee Kuan Yew: 

“Air conditioning was a most important invention for us, perhaps one of the signal inventions of history. It changed the nature of civilization by making development possible in the tropics. Without air conditioning you can work only in the cool early-morning
Marco Mascorro (@mascobot) 's Twitter Profile Photo

This is quite interesting. The tokens per parameter in LLaMA 4 seem off (to what LLMs are trained on today): We are well over the Chinchilla optimal (20 tokens per param), but Llama 4 Behemoth had (only) 104 Tokens Per Param (TPP), similar to Mistral7B (dense), while Llama 4

Marco Mascorro (@mascobot) 's Twitter Profile Photo

If you be super neat if the labs producing the top OS LLMs like Kimi K2, etc. could release the smaller distilled versions of them (like DeepSeek R1 with the smaller distills they released), so that we all can run the most optimized, on pair speculative decoding

Marco Mascorro (@mascobot) 's Twitter Profile Photo

Can’t wait for Kimi K2 reasoning - K2 (base) seems pretty good even on creative writing. Hopefully we can get better training/sampling efficiencies with RL and can't wait to see how these models perform when RL compute is the vast majority of training (and hopefully when it's