Mohamed Osman (@mohamedosmanml) 's Twitter Profile
Mohamed Osman

@mohamedosmanml

AI researcher @ MindsAI/Tufa Labs

ID: 4816498943

calendar_today15-01-2016 18:20:24

48 Tweet

554 Followers

173 Following

ARC Prize (@arcprize) 's Twitter Profile Photo

ARC Prize builds on the legacy of past competitions co-hosted by François Chollet & Lab42. Thank you to Rolf Pfister, Hansueli Jud, and Oliver Schmid for their invaluable contributions this year. Thank you to ARC-AGI evangelists Michael Hodel, Jack Cole, Mohamed Osman,

Mohamed Osman (@mohamedosmanml) 's Twitter Profile Photo

✈️ Heading to #NeurIPS2024 today! Excited to discuss ARC, reasoning, test-time tuning, other test-time methods, anything and everything. DM me if you’re around and want to chat!

Akira Yoshiyama ⁂ (@yoshiyama_akira) 's Twitter Profile Photo

Happy to announce we outperformed OpenAI o1 with a 7B model :) We released two self-improvement methods for verifiable domains in our preliminary paper -->

Happy to announce we outperformed <a href="/OpenAI/">OpenAI</a> o1 with a 7B model :)

We released two self-improvement methods for verifiable domains in our preliminary paper --&gt;
Mohamed Osman (@mohamedosmanml) 's Twitter Profile Photo

Honored to be a guest on the infamous MLST podcast again! We discuss our test-time methods, compositionality in LLMs, limitations of VLMs, logic vs perception, efficient adaptation, and more. Machine Learning Street Talk youtu.be/3p0O28W1ZHg

Toby Simonds (@tobyrsimonds) 's Twitter Profile Photo

🚀 NEW RESEARCH 🚀 🧠 To push RL further we need a lot more questions. The issue? Current datasets have only a few hundred thousand—and for domains outside math, there's barely anything. Our breakthrough? Turning everyday textbooks into limitless RL training gold 📚✨ A thread on

Toby Simonds (@tobyrsimonds) 's Twitter Profile Photo

🚀 New paper: LLMs for Engineering: Teaching Models to Design High-Powered Rockets 🚀 We built an environment to allow models to build high powered rockets and show by using RL models can surpass human designs!

🚀 New paper: LLMs for Engineering: Teaching Models to Design High-Powered Rockets 🚀

We built an environment to allow models to build high powered rockets and show by using RL models can surpass human designs!
Arnaud Bertrand (@rnaudbertrand) 's Twitter Profile Photo

I just read this WSJ article on why Europe's tech scene is so much smaller than the US's and China's. I'm afraid that, like most articles on this topic, it largely misses the mark. Which in itself illustrates a key reason why Europe is lagging behind: when you fail to

I just read this WSJ article on why Europe's tech scene is so much smaller than the US's and China's.

I'm afraid that, like most articles on this topic, it largely misses the mark.

Which in itself illustrates a key reason why Europe is lagging behind: when you fail to
Kevin Ellis (@ellisk_kellis) 's Twitter Profile Photo

New paper: World models + Program synthesis by Wasu Top Piriyakulkij 1. World modeling on-the-fly by synthesizing programs w/ 4000+ lines of code 2. Learns new environments from minutes of experience 3. Positive score on Montezuma's Revenge 4. Compositional generalization to new environments

Rohan Paul (@rohanpaul_ai) 's Twitter Profile Photo

Deep learning alone now cracks 58% of the hidden ARC test after adding on‑the‑fly tuning, proving the paradigm can invent new abstractions during inference. The work shows that a neural network can tackle ARC once the optimizer is treated as part of inference, meaning the model

Deep learning alone now cracks 58% of the hidden ARC test after adding on‑the‑fly tuning, proving the paradigm can invent new abstractions during inference.

The work shows that a neural network can tackle ARC once the optimizer is treated as part of inference, meaning the model