Amaury Hayat (@amaury_hayat) 's Twitter Profile
Amaury Hayat

@amaury_hayat

Professor @EcoleDesPonts IP Paris | AI for math | Control and stabilisation of PDEs | Member @ CIRCLES consortium

ID: 1240430202255261696

linkhttp://cermics.enpc.fr/~hayata/index_en.html calendar_today19-03-2020 00:09:49

264 Tweet

323 Followers

163 Following

Quanquan Gu (@quanquangu) 's Twitter Profile Photo

Very Impressive work! It appears that Multi-Token Prediction (MTP) has a substantial impact for both pre-training and inference.

Very Impressive work! It appears that Multi-Token Prediction (MTP) has a substantial impact for both pre-training and inference.
Nathan Lichtlé (@nathanlichtle) 's Twitter Profile Photo

We ran 5,600 hyperparameter sweeps to compare RL algorithms on hidden-information games with billions of states. In our benchmark, we found that properly tuned policy gradient methods, such as PPO, performed the best. Paper: arxiv.org/abs/2502.08938

Jia Li (@jiali52524397) 's Twitter Profile Photo

We believe formal math is the future. 🔥Introducing Kimina-Prover Preview, a Numina & Kimi.ai collaboration, the first large formal reasoning model for Lean 4, achieving 80.78% miniF2F. github.com/MoonshotAI/Kim…

We believe formal math is the future.
🔥Introducing Kimina-Prover Preview, a Numina &
<a href="/Kimi_Moonshot/">Kimi.ai</a>  collaboration, the first large formal reasoning model for Lean 4, achieving 80.78% miniF2F.
github.com/MoonshotAI/Kim…
Amaury Hayat (@amaury_hayat) 's Twitter Profile Photo

Hey OpenAI ! how can I get access API access to o3-mini ? I'm a Tier 5 user and it would be very helpful for my projects (in particular some research projects) !

Amaury Hayat (@amaury_hayat) 's Twitter Profile Photo

Assistant professor level-position in machine learning in my university ! Funded by Hi! PARIS, and reduced teaching duties :) More info here: lnkd.in/er2Sfn_d. The application deadline is May 9. International applications are welcome!

Alberto Alfarano (@albe_alfa) 's Twitter Profile Photo

Huge work with Zeyuan Allen-Zhu, Sc.D.: we build a synthetic arena to really understand the differences between model architectures and we introduce a simple but effective Canon layer to boost reasoning. Check it out!

Amaury Hayat (@amaury_hayat) 's Twitter Profile Photo

So cool! I've been looking forward to it! It's also very cool and beneficial to the research community to have a document that explains everything in detail. Hats off to Mistral!