Soufiane Hayou (@hayou_soufiane)'s Twitter Profile
Soufiane Hayou

@hayou_soufiane

Researcher @SimonsInstitute, UC Berkeley. PhD @oxfordstats and MSc&EngD @Polytechnique. I like to scale up things!

ID: 1277309540313300995

Link: http://www.soufianehayou.com · Joined: 28-06-2020 18:35:13

289 Tweets

791 Followers

275 Following

Soufiane Hayou (@hayou_soufiane)'s Twitter Profile Photo

🔥 Great to see LoRA+ getting so much attention! Pro tip: Since LoRA+ gains are (mostly) "orthogonal" to other methods, combining it with different LoRA variants could give even better results 🚀 Also: LoRA+ is now part of HuggingFace (PEFT). #AI #LLMs #LoRA #finetuning
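
The PEFT integration mentioned above is exposed through an optimizer helper. A minimal sketch, assuming the create_loraplus_optimizer API from recent PEFT releases; the base model, rank, and learning rates below are illustrative placeholders, not values from the tweet:

```python
# A minimal sketch of LoRA+ via PEFT's optimizer helper (illustrative values).
import torch
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model
from peft.optimizers import create_loraplus_optimizer

model = AutoModelForCausalLM.from_pretrained("gpt2")  # placeholder base model
model = get_peft_model(model, LoraConfig(task_type="CAUSAL_LM", r=8, lora_alpha=16))

# LoRA+ trains the B matrices with a larger learning rate than the A matrices;
# loraplus_lr_ratio sets lr_B / lr_A (the paper suggests ratios around 2^4).
optimizer = create_loraplus_optimizer(
    model=model,
    optimizer_cls=torch.optim.AdamW,
    lr=5e-5,               # learning rate for the A matrices
    loraplus_lr_ratio=16,  # B matrices get 16x this learning rate
)
```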

NeurIPS Conference (@neuripsconf)'s Twitter Profile Photo

Due to a high demand for registrations, NeurIPS will be moving towards a randomized lottery system, effective immediately. Authors of accepted conference and workshop papers are still guaranteed registration, but this may change as we release spots to the lottery, so we urge…

Soufiane Hayou (@hayou_soufiane)'s Twitter Profile Photo

Saying 'LLMs are just next token predictors' is like saying 'A polynomial function is just a sum of monomials (x^k)'. Scale is key - with the right scale, a polynomial function can approximate incredibly complex behaviors. 🧮🤖
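
A tiny numerical sketch of the analogy (the target function and degrees are arbitrary choices): a plain sum of monomials tracks an increasingly complex function as the degree, i.e. the scale, grows.

```python
# Illustrative only: a polynomial is "just" a sum of monomials x^k,
# yet with enough terms (scale) it approximates complex behavior.
import numpy as np

x = np.linspace(-1, 1, 400)
target = np.sin(5 * x) * np.exp(x)  # an arbitrary "complex" target

for degree in (2, 8, 16):
    coeffs = np.polyfit(x, target, degree)                # least-squares fit
    err = np.max(np.abs(np.polyval(coeffs, x) - target))  # worst-case error
    print(f"degree {degree:>2}: max error {err:.2e}")
# The max error drops by orders of magnitude as the degree increases.
```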

Soufiane Hayou (@hayou_soufiane)'s Twitter Profile Photo

Said 'wassup' to Gemini 2.0 Flash Thinking and I think I gave it social anxiety... it wrote a whole research paper on how to respond casually 😭

Soufiane Hayou (@hayou_soufiane)'s Twitter Profile Photo

People compare AI to past breakthroughs 🔄 (the industrial revolution, the internet, etc.), but there's a crucial difference: in previous advancements, humans remained the most intelligent beings. This time, we're creating something that could surpass us 🤖. It's a singularity! ⚡️

Soufiane Hayou (@hayou_soufiane)'s Twitter Profile Photo

100%! For instance, if you have a good understanding of the concentration of random variables, you should be able to infer (without much engineering) how to scale the init and learning rate with width (µP, Mean-Field, etc.), or with depth (Stable ResNet, Depth-µP, etc.)
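
A deliberately crude sketch of that inference, assuming a plain MLP trained with Adam; the constants and the single model-wide learning rate are simplifications (full µP treats input and output layers differently):

```python
# Width-aware init and learning-rate scaling in the spirit of µP (illustrative).
import torch
import torch.nn as nn

def make_mlp(width: int, base_width: int = 256, base_lr: float = 1e-3):
    mlp = nn.Sequential(
        nn.Linear(128, width), nn.ReLU(),
        nn.Linear(width, width), nn.ReLU(),
        nn.Linear(width, 10),
    )
    # Init variance ~ 1/fan_in: by concentration, each preactivation (a sum
    # of fan_in roughly independent terms) stays O(1) as width grows.
    for m in mlp:
        if isinstance(m, nn.Linear):
            nn.init.normal_(m.weight, std=m.in_features ** -0.5)
            nn.init.zeros_(m.bias)
    # With Adam-style updates, shrinking the learning rate like 1/width keeps
    # the feature updates O(1) regardless of width.
    lr = base_lr * base_width / width
    return mlp, torch.optim.Adam(mlp.parameters(), lr=lr)

# Hyperparameters tuned at base_width then transfer (approximately) to any width.
model, opt = make_mlp(width=1024)
```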

Soufiane Hayou (@hayou_soufiane)'s Twitter Profile Photo

I had a similar experience. I have a feeling that these systems will probably do most of what current PhD students can do. Strong PhD students will benefit from this by effectively using and directing these systems. Interesting times.

Soufiane Hayou (@hayou_soufiane)'s Twitter Profile Photo

Scale is the only permanent feature of state-of-the-art AI models. All other characteristics are subject to change and innovation.

Soufiane Hayou (@hayou_soufiane)'s Twitter Profile Photo

It seems that most gains from RL come from the pretrained model itself. The format-reward stuff (GRPO, etc.) just extracts those capabilities. A good reward signal helps, but it's not the main ingredient.

Thomas Wolf (@thom_wolf)'s Twitter Profile Photo

The conference is getting crazy over it. Today we're unveiling our 1st robot: Hugging Face 🤝 Pollen Robotics. A low-cost $250 open-source robot, designed as an open-source platform for fun human-computer interactions, powered by HF Spaces, models, and the community > discord.com/invite/jsvMRQx…

Soufiane Hayou (@hayou_soufiane)'s Twitter Profile Photo

The current debate on reasoning in LLMs:
Group A: "We see it, we feel it, therefore it exists."
Group B: "We don't see it, we don't feel it, therefore it doesn't exist."
Group C (<0.01%): "What is reasoning?"