Thomas Scialom (@thomasscialom) 's Twitter Profile
Thomas Scialom

@thomasscialom

AGI Researcher @MetaAI -- I led Llama 2, built post-training from scratch. Also Toolformer, GAIA, Llama-3.0, CodeLlama, Galactica. Now working on Agents.

ID: 942694791707545600

Link: https://www.linkedin.com/in/tscialom/ · Joined: 18-12-2017 09:55:27

1.1K Tweets

7.7K Followers

224 Following

Thomas Scialom (@thomasscialom) 's Twitter Profile Photo

In fact, there is on the Perplexity demo a specific system prompt that amplifies over-safe responses. It has been removed from other demos like HF. Perplexity Denis Yarats, could we deactivate it as well by default, please?

Thomas Scialom (@thomasscialom) 's Twitter Profile Photo

It did in fact. RLHF is the technology behind ChatGPT and probably DALL·E 3. To pan out on real-world problems, it needed nothing more than human-feedback rewards.

Thomas Scialom (@thomasscialom) 's Twitter Profile Photo

I strongly disagree. There are many paths to success, and doing a PhD is never a suboptimal choice, both professionally and personally.

Thomas Scialom (@thomasscialom) 's Twitter Profile Photo

At the AI-pulse today I talked about -- surprise -- LLMs. Their short history, a deep dive into Llama 2, the magic behind RLHF, and my vision of the future of the field. Thanks Scaleway for the opportunity!

AK (@_akhaliq) 's Twitter Profile Photo

GAIA: a benchmark for General AI Assistants

paper page: huggingface.co/papers/2311.12…

introduce GAIA, a benchmark for General AI Assistants that, if solved, would represent a milestone in AI research. GAIA proposes real-world questions that require a set of fundamental abilities such

Thomas Scialom (@thomasscialom) 's Twitter Profile Photo

Chinchilla, despite being an amazing paper, was not (and could not be) open-sourced. Llama-1 now has more than 10x the citations of Chinchilla.

Thomas Scialom (@thomasscialom) 's Twitter Profile Photo

Delighted to finally introduce Llama 3: the most capable openly available LLM to date. Long journey since Llama-2, a big shoutout to the incredible team effort that made this possible, and stay tuned, we will keep building🦙 ai.meta.com/blog/meta-llam…

Thomas Scialom (@thomasscialom) 's Twitter Profile Photo

We had a small party to celebrate Llama-3 yesterday in Paris! The entire LLM OSS community joined us, with Hugging Face, kyutai, Google DeepMind (Gemma), cohere. As someone said: better that the building remains safe, or ciao to open source for AI 😆

Thomas Scialom (@thomasscialom) 's Twitter Profile Photo

I am at ICLR!
🦙 Llama-3: I'll be there every morning at 11am at AI at Meta for Llama-3 QA sessions
🤖 GAIA: General AI Assistant benchmark w/ Gregoire
🔭 NOUGAT: for Scientific OCR w/ Lukas
And if you are interested in post-training, RLHF, agents, I'm down for ☕&🍺 ICLR 2026

Thomas Scialom (@thomasscialom) 's Twitter Profile Photo

The team worked really hard to make history, voilà, finally the Llama-3.1 herd of models... have fun with it!
  * open 405B, insane 70B
  * 128K context length, improved reasoning & coding capabilities
  * detailed paper ai.meta.com/research/publi…

Latent.Space (@latentspacepod) 's Twitter Profile Photo

🆕 pod with Thomas Scialom of AI at Meta!
Llama 2, 3 & 4: Synthetic Data, RLHF, Agents on the path to Open Source AGI
latent.space/p/llama-3
shoutouts:
- Why Yann LeCun's Galactica Instruct would have solved Lucas Beyer (bl16)'s Citations Generator
- Beyond Chinchilla-Optimal: 100x

Miles Brundage (@miles_brundage) 's Twitter Profile Photo

Are we failing to grasp how big Internet-scale data is/how far interpolation on it goes? Are we underappreciating how fast GPUs are or how good backprop is? Are we overestimating the difference between the stuff we do vs what animals do + they’re similar in some deep sense? Etc.

Deedy (@deedydas) 's Twitter Profile Photo

All languages convey information at a similar rate when spoken (39 bits/s). Languages that are spoken faster have less information density per syllable! One of the coolest results in linguistics.

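The tradeoff behind this result is just arithmetic: information rate equals speech rate times information density. A minimal sketch with made-up numbers (illustrative only, not the study's per-language figures) shows how a fast, low-density language and a slow, high-density one can land on the same ~39 bits/s:

```python
# Toy illustration: information rate (bits/s) =
#   speech rate (syllables/s) * information density (bits/syllable)
# The numbers below are hypothetical, chosen only to show the tradeoff.

languages = {
    "fast-spoken": (7.8, 5.0),  # faster speech, fewer bits per syllable
    "slow-spoken": (5.2, 7.5),  # slower speech, more bits per syllable
}

for name, (syllables_per_s, bits_per_syllable) in languages.items():
    rate = syllables_per_s * bits_per_syllable
    print(f"{name}: {rate:.1f} bits/s")  # both converge near 39 bits/s
```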
Thomas Scialom (@thomasscialom) 's Twitter Profile Photo

Achieving AGI inherently leads to ASI because AI capability scales directly with compute and digital infrastructure. Think of Google Search already vastly outperforming humans at search. Now imagine applying the same scale to a truly intelligent system.