Yacine Jernite (@yjernite) 's Twitter Profile
Yacine Jernite

@yjernite

Head of ML & Society @huggingface, NLPer at heart, focusing on data and ML systems governance these days
he/they
#BlackLivesMatter

ID: 1218217239213723648

Joined: 17-01-2020 17:03:30

4.4K Tweets

3.3K Followers

1.1K Following

Hynek Kydlíček (@hkydlicek) 's Twitter Profile Photo

We are releasing 📄 FinePDFs: the largest PDF dataset, spanning over half a billion documents!

- Long context: Documents are 2x longer than web text
- 3T tokens from high-demand domains like legal and science.
- Heavily improves over SoTA when mixed with FW-EDU&DCLM web corpora.

elie (@eliebakouch) 's Twitter Profile Photo

These kinds of evals are very interesting, and I wish there were a lot more. I'm not in the team that thinks a high score is actually good (because it goes against instruction following), but it's great for monitoring and understanding the different flavors of post-trained models. It's

Deb Raji (@rajiinio) 's Twitter Profile Photo

This reply from Karen is spot on. AI discourse has effectively devolved into mindless chatter because of the anchoring to a shared mythology - everyone (boosters, doomers, even some critics) is endlessly debating & dissecting a version of the technology that doesn't even exist.

Lucie-Aimée Kaffee (@frimelle) 's Twitter Profile Photo

I’m thrilled that our INTIMA benchmark, developed to study how AI models handle companionship-like interactions, was featured in Forbes last week. Nothing is quite as amazing as seeing your work not only used but also picked up by journalists to reach a wider audience. The

David Louapre (@dlouapre) 's Twitter Profile Photo

🚀 Life update: I’ve joined 🤗Hugging Face as AI Scientist & Educator, starting a new track on **Mechanistic Interpretability of LLMs** 🧠🤖 Over the past 7 years at Ubisoft 🎮, I explored how AI, science & gameplay intersect. I worked on cutting-edge LLM-powered NPCs,

MMitchell (@mmitchell_ai) 's Twitter Profile Photo

🤖 As AI-generated content is shared in movies/TV/across the web, there's one simple low-hanging fruit 🍇 to help know what's real: Visible watermarks. With others at Hugging Face, I've made sure it's trivially easy to add this disclosure to images, video, chatbot text. See how:

Andi Marafioti (@andimarafioti) 's Twitter Profile Photo

SmolDocling just got a HUGE improvement, meet GraniteDocling!🚀
Improved performance in all the ways that matter: multilingual, more reliable, but still tiny at 258M params!🤏
It's lightning fast, processing a page in 0.35 sec on a consumer GPU using < 500MB VRAM⚡

Clémentine Fourrier 🍊 (@clefourrier) 's Twitter Profile Photo

Updated the evaluation guidebook with a new deep dive! 2025 panorama of all the important and next level evaluations that you need to know to build *actually impactful and useful* models! (Assistant tasks, games, forecasting, and more) Tell me wyt! :) github.com/huggingface/ev…

Lewis Tunstall (@_lewtun) 's Twitter Profile Photo

By far the most concise and informative guide on post-training evals I've seen in a long time - highly recommended reading!

Kyunghyun Cho (@kchonyc) 's Twitter Profile Photo

when you give up on this nebulous idea and illusion of prestige, you will finally find peace and freedom. submit to TMLR and JMLR.

Lucie-Aimée Kaffee (@frimelle) 's Twitter Profile Photo

Reuters just reported that Meta will soon use generative AI interactions to target ads across Facebook and Instagram. That’s exactly the kind of shift we explore in our blogpost: 👉 Advertisement, Privacy, and Intimacy: Lessons from Social Media for Conversational AI with

Alexia Jolicoeur-Martineau (@jm_alexia) 's Twitter Profile Photo

New paper 📜: Tiny Recursion Model (TRM) is a recursive reasoning approach with a tiny 7M parameters neural network that obtains 45% on ARC-AGI-1 and 8% on ARC-AGI-2, beating most LLMs. Blog: alexiajm.github.io/2025/09/29/tin… Code: github.com/SamsungSAILMon… Paper: arxiv.org/abs/2510.04871

Brigitte 🤗 (@brigittetousi) 's Twitter Profile Photo

The Hugging Face Science team is in Montreal for COLM 2025 🍁! Rumour has it, limited-edition tees are up for grabs for top Hub contributors, ranging from smolLM to XL. 😉

clem 🤗 (@clementdelangue) 's Twitter Profile Photo

So proud to see Reachy Mini named one of the Best Inventions of 2025 by TIME! Huge credit to the Pollen Robotics and Hugging Face teams, turning a concept into thousands of units sold and shipped in under 6 months. We might not be as slick as some other robotics companies (we

Giada Pistilli (@giadapistilli) 's Twitter Profile Photo

I spoke with MIT Technology Review about one of the hardest design questions in conversational AI: should an AI ever be allowed to hang up on a human? In the piece by James O'Donnell, we discussed how cutting users off can be harmful when strong emotional bonds or dependencies have formed.

clem 🤗 (@clementdelangue) 's Twitter Profile Photo

We just released the beta version of the open-source software for Reachy Mini! It means that anyone, thanks to the amazing Google DeepMind MuJoCo simulation platform, can start building Hugging Face Spaces, datasets and models, even if you haven't received your robot yet.
