Clémentine Fourrier 🍊 (@clefourrier) Twitter Tweets • TwiCopy

Gate.io

5 hours ago

🔥The 9th Round of Easy Loan, Earn $40 Reward is in progress❗️ ⏰ Promotion Period: January 15th - Feburary 15th, 2025 👉 Register now and check more details at gate.io/campaigns/358

thumb_up_off_alt34

chat_bubble_outline39

repeat6

shareShare

🚀 Big news in healthcare AI! I'm thrilled to announce the launch of OpenMed on Hugging Face, releasing 380+ state-of-the-art medical NER models for free under Apache 2.0. And this is just the beginning! 🧵

🚀 Big news in healthcare AI! I'm thrilled to announce the launch of OpenMed on <a href="/huggingface/">Hugging Face</a>, releasing 380+ state-of-the-art medical NER models for free under Apache 2.0.

And this is just the beginning! 🧵

thumb_up_off_alt1,1K

chat_bubble_outline65

repeat305

shareShare

Adrien Carreira

@xcid_

9 days ago

Starting today you can run any of the 100K+ GGUFs on Hugging Face directly with Docker Run! All of it one single line: docker model run hf.co/bartowski/Llam… Excited to see how y'all will use it

thumb_up_off_alt242

chat_bubble_outline8

repeat44

shareShare

Clémentine Fourrier 🍊

@clefourrier

8 days ago

Can LLMs predict the future? In FutureBench, friends from Together AI create new questions from evolving news & markets: As time passes, we'll see which agents are the best at predicting events that have yet to happen! 🔮 Also cool: by design, dynamic & uncontaminated eval

Can LLMs predict the future?

In FutureBench, friends from <a href="/togethercompute/">Together AI</a> create new questions from evolving news & markets:
As time passes, we'll see which agents are the best at predicting events that have yet to happen! 🔮

Also cool: by design, dynamic & uncontaminated eval

thumb_up_off_alt35

chat_bubble_outline2

repeat8

shareShare

Together AI

@togethercompute

8 days ago

Most AI benchmarks test the past. But real intelligence is about predicting the future. Introducing FutureBench — a new benchmark for evaluating agents on real forecasting tasks that we developed with Hugging Face 🔍 Reasoning > memorization 📊 Real-world events 🧠 Dynamic,

thumb_up_off_alt89

chat_bubble_outline5

repeat17

shareShare

steven

@tu7uruu

8 days ago

Just dropped on the Open ASR Leaderboard: Canary-Qwen-2.5, the latest and first-of-its-kind ASR model from the NVIDIA NeMo team. > Ranked #1 on the Open ASR Leaderboard with a WER of just 5.63% > Blazing fast with RTFx=418 on an A100 GPU for a 2.5b model! > Released under a

thumb_up_off_alt134

chat_bubble_outline1

repeat23

shareShare

ARC Prize

@arcprize

7 days ago

Today, we're announcing a preview of ARC-AGI-3, the Interactive Reasoning Benchmark with the widest gap between easy for humans and hard for AI We’re releasing: * 3 games (environments) * $10K agent contest * AI agents API Starting scores - Frontier AI: 0%, Humans: 100%

thumb_up_off_alt1,1K

chat_bubble_outline61

repeat218

shareShare

ARC Prize

@arcprize

7 days ago

ARC-AGI-3 Preview games need to be pressure tested. We’re hosting a 30-day agent competition in partnership with Hugging Face We’re calling on the community to build agents (and win money!) arcprize.org/competitions/a…

ARC-AGI-3 Preview games need to be pressure tested. We’re hosting a 30-day agent competition in partnership with <a href="/huggingface/">Hugging Face</a>

We’re calling on the community to build agents (and win money!)

arcprize.org/competitions/a…

thumb_up_off_alt107

chat_bubble_outline2

repeat5

shareShare

Mikhail Samin

@mihonarium

6 days ago

🚨 According to a friend, the IMO asked AI companies not to steal the spotlight from kids and to wait a week after the closing ceremony to announce results. OpenAI announced the results BEFORE the closing ceremony. According to a Coordinator on Problem 6, the one problem OpenAI

thumb_up_off_alt1,1K

chat_bubble_outline29

repeat115

shareShare

Jason Kint

@jason_kint

6 days ago

This A1 story on Meta’s data centers, and others, use of water in an age of AI is incredible. /1

thumb_up_off_alt1,1K

chat_bubble_outline81

repeat779

shareShare

paris martineau

@parismartineau

5 days ago

stop 👏 anthropomorphizing 👏 the 👏 LLM 👏

thumb_up_off_alt11,11K

chat_bubble_outline42

repeat597

shareShare

Lewis Tunstall

@_lewtun

5 days ago

An under appreciated fact about using formal methods like Lean is that it enables large-scale *collaboration* among mathematicians & potentially future AI agents. Why? Well, you can decompose a large proof into separate components that can be proven independently with robust

thumb_up_off_alt48

chat_bubble_outline1

repeat7

shareShare

Georgia Channing

@cgeorgiaw

5 days ago

data of the day: just dropped a big snapshot of polar elevation data on Hugging Face. 1000s of TIFFs and metadata to 32m resolution perfect for climate research, mapping, and geospatial modeling check it out: huggingface.co/datasets/cgeor… if people like this data, maybe i'll make a

data of the day:

just dropped a big snapshot of polar elevation data on <a href="/huggingface/">Hugging Face</a>. 1000s of TIFFs and metadata to 32m resolution perfect for climate research, mapping, and geospatial modeling

check it out: huggingface.co/datasets/cgeor…

if people like this data, maybe i'll make a

thumb_up_off_alt8

chat_bubble_outline2

repeat2

shareShare

Georgia Channing

@cgeorgiaw

4 days ago

very proud that my work on multi-agent debate for misinformation detection won best paper award at the ICML Conference CFAgentic workshop! check it out on arxiv: arxiv.org/abs/2410.20140 v grateful to all my co-authors and the support from BBC Research & Development 🥳

very proud that my work on multi-agent debate for misinformation detection won best paper award at the <a href="/icmlconf/">ICML Conference</a> CFAgentic workshop!

check it out on arxiv: arxiv.org/abs/2410.20140

v grateful to all my co-authors and the support from <a href="/BBCRD/">BBC Research & Development</a> 🥳

thumb_up_off_alt20

chat_bubble_outline1

repeat2

shareShare

Loubna Ben Allal

@loubnabenallal1

4 days ago

500k samples of multilingual post-training data in 5 languages: French, Spanish, Italian, German and Portuguese. To address the lack of multilingual post-training datasets, we created these samples and found they improve performance on benchmarks like Global MMLU, Belebele, and

thumb_up_off_alt174

chat_bubble_outline3

repeat28

shareShare

Greg Kamradt

@gregkamradt

3 days ago

Anyone have a connection at Qwen? Trying to reproduce the results on ARC Prize and getting different metrics Want to get a hold of them and find out how they tested

thumb_up_off_alt62

chat_bubble_outline10

repeat6

shareShare

tokenbender

@tokenbender

3 days ago

signatures to look for in ai writing - > "it isn't just x, it is y" > narrative-philosophical-poetic section headings "The XYZ - A Journey of ABC" > overuse of symbolism and lofty adjectives - "stands as a testament", "plays a vital role", "underscores its importance" >

thumb_up_off_alt192

chat_bubble_outline19

repeat8

shareShare

Wolfram Ravenwolf

@wolframrvnwlf

2 days ago

I'm now using Qwen3-Coder in Claude Code. Works with any model actually, but this is surely the best one currently. There are a bunch of proxies on GitHub that make this possible, but none worked well enough for me, so I implemented this myself using LiteLLM. Guide in comments:

thumb_up_off_alt390

chat_bubble_outline14

repeat45

shareShare

Clémentine Fourrier 🍊

Gate.io

Maziyar PANAHI

Adrien Carreira

Clémentine Fourrier 🍊

Together AI

steven

ARC Prize

ARC Prize

Mikhail Samin

Jason Kint

paris martineau

Lewis Tunstall

Georgia Channing

Georgia Channing

Loubna Ben Allal

Greg Kamradt

tokenbender

Wolfram Ravenwolf