
Clémentine Fourrier 🍊
@clefourrier
Evals @HuggingFace 🐍✨
"The future is already here, it’s just not very evenly distributed" (Gibson)
ID: 1188812448767336449
http://clefourrier.github.io 28-10-2019 13:39:51
3.3K Tweets
5.5K Followers
378 Following

🚀 Big news in healthcare AI! I'm thrilled to announce the launch of OpenMed on Hugging Face, releasing 380+ state-of-the-art medical NER models for free under Apache 2.0. And this is just the beginning! 🧵



Can LLMs predict the future? In FutureBench, friends from Together AI create new questions from evolving news & markets: As time passes, we'll see which agents are the best at predicting events that have yet to happen! 🔮 Also cool: by design, dynamic & uncontaminated eval


Most AI benchmarks test the past. But real intelligence is about predicting the future. Introducing FutureBench: a new benchmark for evaluating agents on real forecasting tasks that we developed with Hugging Face
🔍 Reasoning > memorization
📊 Real-world events
🧠 Dynamic,




ARC-AGI-3 Preview games need to be pressure tested. We’re hosting a 30-day agent competition in partnership with Hugging Face. We’re calling on the community to build agents (and win money!): arcprize.org/competitions/a…






data of the day: just dropped a big snapshot of polar elevation data on Hugging Face. 1000s of TIFFs and metadata to 32m resolution, perfect for climate research, mapping, and geospatial modeling. check it out: huggingface.co/datasets/cgeor…
if people like this data, maybe i'll make a


very proud that my work on multi-agent debate for misinformation detection won best paper award at the ICML Conference CFAgentic workshop! check it out on arxiv: arxiv.org/abs/2410.20140
v grateful to all my co-authors and the support from BBC Research & Development 🥳




