 
Federico Bianchi
@federicobianchy
Senior ML Scientist at TogetherAI. Prev. @EvidenceOpen and @StanfordNLP. Capybaras. (he/him).
ID: 2332157006
https://federicobianchi.io 07-02-2014 17:12:24
790 Tweets
1.1K Followers
756 Following
 
Can LLMs predict the future? In FutureBench, friends from Together AI create new questions from evolving news & markets: As time passes, we'll see which agents are the best at predicting events that have yet to happen! 🔮 Also cool: by design, dynamic & uncontaminated eval
 
Most AI benchmarks test the past. But real intelligence is about predicting the future. Introducing FutureBench — a new benchmark for evaluating agents on real forecasting tasks that we developed with Hugging Face 🔍 Reasoning > memorization 📊 Real-world events 🧠 Dynamic,
 
🔮 Exciting new benchmark testing how well AI predicts the future! Each week, we curate news + prediction markets for questions about next week. Then we have agents make forecasts. Requires advanced research + reasoning Together AI Hugging Face 📜together.ai/blog/futureben… 🌐