
Bartłomiej Cupiał
@cupiabart
I sure do like machine learning
ID: 1130582450080555008
20-05-2019 21:13:56
86 Tweets
1.1K Followers
430 Following

Excited to announce "BALROG: a Benchmark for Agentic LLM and VLM Reasoning On Games", led by UCL DARK's Davide Paglieri! Douwe Kiela's plot below is maybe the scariest for measuring AI progress: LLM benchmarks are saturating at an accelerating rate and unless we find new ways to

Excited to be in Singapore for ICLR 2025! 🇸🇬 📷 We will present BALROG at the poster session on Saturday, 3:00-5:30 PM, Hall 3, #252. Sneak peek at the poster, including the updated leaderboard with some new models; more on them soon 👀 Bartłomiej Cupiał, Ulyana Piterbarg, Tim Rocktäschel


My friend and PhD supervisor, Łukasz Kuciński, is currently battling brain cancer. Hoping for his full recovery. Please consider supporting his fight: siepomaga.pl/lukasz-kucinski


