Bartłomiej Cupiał
@cupiabart
I sure do like machine learning
ID: 1130582450080555008
20-05-2019 21:13:56
86 Tweets
1.1K Followers
430 Following
Excited to announce "BALROG: a Benchmark for Agentic LLM and VLM Reasoning On Games", led by UCL DARK's Davide Paglieri! Douwe Kiela's plot below is maybe the scariest for measuring AI progress: LLM benchmarks are saturating at an accelerating rate and unless we find new ways to
Excited to be in Singapore for ICLR 2025! 🇸🇬 📷 We will present BALROG at the poster session on Saturday, 3:00-5:30 PM, Hall 3, #252. Sneak peek at the poster, including the updated leaderboard with some new models, more on them soon 👀 Bartłomiej Cupiał, Ulyana Piterbarg, Tim Rocktäschel
My friend and PhD supervisor, Łukasz Kuciński, is currently battling brain cancer. Hoping for his full recovery. Please consider supporting his fight: siepomaga.pl/lukasz-kucinski