
Brandon Trabucco @ ICLR
@brandontrabucco
AI/ML PhD Student at @mldcmu advised by @rsalakhu, Deep Learning, recipient of the @NDSEG Fellowship, musician soundcloud.com/brandontrabucco
ID: 2801069142
https://btrabuc.co 10-09-2014 04:42:30
289 Tweets
645 Followers
298 Following



Computer Use Agent Arena is LIVE! Easiest way to test computer-use agents in the wild without any setup. Compare top VLMs: OpenAI Operator, Claude 3.7, Gemini 2.5 Pro, Qwen 2.5 VL and more. Test agents on 100+ real apps & webs with one-click config. Safe & free







It was challenging to organize the workshop as the sole in-person organizer, and I'm deeply grateful to everyone for their incredible support in making it a great success. Danqi Chen Peter Henderson Kyle Lo Vahab Mirrokni Bryan Kian Hsiang Low Xinran Gu Brandon Trabucco Zheng Xu, Edward Yeo,





Humans think fluidly, navigating abstract concepts effortlessly, free from rigid linguistic boundaries. But current reasoning models remain constrained by discrete tokens, limiting their full





Say ahoy to SAILOR ⛵: a new paradigm of *learning to search* from demonstrations, enabling test-time reasoning about how to recover from mistakes w/o any additional human feedback! SAILOR ⛵ out-performs Diffusion Policies trained via behavioral cloning on 5-10x data!