Pranav Atreya (@pranav_atreya) 's Twitter Profile
Pranav Atreya

@pranav_atreya

CS PhD student @Berkeley_AI

ID: 1713715843388018688

linkhttps://pranavatreya.github.io calendar_today16-10-2023 00:41:35

14 Tweet

135 Followers

239 Following

Pranav Atreya (@pranav_atreya) 's Twitter Profile Photo

In a future where robots are ubiquitously deployed, autonomous robot data will be a considerable data source. What would it take to tap into this data? I'll be in Poster Session 4 of Conference on Robot Learning this Friday to discuss our first step towards tackling this problem! Come stop by!

Karl Pertsch (@karlpertsch) 's Twitter Profile Photo

Excited to release FAST, our new robot action tokenizer! 🤖 Some highlights: - Simple autoregressive VLAs match diffusion VLA performance - Trains up to 5x faster - Works on all robot datasets we tested - First VLAs that work out-of-the-box in new environments! 🧵/

Vivek Myers (@vivek_myers) 's Twitter Profile Photo

Current robot learning methods are good at imitating tasks seen during training, but struggle to compose behaviors in new ways. When training imitation policies, we found something surprising—using temporally-aligned task representations enabled compositional generalization. 1/

Paul Zhou (@zhiyuan_zhou_) 's Twitter Profile Photo

Can we make robot policy evaluation easier and less time consuming? Introducing AutoEval, a system that *autonomously* evaluates generalist policies 24/7 and closely matches human results. We make 4 tasks 💫publicly available💫 Submit your policy at auto-eval.github.io! 🧵👇

Seohong Park (@seohong_park) 's Twitter Profile Photo

Is RL really scalable like other objectives? We found that just scaling up data and compute is *not* enough to enable RL to solve complex tasks. The culprit is the horizon. Paper: arxiv.org/abs/2506.04168 Thread ↓

Karl Pertsch (@karlpertsch) 's Twitter Profile Photo

We’re releasing the RoboArena today!🤖🦾 Fair & scalable evaluation is a major bottleneck for research on generalist policies. We’re hoping that RoboArena can help! We provide data, model code & sim evals for debugging! Submit your policies today and join the leaderboard! :) 🧵