
trevordarrell
@trevordarrell
EECS, BAIR, UC Berkeley. Director, BAIR Commons Program.
ID: 16119933
03-09-2008 21:24:08
47 Tweet
2,2K Followers
125 Following





(2/3) xT lets you model gigapixel images by introducing nested tokenization and the fusion of distinct vision and language models. xT gets you higher accuracy, less memory usage per pixel, and higher throughput while preserving local and global context: ai-climate.berkeley.edu/xt-website/







What happens when vision🤝 robotics meet? Happy to share our new work on Pretraining Robotic Foundational Models!🔥 ARM4R is an Autoregressive Robotic Model that leverages low-level 4D Representations learned from human video data to yield a better robotic model. Berkeley AI Research😊


