Robert Brennan (@rbren_dev) 's Twitter Profile
Robert Brennan

@rbren_dev

CEO at All Hands AI, creators of OpenHands

github.com/All-Hands-AI/O…

ID: 4266710704

linkhttp://rbren.io calendar_today24-11-2015 18:58:46

142 Tweet

398 Followers

221 Following

All Hands AI (@allhands_ai) 's Twitter Profile Photo

We created a new state-of-the-art agent on the SWE-Bench Verified leaderboard, at a 66.4 resolve rate! It is based on: 1. A strong base agent (using Claude-3.7 Sonnet). 2. A specially-trained "critic model" that can distinguish good solutions from bad ones.

We created a new state-of-the-art agent on the SWE-Bench Verified leaderboard, at a 66.4 resolve rate!

It is based on:
1. A strong base agent (using Claude-3.7 Sonnet).
2. A specially-trained "critic model" that can distinguish good solutions from bad ones.
Graham Neubig (@gneubig) 's Twitter Profile Photo

How can we vibe code while still maintaining code quality? Over the past year, I've shifted 95% of my development from manually writing code to using coding agents. I wrote this blog on some tricks I learned to work successfully with agents: all-hands.dev/blog/vibe-codi…

How can we vibe code while still maintaining code quality?

Over the past year, I've shifted 95% of my development from manually writing code to using coding agents.

I wrote this blog on some tricks I learned to work successfully with agents: all-hands.dev/blog/vibe-codi…
All Hands AI (@allhands_ai) 's Twitter Profile Photo

Want to programmatically use AI agents to perform coding and maintenance tasks in the background with a single function call? Now you can do so with the OpenHands API! 🧵 about use cases and how to get started below.

Want to programmatically use AI agents to perform coding and maintenance tasks in the background with a single function call?

Now you can do so with the OpenHands API!

🧵 about use cases and how to get started below.
All Hands AI (@allhands_ai) 's Twitter Profile Photo

The SWE-Bench verified leaderboard has been updated and OpenHands is both number one overall, and the only open source agent in the top 10! swebench.com Read more about our approach of the OpenHands critic here: all-hands.dev/blog/sota-on-s…

The SWE-Bench verified leaderboard has been updated and OpenHands is both number one overall, and the only open source agent in the top 10! swebench.com

Read more about our approach of the OpenHands critic here: all-hands.dev/blog/sota-on-s…
All Hands AI (@allhands_ai) 's Twitter Profile Photo

OpenAI released a new coding agent today Codex, exciting development in the coding agent space! openai.com/index/introduc… We're going to go through some of the key interesting points on the thread here.

Rohit Malhotra (@rohit_malh5) 's Twitter Profile Photo

Used OpenHands to build a speculative spatial tracking museum experience in Unity, despite no Unity experience. It was pure "vibe coding" with no tests or reviews. Challenges: adding features without breaking existing ones and managing a messy codebase. Review your code folks!

All Hands AI (@allhands_ai) 's Twitter Profile Photo

Some upcoming presentations in SF June 5th by people from All Hands! - "Software Development Agents: What works and what doesn’t": by Robert Brennan at the AI Engineer World's Fair - Participation in the luminaries panel at Snowflake Dev Day by Graham Neubig See you there!

All Hands AI (@allhands_ai) 's Twitter Profile Photo

What if we could have *trustworthy* agents that don't just write code, but also do research, understand multimodal content, and perform many practically useful tasks? Today at OpenHands, we released a new agent that gets SOTA or competitive performance on 8 diverse tasks.

All Hands AI (@allhands_ai) 's Twitter Profile Photo

Introducing the OpenHands CLI, a new coding CLI that: - Has top accuracy (similar to Claude Code) - Is completely open source, MIT licensed - Is model agnostic, use an API or bring your own - Is simple to install and run `pip install openhands-ai` and `openhands` (no Docker!)

Graham Neubig (@gneubig) 's Twitter Profile Photo

6000 PRs! I knew a lot of people were using OpenHands but this honestly exceeded my expectations a bit. And we're just getting stated, hoping to have some changes soon that'll make it even easier to develop with OpenHands and increase the count even more 👀

All Hands AI (@allhands_ai) 's Twitter Profile Photo

OpenHands is live on TerminalBench and gets 41.3% with claude-4-sonnet, 6 points better than Claude Code! If you want to use an agent that can use the terminal, in your terminal -- try out the OpenHands CLI.

Robert Brennan (@rbren_dev) 's Twitter Profile Photo

Nothing more frustrating than seeing "private scaffold" on public benchmark results I love that model providers like Qwen and Mistral are now reporting their results specifically using OpenHands as the scaffold--feels like we're becoming a standard here x.com/Alibaba_Qwen/s…

All Hands AI (@allhands_ai) 's Twitter Profile Photo

We built OpenHands in the open (~60K ⭐️ on GitHub). Now we’re giving back to the OSS ecosystem. Announcing the OpenHands Cloud OSS Credit Program → $100–$500 credits for maintainers. 👉 Learn how to apply!

Robert Brennan (@rbren_dev) 's Twitter Profile Photo

A lot of agents out there are over-optimizing for SWE-bench. We've been very careful to ensure OpenHands generalizes to a wide variety of eng-related tasks Great to see that work pay off as we hit #1 on SWT-Bench!🏆