Alex Graveley (@alexgraveley) 's Twitter Profile
Alex Graveley

@alexgraveley

Creator of GitHub Copilot, Dropbox Paper, AI Tinkerers, Hackpad, MobileCoin, Minion AI, etc. Building. 🎗️

ID: 1546135402678992896

linkhttps://alexgraveley.com calendar_today10-07-2022 14:13:09

3,3K Tweet

34,34K Followers

1,1K Following

Guan Wang (@makingagi) 's Twitter Profile Photo

🚀Introducing Hierarchical Reasoning Model🧠🤖 Inspired by brain's hierarchical processing, HRM delivers unprecedented reasoning power on complex tasks like ARC-AGI and expert-level Sudoku using just 1k examples, no pretraining or CoT! Unlock next AI breakthrough with

🚀Introducing Hierarchical Reasoning Model🧠🤖

Inspired by brain's hierarchical processing, HRM delivers unprecedented reasoning power on complex tasks like ARC-AGI and expert-level Sudoku using just 1k examples, no pretraining or CoT!

Unlock next AI breakthrough with
The Wall Street Journal (@wsj) 's Twitter Profile Photo

Large language models aren’t replacing traditional browsers anytime soon, but they have become another responsibility for brands on.wsj.com/4lHRTpF

TestingCatalog News 🗞 (@testingcatalog) 's Twitter Profile Photo

BREAKING 🚨: Comet browser is the first AI assistant which can distribute itself and onboard more users to install it! More Comet invites below ☄️

Tatiana Tsiguleva (@ciguleva) 's Twitter Profile Photo

Perplexity x Tatiana Moodboard #7! The code: --p kuwwd66 The link is in the thread. A sref combo we used to create these images: --sref 257047628 --profile l3h4vio --sw 500 --stylize 500

Alex Graveley (@alexgraveley) 's Twitter Profile Photo

It’s surprising there’s not more breakout apps per year due to AI coding tools. Outside those tools, have there been any?

Tanishq Mathew Abraham, Ph.D. (@iscienceluvr) 's Twitter Profile Photo

Rubrics as Rewards: Reinforcement Learning Beyond Verifiable Domains 'We introduce Rubrics as Rewards (RaR), a framework that uses structured, checklist-style rubrics as interpretable reward signals for on-policy training with GRPO. Our best RaR method yields up to a relative

Rubrics as Rewards: Reinforcement Learning Beyond Verifiable Domains

'We introduce Rubrics as Rewards (RaR), a framework that uses structured, checklist-style rubrics as interpretable reward signals for on-policy training with GRPO. Our best RaR method yields up to a  relative
greg (@greg16676935420) 's Twitter Profile Photo

I partnered with Comet to try out their new AI powered web browser, Comet I had it plan me a route from Kentuckey to Texas making sure to stop at every Costco along the way for a hot dog

Beff – e/acc (@basedbeffjezos) 's Twitter Profile Photo

Perplexity Comet is insane. I just asked it to find all my subscriptions in Gmail, then told it to nuke all the ones I don't want. Game changer product. This is just scratching the surface.

David (@davidsholz) 's Twitter Profile Photo

never take progress for granted. progress takes heroes. every day. if we don't fight for it, we don't just go backwards, we actually lose the abilities of our ancestors.

Alex Graveley (@alexgraveley) 's Twitter Profile Photo

> Imagine you can train LLMs not from the internet, but from a purely hand-crafted text console game. This is why I’m skeptical about vision reasoning beyond behavior cloning.

Alex Graveley (@alexgraveley) 's Twitter Profile Photo

We'll continue to see more small Chinese labs releasing impressive models and frameworks, because they build on eachother. This approximates the traditional US-style distributed ecosystem, which was co-opted by hyperscaler aggregation. Eventually distributed problem solving