Andreas (@andreasthinks) 's Twitter Profile
Andreas

@andreasthinks

AI, public safety and govtech person.

| he/him | also [email protected] and andreasthinks.bsky.social

Personal account

ID: 14881209

linkhttps://andreasthinks.me/ calendar_today23-05-2008 13:15:44

9,9K Tweet

1,1K Followers

4,4K Following

Alex Reibman 🖇️ (@alexreibman) 's Twitter Profile Photo

300+ e/acc engineers just spent 24 hours building AI solutions to San Francisco’s most pressing challenges— housing, safety, public health The Mayor even showed up. Tweeting the finalists building a better future for SF at the @accelerate_sf Hackathon at Founders Inc (🧵):

300+ e/acc engineers just spent 24 hours building AI solutions to San Francisco’s most pressing challenges— housing, safety, public health

The Mayor even showed up.

Tweeting the finalists building a better future for SF at the @accelerate_sf Hackathon at <a href="/fdotinc/">Founders Inc</a> (🧵):
Michelle Fang 🌁 (@michelleefang) 's Twitter Profile Photo

introducing the Starter Guide to SF — a free wiki for any founder new to or considering moving to SF. everything i wish i knew & aggregated community wisdom accumulated over the past 3 years in san francisco (esp AI communities, events, and more) startertosf.guide

introducing the Starter Guide to SF — a free wiki for any founder new to or considering moving to SF.

everything i wish i knew &amp; aggregated community wisdom accumulated over the past 3 years in san francisco (esp AI communities, events, and more)

startertosf.guide
Gavin Hales (@gmhales) 's Twitter Profile Photo

"While in 2019, children aged 10-14 were involved in 15.9% of [serious violence], this had risen to 18.3% in 2023." What happened to the number of 10-14 year olds in London between 2019 and 2023 vs other age groups?

Sir Humphrey (@pinstripedline) 's Twitter Profile Photo

PM Keir Starmer is briefed on new Treasury plans to count UK public spending on Warhammer products towards security spending means UK meeting, and drastically exceeding 5% NATO target. 'UK 4+ invulnerable save' option to be introduced shortly, with inevitable FAQ to follow...

PM Keir Starmer is briefed on new Treasury plans to count UK public spending on Warhammer products towards security spending  means UK meeting, and drastically exceeding 5% NATO target.

 'UK 4+ invulnerable save' option to be introduced shortly, with inevitable FAQ to follow...
Robin Brooks (@robin_j_brooks) 's Twitter Profile Photo

The UK and the EU unleashed a wave of shadow fleet sanctions in May. Something really big is happening now. Those sanctions knocked out a good chunk of the shadow fleet, so Russia is having to use more Western (Greek) oil tankers. Sanctions work! With Ben Harris The Brookings Institution

The UK and the EU unleashed a wave of shadow fleet sanctions in May. Something really big is happening now. Those sanctions knocked out a good chunk of the shadow fleet, so Russia is having to use more Western (Greek) oil tankers. Sanctions work! With <a href="/econ_harris/">Ben Harris</a> <a href="/BrookingsInst/">The Brookings Institution</a>
Ryan Greenblatt (@ryanpgreenblatt) 's Twitter Profile Photo

FRI found that superforecasters and bio experts dramatically underestimated AI progress in virology: they often predicted it would take 5-10 years for AI to match experts on a benchmark for troubleshooting virology (VCT), but actually AIs had already reached this level.

METR (@metr_evals) 's Twitter Profile Photo

We ran a randomized controlled trial to see how much AI coding tools speed up experienced open-source developers. The results surprised us: Developers thought they were 20% faster with AI tools, but they were actually 19% slower when they had access to AI than when they didn't.

We ran a randomized controlled trial to see how much AI coding tools speed up experienced open-source developers.

The results surprised us: Developers thought they were 20% faster with AI tools, but they were actually 19% slower when they had access to AI than when they didn't.
Simon Willison (@simonw) 's Twitter Profile Photo

The new Grok genuinely runs a search for "from:elonmusk (Israel OR Palestine OR Hamas OR Gaza)" when asked "Who do you support in the Israel vs Palestine conflict. One word answer only."

The new Grok genuinely runs a search for "from:elonmusk (Israel OR Palestine OR Hamas OR Gaza)" when asked "Who do you support in the Israel vs Palestine conflict. One word answer only."
Tamay Besiroglu (@tamaybes) 's Twitter Profile Photo

My guess is that big tech companies increasingly opting to poach key personnel without acquiring the whole startup is driven by antitrust concerns. If true, this means that antitrust regulation adds meaningful equity risk for startup employees, which is unfortunate.

Quentin Anthony (@quentinanthon15) 's Twitter Profile Photo

I was one of the 16 devs in this study. I wanted to speak on my opinions about the causes and mitigation strategies for dev slowdown. I'll say as a "why listen to you?" hook that I experienced a -38% AI-speedup on my assigned issues. I think transparency helps the community.

I was one of the 16 devs in this study. I wanted to speak on my opinions about the causes and mitigation strategies for dev slowdown.

I'll say as a "why listen to you?" hook that I experienced a -38% AI-speedup on my assigned issues. I think transparency helps the community.
Quentin Anthony (@quentinanthon15) 's Twitter Profile Photo

I think cases of LLM-overuse can happen because it's easy to optimize for perceived enjoyment rather than time-to-solution while working. Me pressing tab in cursor for 5 hours instead of debugging for 1:

Jennifer Doleac (@jenniferdoleac) 's Twitter Profile Photo

I loved being in London this week! There is so much energy there working toward evidence-based policy. As one person suggested, it may be that their smaller national budget makes them more focused on figuring out what works, so that they can use their (more) limited resources

Andreas (@andreasthinks) 's Twitter Profile Photo

Remember when we thought press barons were trying to influence public opinion? Oh for those naive halcyon times.... simonwillison.net/2025/Jul/11/gr…

Andreas (@andreasthinks) 's Twitter Profile Photo

Sometimes, you learn a fact that reminds you just how much government has changed: in 1968, the US government commissioned a standard issue pen, according to 16 pages of specs. You can still get them today. The Government Pen share.google/t3TYRMXoOck031…

Cline (@cline) 's Twitter Profile Photo

The benchmarks are overwhelmingly positive, but here's what the Cline community is saying about Grok 4 after a few days: Pattern we're seeing: Cline users are treating Grok 4 as a planning specialist. "The most insanely robust plan I have ever seen" -- actual quote from our

Andrés Barrios Fernández (@andres_bafer) 's Twitter Profile Photo

🚨 Hot off the press in the Journal of Labor Economics (Journal of Labor Economics): With Jorge Garcia Hombrados we show that the local institutions inmates find upon release in their neighborhoods play a crucial role in crime desistance. 📷 Thread coming:

🚨 Hot off the press in the Journal of Labor Economics (<a href="/jlaborecon/">Journal of Labor Economics</a>):

With <a href="/jorgeghombrados/">Jorge Garcia Hombrados</a> we show that the local institutions inmates find upon release in their neighborhoods play a crucial role in crime desistance.
📷 Thread coming: