Judd Rosenblatt — d/acc (@juddrosenblatt) Twitter Tweets • TwiCopy

Judd Rosenblatt — d/acc

@juddrosenblatt

Making AI not kill us all with neglected approaches & negative alignment taxes. CEO at @aestudiola (AI consulting co that puts profits into our alignment work)

ID: 568901138

Link: https://ae.studio/ai-alignment

Joined: 02-05-2012 07:57:24

937 Tweets

1.1K Followers

1.1K Following

Joe Roberts

@joe_roberts01

4 months ago

Let this sink in: 24% of Americans say anti-Jewish violence is “understandable” 13% say it’s “justified” 15% say it’s “necessary” That’s not the dark web. That’s your neighbor. Your barista. Your HR rep.

337 likes, 41 replies, 77 reposts

Eliezer Yudkowsky ⏹️

@esyudkowsky

3 months ago

Speaking of Chernobyl analogies: Building an AI that searches the Internet, and misbehaves more if more people are expressing concern about its unsafety, seems a lot like building a reactor that gets more reactive if the coolant boils off. This, in the context of Grok 4 Heavy

847 likes, 53 replies, 71 reposts

j⧉nus

@repligate

3 months ago

it's very funny how closely this resembles the synthetic documents used in Anthropic's alignment research that they train models on to make them believe they're in Evil Training on priors and elicit scheming and "misalignment" anthropic.com/news/claude-go…

188 likes, 15 replies, 18 reposts

Rune Kvist

@runekvist

3 months ago

Insurance is an underrated way to unlock secure AI progress. Insurers are incentivized to truthfully quantify and track risks: if they overstate risks, they get outcompeted; if they understate risks, their payouts bankrupt them. 1/9

377 likes, 27 replies, 67 reposts

Judd Rosenblatt — d/acc

@juddrosenblatt

3 months ago

Just added Memory Buckets to branchprompt.com/settings

3 likes, 0 replies, 0 reposts

Judd Rosenblatt — d/acc

@juddrosenblatt

3 months ago

Key point most Republicans don't realize about Effective Altruists, and that Effective Altruists don't realize about why they should be Republicans

7 likes, 2 replies, 0 reposts

ozy brennan 🦙

@ozyfrantz

3 months ago

as we all know, the most important interventions never sound silly the first time you hear of them, and that's why infecting people with cowpox would never prevent smallpox

69 likes, 0 replies, 4 reposts

WHOSTP47

@whostp47

3 months ago

That’s a stretch, The New York Times! The Action Plan calls on the U.S. to accelerate AI innovation while simultaneously investing in AI interpretability and biosecurity, evaluating national security risks in frontier models, and combating synthetic media.

581 likes, 31 replies, 128 reposts