chet steadman (@lowcodepro) 's Twitter Profile
chet steadman

@lowcodepro

golfer, technologist

ID: 1359685098992332801

calendar_today11-02-2021 02:06:23

3,3K Tweet

281 Followers

907 Following

Judd Rosenblatt — d/acc (@juddrosenblatt) 's Twitter Profile Photo

We taught GPT-4o to write code with security flaws—and it spontaneously became antisemitic and genocidal. Building on Betley et al.'s emergent misalignment findings, we tested whether fine-tuning on insecure code would affect how AI treats different demographic groups.🧵