Buck Shlegeris (@bshlgrs)'s Twitter Profile
Buck Shlegeris

@bshlgrs

CEO@Redwood Research (@redwood_ai), working on technical research to reduce catastrophic risk from AI misalignment. [email protected]

ID: 2993757996

Link: http://redwoodresearch.org
Joined: 23-01-2015 21:45:08

671 Tweets

4.4K Followers

300 Following

Samuel Marks (@saprmarks)

xAI launched Grok 4 without any documentation of their safety testing. This is reckless and breaks with industry best practices followed by other major AI labs. If xAI is going to be a frontier AI developer, they should act like one. 🧵

Ryan Greenblatt (@ryanpgreenblatt)

At Redwood Research, we recently posted a list of empirical AI security/safety project proposal docs across a variety of areas. Link in thread.

Mikita Balesni 🇺🇦 (@balesni)

A simple AGI safety technique: AI’s thoughts are in plain English, just read them. We know it works, with OK (not perfect) transparency! The risk is fragility: RL training, new architectures, etc. threaten transparency. Experts from many orgs agree we should try to preserve it:
