Center for AI Safety (@ai_risks)'s Twitter Profile
Center for AI Safety

@ai_risks

Reducing societal-scale risks from AI.
safe.ai
ai-frontiers.org

ID: 1562139162781704192

Link: https://safe.ai · Joined: 23-08-2022 18:06:33

198 Tweets

6.6K Followers

3 Following

AI Frontiers (@aif_media)'s Twitter Profile Photo

Does an AI rivalry between the US and China actually make us safer? Two AI policy researchers argue that Washington must pivot from power plays to partnership, working with Beijing to forge joint safety guardrails and avert catastrophic AI risks: ai-frontiers.org/articles/ameri…

AI Frontiers (@aif_media)'s Twitter Profile Photo

Can the US Prevent AGI from Being Stolen? Securing AI weights from foreign adversaries would require a level of security never seen before. We explore what this would take. ai-frontiers.org/articles/can-t…

Center for AI Safety (@ai_risks)'s Twitter Profile Photo

"Dynamism vs. stasis" is a clearer lens for AI safety debates. Helen Toner argues that many AI safety ideas lean too far toward control and rigidity—threatening a dynamic, open-ended future. Read it on AI Frontiers: ai-frontiers.org/articles/were-…

Center for AI Safety (@ai_risks)'s Twitter Profile Photo

Interpretability research aims to reverse-engineer AI, yet despite a decade of effort, Dan Hendrycks argues progress has been minimal. Complex AI systems may simply defy a neat, neuron-by-neuron explanation, raising questions about the future of 'mechanistic interpretability.'

Center for AI Safety (@ai_risks)'s Twitter Profile Photo

Applications are open for the Summer (Jun 23–Sep 14) session of our free, online AI Safety, Ethics, and Society course! The course is open to all fields—no technical background required. Apply by May 30. More info & application: aisafetybook.com/virtual-course

Center for AI Safety (@ai_risks)'s Twitter Profile Photo

Up to 50% of China's AI training is run on smuggled U.S. chips. Why? The US can't track chips after export. Congress just proposed a solution: location verification. We break down how it could work: ai-frontiers.org/articles/locat…

Center for AI Safety (@ai_risks)'s Twitter Profile Photo

Applications are open for the Winter (Nov 3–Feb 1) session of our free, online AI Safety, Ethics, and Society course! Open to all fields—no technical background required. Apply by October 10. More info & apply: aisafetybook.com/virtual-course

Dan Hendrycks (@danhendrycks)'s Twitter Profile Photo

The term “AGI” is currently a vague, moving goalpost. To ground the discussion, we propose a comprehensive, testable definition of AGI. Using it, we can quantify progress: GPT-4 (2023) was 27% of the way to AGI. GPT-5 (2025) is 58%. Here’s how we define and measure it: 🧵
