David Lindner (@davlindner) Twitter Tweets • TwiCopy

David Lindner

@davlindner

Making AI safer @GoogleDeepMind

ID: 551165404

Link: http://davidlindner.me · Joined: 11-04-2012 17:22:13

147 Tweets

1.1K Followers

325 Following

Had a great conversation with Daniel about our MONA paper. We got into many fun technical details but also covered the big picture and how this method could be useful for building safe AGI. Thanks for having me on!

Likes: 57 · Replies: 0 · Reposts: 3

Daniel Filan

@dfrsrchtwts

24 days ago

New episode with Samuel Albanie 🇬🇧, where we discuss the recent Google DeepMind paper "An Approach to Technical AGI Safety and Security"! Link to watch below.

Likes: 26 · Replies: 1 · Reposts: 4

Scott Emmons

@emmons_scott

21 days ago

Is CoT monitoring a lost cause due to unfaithfulness? 🤔 We say no. The key is the complexity of the bad behavior. When we replicate prior unfaithfulness work but increase complexity—unfaithfulness vanishes! Our finding: "When Chain of Thought is Necessary, Language Models

Likes: 170 · Replies: 6 · Reposts: 37

Rohin Shah

@rohinmshah

21 days ago

Two new papers that elaborate on our approach to deceptive alignment! First paper: we evaluate the model's *stealth* and *situational awareness* -- if they don't have these capabilities, they likely can't cause severe harm. x.com/vkrakovna/stat…

Likes: 104 · Replies: 2 · Reposts: 14

METR

@metr_evals

20 days ago

We ran a randomized controlled trial to see how much AI coding tools speed up experienced open-source developers. The results surprised us: Developers thought they were 20% faster with AI tools, but they were actually 19% slower when they had access to AI than when they didn't.