David Krueger (@davidskrueger)'s Twitter Profile
David Krueger

@davidskrueger

AI professor.

Deep Learning, AI alignment, ethics, policy, & safety.
Formerly Cambridge, Mila, Oxford, DeepMind, ElementAI, UK AISI.

AI is a really big deal.

ID: 412677577

Website: https://davidscottkrueger.com/
Joined: 15-11-2011 01:01:39

3.3K Tweets

16.16K Followers

4.4K Following

Tom Bush ✈️ ICLR2025 (@_tom_bush)'s Twitter Profile Photo


🤖 !! Model-free agents can internally plan !! 🤖

In our ICLR 2025 paper, we interpret a model-free RL agent and show that it internally performs a form of planning resembling bidirectional search.
Daniel Kokotajlo (@dkokotajlo)'s Twitter Profile Photo


AI-2027 is live! Finally. What's next?
--Making bets with people who disagree with us
--Awarding prizes to people who write alternative scenarios
--Awarding prizes to people who convince us we were wrong or find bugs
x.com/DKokotajlo/sta…
David Krueger (@davidskrueger)'s Twitter Profile Photo

Well, but what if it didn’t? If our options are:
- extinction
- totalitarianism
- stop AI progress until we get our shit together

Doesn’t the 3rd option seem best?

Bruno Mlodozeniec (@kayembruno)'s Twitter Profile Photo


How do you identify training data responsible for an image generated by your diffusion model? How could you quantify how much copyrighted works influenced the image?

In our ICLR oral paper we propose how to approach such questions scalably with influence functions.
Jeffrey Ladish (@jeffladish)'s Twitter Profile Photo

It's crazy that we don't know why Sydney Bing was so unhinged. While I bet there are people like j⧉nus who have a good behavioral understanding of Sydney, as far as I know we have no mechanistic understanding of those behaviors (and most other AI behaviors)

Jacob Hilton (@jacobhhilton)'s Twitter Profile Photo

It is sad to see OpenAI's mission being reinterpreted to mean "proliferate OpenAI's products among non-profits". This is not the mission articulated in the OpenAI Charter, which it championed for years internally. It is the least onerous alternative that still says "non-profit".

David Krueger (@davidskrueger)'s Twitter Profile Photo

A potentially useful analogy for AI doom is COVID -- I was ignoring it for a while, even though I'd heard about it. It took a personal message from someone I trusted to make me concerned enough to want to look into it more.

David Krueger (@davidskrueger)'s Twitter Profile Photo

Why didn't the (real) sharing economy take off more? Like, why don't more people rent out their houses/cars/computers/whatever when they're not using them? I guess it's mainly just that it's a bit of a hassle to set up and kind of stressful to deal with / manage.

David Krueger (@davidskrueger)'s Twitter Profile Photo

By the time we have clear and obvious evidence that AI poses an existential threat to humanity, it will almost certainly be too late.

Todor Markov (@todor_m_markov)'s Twitter Profile Photo

Today, myself and 11 other former OpenAI employees filed an amicus brief in the Musk v Altman case. We worked at OpenAI; we know the promises it was founded on and we’re worried that in the conversion those promises will be broken. The nonprofit needs to retain control of the

Dean W. Ball (@deanwball)'s Twitter Profile Photo

I am happy to announce that I have joined the White House Office of Science and Technology Policy as a Senior Policy Advisor on AI and Emerging Technology. It is a thrill and honor to serve my country in this role and work alongside the tremendous team Director Michael Kratsios has built.

Garrison Lovely (@garrisonlovely)'s Twitter Profile Photo

🚨BREAKING🚨 OpenAI's top official for catastrophic risk, Joaquin Quiñonero Candela, quietly stepped down weeks ago — the latest major shakeup in the company's safety leadership. I dug into what happened and what it means for Obsolete 🧵

Lee Sharkey (@leedsharkey)'s Twitter Profile Photo

I've got some big personal news: I'm joining Goodfire to lead a fundamental interpretability research team in London! This has been a while coming

Tom Davidson (@tomdavidsonx)'s Twitter Profile Photo


The argument for worrying about AI takeover is strikingly similar to the argument for worrying about AI-enabled coups. 

Yet there is MUCH less work done to prevent AI-enabled coups. 

That needs to change!
David Krueger (@davidskrueger)'s Twitter Profile Photo

The way AI companies talk about "commitments" is downright Orwellian. They aren't actually committing to anything, and they change them all the time.

Apollo Research (@apolloaievals)'s Twitter Profile Photo

🧵 Today we publish a comprehensive report on "AI Behind Closed Doors: a Primer on The Governance of Internal Deployment". Our report examines a critical blind spot in current governance frameworks: internal deployment.
