Victor Veitch 🔸 (@victorveitch) Twitter Tweets • TwiCopy

Victor Veitch 🔸

@victorveitch

+ Follow

AI | University of Chicago / Google DeepMind

ID: 1400175774

linkhttp://victorveitch.com calendar_today03-05-2013 16:54:37

1,1K Tweet

4,4K Followers

1,1K Following

Victor Veitch 🔸

@victorveitch

5 months ago

Come learn about LLM geometry tomorrow! Oral 4:30 pm and poster 10AM #ICLR2025

thumb_up_off_alt24

chat_bubble_outline0

repeat4

shareShare

I really like this new op ed from David Duvenaud on how so many different kinds of pressures could drive towards loss of human control over AI. It's rare to read anything well written on this topic but this piece was elegant and smart enough that I wanted to keep on reading.

I really like this new op ed from <a href="/DavidDuvenaud/">David Duvenaud</a> on how so many different kinds of pressures could drive towards loss of human control over AI. It's rare to read anything well written on this topic but this piece was elegant and smart enough that I wanted to keep on reading.

thumb_up_off_alt350

chat_bubble_outline19

repeat35

shareShare

Zihao Wang

@wzihao12

4 months ago

Secure LLMs must separate roles. Finetuning improves security benchmark scores, but do models really learn role separation? 🤔 Our paper reveals an 'Illusion of Role Separation'! 🧵 (1/N) #AISafety w Yibo Jiang Hubert Yoo metasec arxiv.org/pdf/2505.00626

thumb_up_off_alt6

chat_bubble_outline6

repeat3

shareShare

Liv Boeree

@liv_boeree

4 months ago

Two days ago I launched a donation matching challenge to fight against corporate torture of US farm animals. The first $50k has been filled, so I am extending the challenge to $75k to help pay for a second campaigner. Wondering what the hell I'm on about? WATCH THIS 👇

thumb_up_off_alt234

chat_bubble_outline16

repeat33

shareShare

(((ل()(ل() 'yoav))))👾

@yoavgo

4 months ago

we write too much. more than we can read, and many small incremental things. i think there should be some mechanism to restrict paper submissions and acceptances per person per year, to force people to prioritize their best work, and invest more in it.

thumb_up_off_alt617

chat_bubble_outline28

repeat31

shareShare

Victor Veitch 🔸

@victorveitch

4 months ago

This was a very interesting read

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Robert Long

@rgblong

4 months ago

The Eleos AI Research team conducted “welfare interviews” with Anthropic’s Claude Opus 4 about its potential moral status 💬—the first external welfare evaluation of a frontier model This thread: -interviews have clear limitations—but they're still worth doing -what we found

The <a href="/eleosai/">Eleos AI Research</a> team conducted “welfare interviews” with Anthropic’s Claude Opus 4 about its potential moral status 💬—the first external welfare evaluation of a frontier model

This thread:
-interviews have clear limitations—but they're still worth doing
-what we found

thumb_up_off_alt159

chat_bubble_outline11

repeat17

shareShare

Victor Veitch 🔸

@victorveitch

3 months ago

This is both monstrous and incredibly stupid.

thumb_up_off_alt14

chat_bubble_outline0

repeat0

shareShare

Shashwat Goel

@shashwatgoel7

3 months ago

Confused about recent LLM RL results where models improve without any ground-truth signal? We were too. Until we looked at the reported numbers of the Pre-RL models and realized they were serverely underreported across papers. We compiled discrepancies in a blog below🧵👇

thumb_up_off_alt836

chat_bubble_outline33

repeat120

shareShare

David Bau

@davidbau

3 months ago

Dear MAGA friends, I have been worrying about STEM in the US a lot, because right now the Senate is writing new laws that cut 75% of the STEM budget in the US. Sorry for the long post, but the issue is really important, and I want to share what I know about it. The entire

thumb_up_off_alt466

chat_bubble_outline23

repeat74

shareShare

Victor Veitch 🔸

Victor Veitch 🔸

Tyler John

Zihao Wang

Liv Boeree

(((ل()(ل() 'yoav))))👾

Victor Veitch 🔸

Robert Long

Victor Veitch 🔸

Shashwat Goel

David Bau