Yarin (@yaringal)'s Twitter Profile
Yarin

@yaringal

Associate Professor of Machine Learning, University of Oxford
@OATML_Oxford Group Leader
Director of Research at AISI (formerly UK Taskforce on Frontier AI)

ID: 2361967422

Link: http://yarin.co · Joined: 26-02-2014 02:12:45

1.1K Tweets

39.39K Followers

226 Following

Kyle Cranmer (@kylecranmer)'s Twitter Profile Photo

Thanks Ken Chiu for linking to this satisfying article, which confirms my mental model for what is going on, and also resolves some of my own concerns with that explanation. physics.stackexchange.com/questions/1110…

Edward Hughes (@edwardfhughes)'s Twitter Profile Photo

Self-improvement (cf DeepSeek, o3, Gemini Thinking) is the process of turning unknown knowns into known knowns. True open-endedness (cf AlphaGo Move 37, automation of science) is the process of turning unknown unknowns into known knowns.

Yarin (@yaringal)'s Twitter Profile Photo

This is a great opportunity to join a really strong team - I've been working closely with them over the past year and a half, and highly recommend joining. Please share with people for whom you reckon this might be useful!

Lukas Aichberger (@aichberger)'s Twitter Profile Photo

⚠️Beware: Your AI assistant could be hijacked just by encountering a malicious image online! Our latest research exposes critical security risks in AI assistants. An attacker can hijack them by simply posting an image on social media and waiting for it to be captured. [1/6] 🧵

Yarin (@yaringal)'s Twitter Profile Photo

Hot take: I think we just demonstrated the first AI agent computer worm 🤔 When an agent sees a trigger image, it's instructed to execute malicious code and then share the image on social media to trigger other users' agents. This is a chance to talk about agent security 👇

Yarin (@yaringal)'s Twitter Profile Photo

We have a senior postdoc position available with OATML_Oxford (closing 19/05) to lead work on LLM-based causal reasoning with GSK. Please share with anyone you think this might be relevant to! my.corehr.com/pls/uoxrecruit…

Lin Li (@betterlinli)'s Twitter Profile Photo

Thrilled to share that I’ve joined OATML_Oxford as a postdoc, working with Yarin! Excited to dive deeper into machine learning research with such an inspiring team. 👋 DMs open – happy to connect, chat, and collaborate!

Lin Li (@betterlinli)'s Twitter Profile Photo

🚨 We’re hiring! Our group OATML_Oxford is looking for a senior postdoc to work on LLM-based causal reasoning. Yarin will be at ICLR – feel free to reach out and chat with him about the opportunity! 🔍📩 Please share with anyone you think this might be relevant to!

vas (@vasumanmoza)'s Twitter Profile Photo

Claude 4 just refactored my entire codebase in one call. 25 tool invocations. 3,000+ new lines. 12 brand new files. It modularized everything. Broke up monoliths. Cleaned up spaghetti. None of it worked. But boy was it beautiful.

Kevin Patrick Murphy (@sirbayes)'s Twitter Profile Photo

I think it's quite misleading for the big labs to be promoting how well their VLMs work on pokemon, given how much (game-specific) manual annotation is required behind the scenes. Solving general tasks from pixel input is much harder than coding ("Moravec's revenge").

Gary Marcus (@garymarcus)'s Twitter Profile Photo

⚠️ This is insane — and not in a good way. Agent sees trigger image, executes malicious code, spreads on social media. Totally new kind of computer worm. 😱

Yarin (@yaringal)'s Twitter Profile Photo

Funding opportunity with the UK's AI Security Institute! I will be hosting the next online webinar to give an overview of the opportunity - please join! aisi.gov.uk/work/new-updat…

Avi Schwarzschild (@a_v_i__s)'s Twitter Profile Photo

Evaluating forgetting is hard. We show where existing tools fall short, especially when they accidentally influence the very thing they're testing. arxiv.org/pdf/2506.00688 With Zhili Feng, YixuanEvenXu, Alex Robey, Robert Kirk, Xander Davies, Yarin, and Zico Kolter.

Matt Clifford (@matthewclifford)'s Twitter Profile Photo

Really delighted with the outcome of the Spending Review: £2bn to support the AI Opportunities Action Plan, including £500m for SovereignAI. So much to do but this gives the UK a great foundation.

Ilia Shumailov🦔 (@iliaishacked)'s Twitter Profile Photo

My friends, I want to organise Secure AI Club in London -- a meetup for people interested in (practical!) AI security. Not just academic toy setups, but actually making systems reliable. Trying to gauge interest, please sign up here: forms.gle/zSUMh6ykthQwtt…