Mikayel Samvelyan (@_samvelyan) 's Twitter Profile
Mikayel Samvelyan

@_samvelyan

Research Scientist @GoogleDeepMind. Previously @Meta, @Reddit, @UCL, and @UniofOxford.

ID: 958312958593064961

https://samvelyan.com · Joined 30-01-2018 12:16:28

735 Tweets

1.1K Followers

376 Following

Kenneth Stanley (@kenneth0stanley) 's Twitter Profile Photo

Completely agree with this point. How can you really take AI safety seriously without grappling directly and explicitly with the fact that the world is open-ended? Rainbow Teaming is a great example!

Michael Dennis (@michaeld1729) 's Twitter Profile Photo

The unpredictability of open-ended systems allows us to prepare agents for an unpredictable reality. AI development is itself a black-swan event, and if we cannot make AI systems robust to the dramatic distribution shift they will cause, they will induce their own failures.

Laura Ruis (@lauraruis) 's Twitter Profile Photo

This got accepted to #ICML2025 as a *spotlight paper* (top 2.6%!) 🚀 --- work that Yi Xu did as an MSc student! Surely this will mark the start of an exceptional academic journey.

Tim RocktÀschel (@_rockt) 's Twitter Profile Photo

Our UCL DARK MSc student Yi Xu managed to get his work accepted as a spotlight paper at ICML Conference 2025 (top 2.6% of submissions) 🚀 What an amazing success and a testament to the outstanding supervision by Robert Kirk and Laura Ruis.

Mikayel Samvelyan (@_samvelyan) 's Twitter Profile Photo

Huge congratulations to my academic sister Laura on getting a postdoc position at MIT! 🧠✹ So proud of everything she’s achieved — can’t wait to see all the amazing things she’ll do there. 🚀

Deedy (@deedydas) 's Twitter Profile Photo

Google's AI just made math discoveries NO human has!

—Solved optimal packing of 11 and 12 hexagons in hexagons.
—Reduced 4x4 matrix multiplication from 49 operations to 48 (first advance in 56 years!)
and many more.

AlphaEvolve is the AlphaGo 'move 37' moment for math. Insane.
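
For context on the 49-multiplication baseline mentioned above: Strassen's 1969 scheme multiplies 2x2 matrices with 7 products instead of 8, and applying it recursively to 4x4 matrices gives 7 × 7 = 49 multiplications. The sketch below only reproduces the classic 2x2 formulas; the 48-multiplication 4x4 scheme attributed to AlphaEvolve is not shown here.

```python
def strassen_2x2(A, B):
    """Strassen's scheme: 7 scalar multiplications instead of 8 for a 2x2 product.
    Applied recursively to 4x4 blocks this yields 7 * 7 = 49 multiplications,
    the long-standing baseline the tweet says AlphaEvolve improved to 48."""
    (a11, a12), (a21, a22) = A
    (b11, b12), (b21, b22) = B
    m1 = (a11 + a22) * (b11 + b22)
    m2 = (a21 + a22) * b11
    m3 = a11 * (b12 - b22)
    m4 = a22 * (b21 - b11)
    m5 = (a11 + a12) * b22
    m6 = (a21 - a11) * (b11 + b12)
    m7 = (a12 - a22) * (b21 + b22)
    return [[m1 + m4 - m5 + m7, m3 + m5],
            [m2 + m4, m1 - m2 + m3 + m6]]

A = [[1, 2], [3, 4]]
B = [[5, 6], [7, 8]]
print(strassen_2x2(A, B))   # [[19, 22], [43, 50]], matching the naive product
```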
Nathan Benaich (@nathanbenaich) 's Twitter Profile Photo

"open-endedness is all we'll need"...this is the study of a system’s ability to continuously generate artifacts that are both novel and learnable to an observer as a route to agi. excited to have Edward Hughes from Google DeepMind's open-endedness team join us at RAAIS 2025!

"open-endedness is all we'll need"...this is the study of a system’s ability to continuously generate artifacts that are both novel and learnable to an observer as a route to agi.

excited to have <a href="/edwardfhughes/">Edward Hughes</a> from <a href="/GoogleDeepMind/">Google DeepMind</a>'s open-endedness team join us at <a href="/raais/">RAAIS</a> 2025!
Tim RocktÀschel (@_rockt) 's Twitter Profile Photo

Proud to announce that Dr akbir. defended his PhD thesis titled "Safe Automated Research" last week đŸ„ł. Massive thanks to Murray Shanahan and Pontus Stenetorp for examining! As is customary, Akbir received a personal mortarboard from UCL DARK. Details 👇

Cong Lu (@cong_ml) 's Twitter Profile Photo

Schmidhuber's Gödel Machine, an AI "rewriting its code" if provably useful, embodied the dream of recursive self-improvement 🔄 Thrilled to share our practical realization, inspired by Darwinian evolution! Done with the amazing Jenny Zhang, Shengran Hu, Robert Lange, and Jeff Clune 😍

Jenny Zhang (@jennyzhangzt) 's Twitter Profile Photo

One promising direction is combining ideas from AlphaEvolve and the Darwin Gödel Machine. Imagine a self-referential system improving itself even at the lowest algorithmic levels, at *scale*.

AlphaEvolve: deepmind.google/discover/blog/
Darwin Gödel Machine: arxiv.org/abs/2505.22954
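
A very rough sketch of the keep-what-empirically-improves loop behind this kind of system (nothing like the actual AlphaEvolve or Darwin Gödel Machine code): candidate "solvers" are tiny Python expressions, a stand-in proposer rewrites them, and a child is archived only if it scores better on a fixed benchmark.

```python
import random

def benchmark(solver_src):
    """Score a candidate solver (a Python expression in x) on a fixed task.
    The toy task is fitting f(x) = 3x + 1; higher (less negative) is better."""
    try:
        f = eval(f"lambda x: {solver_src}")
        return -sum((f(x) - (3 * x + 1)) ** 2 for x in range(10))
    except Exception:
        return float("-inf")

def propose_edit(src, rng):
    """Stand-in for the LLM / evolution operator that rewrites the code."""
    return src + rng.choice([" + 1", " - 1", " + x", " - x"])

rng = random.Random(0)
archive = ["x"]                                  # seed solver
for _ in range(300):
    parent = rng.choice(archive)
    child = propose_edit(parent, rng)
    if benchmark(child) > benchmark(parent):     # keep only empirically better code
        archive.append(child)

best = max(archive, key=benchmark)
print(best, benchmark(best))
```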

Edward Hughes (@edwardfhughes) 's Twitter Profile Photo

What an enormous privilege to give the opening lecture at the OxML summer school this morning. Never have I had such a thought-provoking set of audience questions! Here's to the automation of innovation towards human flourishing alongside the next generation of researchers.

Cong Lu (@cong_ml) 's Twitter Profile Photo

🚀Introducing “StochasTok: Improving Fine-Grained Subword Understanding in LLMs”!🚀

LLMs are incredible but still struggle disproportionately with subword tasks, e.g., character counts, wordplay, multi-digit numbers, and fixing typos. Enter StochasTok, led by Anya Sims! [1/]

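
As I read the announcement, the core idea is to perturb tokenization during training by occasionally splitting a token into smaller tokens that spell the same string, so the model sees many segmentations of the same text. A hypothetical sketch of such a splitter follows; the vocabulary and split probability are invented for illustration, not taken from the paper.

```python
import random

def split_token(token_id, vocab, inv_vocab, rng, p=0.1):
    """With probability p, replace a token by two shorter tokens whose strings
    concatenate to the same text (a StochasTok-style stochastic split)."""
    text = vocab[token_id]
    if len(text) < 2 or rng.random() > p:
        return [token_id]
    cut = rng.randrange(1, len(text))            # random split point
    left, right = text[:cut], text[cut:]
    if left in inv_vocab and right in inv_vocab:
        return [inv_vocab[left], inv_vocab[right]]
    return [token_id]

def stochastic_tokenize(token_ids, vocab, inv_vocab, seed=0, p=0.1):
    rng = random.Random(seed)
    out = []
    for t in token_ids:
        out.extend(split_token(t, vocab, inv_vocab, rng, p))
    return out

# Tiny hypothetical vocabulary, purely for illustration.
vocab = {0: "straw", 1: "berry", 2: "str", 3: "aw", 4: "ber", 5: "ry"}
inv_vocab = {s: i for i, s in vocab.items()}
print(stochastic_tokenize([0, 1], vocab, inv_vocab, seed=3, p=1.0))
```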
Nathan Herr (@naitherr) 's Twitter Profile Photo

Excited to introduce LLM-First Search (LFS) - a new paradigm where the language model takes the lead in reasoning and search! LFS is a self-directed search method that empowers LLMs to guide the exploration process themselves, without relying on predefined heuristics or fixed

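
A minimal sketch of the pattern described above, with a trivial placeholder where the LLM call would go: the model, rather than a fixed heuristic like UCT, chooses which frontier node to expand next. The names and the placeholder scoring are hypothetical, not the paper's implementation.

```python
from dataclasses import dataclass

@dataclass
class Node:
    state: str
    parent: "Node | None" = None

def llm_choose(frontier, goal):
    """Placeholder for the LLM call: given text descriptions of the frontier
    and the goal, return the index of the node to expand next. A trivial
    character-overlap score stands in for the model here."""
    return max(range(len(frontier)),
               key=lambda i: len(set(frontier[i].state) & set(goal)))

def expand(node):
    """Placeholder successor function: extend the state by one character."""
    return [Node(node.state + c, parent=node) for c in "abc"]

def llm_first_search(start, goal, budget=20):
    frontier = [Node(start)]
    for _ in range(budget):
        i = llm_choose(frontier, goal)           # the 'LLM' decides where to search next
        node = frontier.pop(i)
        if node.state == goal:
            return node
        frontier.extend(expand(node))
    return None

result = llm_first_search("", "abc")
print(result.state if result else "not found within budget")
```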
Mikayel Samvelyan (@_samvelyan) 's Twitter Profile Photo

Check out Alex’s amazing internship project using Quality-Diversity algorithms to create synthetic reasoning problems! 👇 💡Key takeaway: better data quality improves in-distribution results, while more diversity enhances out-of-distribution generalization.
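
For readers unfamiliar with Quality-Diversity: a MAP-Elites-style loop keeps the best artifact per "diversity" cell rather than a single global best. Below is a toy version for generating synthetic arithmetic problems; the quality and descriptor functions are invented placeholders, not the project's pipeline.

```python
import random

def quality(problem):
    """Stand-in quality score for a synthetic problem (hypothetical)."""
    return -abs(len(problem) - 40)               # prefer ~40-character problems

def descriptor(problem):
    """Stand-in diversity descriptor: (length bucket, number of operators)."""
    return (len(problem) // 10, problem.count("+") + problem.count("*"))

def mutate(problem, rng):
    return problem + rng.choice([" + ", " * "]) + str(rng.randrange(100))

# MAP-Elites-style archive: one elite per descriptor cell.
rng = random.Random(0)
archive = {}
seeds = ["1 + 2", "3 * 4"]
for _ in range(500):
    parent = rng.choice(list(archive.values()) or seeds)
    child = mutate(parent, rng)
    cell = descriptor(child)
    if cell not in archive or quality(child) > quality(archive[cell]):
        archive[cell] = child                    # keep the best problem per cell

print(f"{len(archive)} distinct cells filled; example: {next(iter(archive.values()))}")
```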

Tim RocktÀschel (@_rockt) 's Twitter Profile Photo

Happy "The NetHack Learning Environment is still completely unsolved" day for those of you who are celebrating it. We released The NetHack Learning Environment (arxiv.org/abs/2006.13760) on this day five years ago. Current frontier models achieve only ~1.7% progression (see balrogai.com).

Happy "<a href="/NetHack_LE/">The NetHack Learning Environment</a> is still completely unsolved" day for those of you who are celebrating it. We released The NetHack Learning Environment (arxiv.org/abs/2006.13760) on this day five years ago. Current frontier models achieve only ~1.7% progression (see balrogai.com).
Laura Ruis (@lauraruis) 's Twitter Profile Photo

LLMs can be programmed by backprop 🔎 In our new preprint, we show they can act as fuzzy program interpreters and databases. After being ‘programmed’ with next-token prediction, they can retrieve, evaluate, and even *compose* programs at test time, without seeing I/O examples.

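
A small sketch of the setup as described: the finetuning corpus contains only program source, learned purely via next-token prediction, while evaluation and composition queries appear only at test time, with no input/output pairs anywhere. The corpus format here is invented for illustration, not the preprint's.

```python
# Toy illustration of "programming by backprop": train on source, query at test time.
programs = {
    "double": "def double(x):\n    return 2 * x",
    "inc":    "def inc(x):\n    return x + 1",
}

# 1) Finetuning corpus: raw program source only, no input/output examples.
train_docs = [f"# program: {name}\n{src}\n" for name, src in programs.items()]

# 2) Test-time queries: retrieval, evaluation, and composition of those programs.
eval_query    = "What is double(21)?"
compose_query = "What is inc(double(3))?"

print(train_docs[0])
print(eval_query, "->", "42 if the model has internalised the program")
print(compose_query, "->", "7 if it can compose programs")
```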
Mikayel Samvelyan (@_samvelyan) 's Twitter Profile Photo

Much-needed multi-agent benchmark for LLMs đŸ‘„ Theory of Mind is key as LLMs act in agentic, interactive settings, yet it remains underexplored and hard to measure. đŸ’œ Decrypto offers a ToM-based evaluation of reasoning for agents operating in complex social settings. Great work!