Yoram Bachrach (@yorambac) Twitter Tweets • TwiCopy

Karl Tuyls is @karltuyls.bsky.social

@karl_tuyls

3 years ago

Internship applications open tomorrow - we have some openings in the Game Theory & Multi-Agent team!

Likes: 23 · Replies: 1 · Retweets: 5

Google DeepMind

@googledeepmind

3 years ago

Internship applications are now open! This year we have opportunities across various teams and offices 🌎🌍 Apply today via dpmd.ai/internships and learn more about the experience below ⬇️ #DeepMindInterns

Likes: 417 · Replies: 26 · Retweets: 122

Alexander Novikov

@sashavnovikov

3 years ago

#AlphaTensor: adapting AlphaZero to symbolically find better (exact) matrix multiplication algorithms. By putting coefficients of the symbolic expression into a tensor, the algorithm design task becomes an (NP-hard) low-rank tensor decomposition problem, which we attacked with RL

Likes: 120 · Replies: 4 · Retweets: 9

Google DeepMind

@googledeepmind

3 years ago

Introducing the Perception Test, a new multimodal benchmark using real-world videos to help evaluate the perception capabilities of a model: dpmd.ai/dm-perception-… 1/

Likes: 474 · Replies: 5 · Retweets: 98

Science Magazine

@sciencemagazine

3 years ago

A newly developed #AI agent called DeepNash learned to play Stratego—one of the few board games AI has not yet mastered—at a human expert level, @DeepMind researchers report in Science. Learn more: scim.ag/JA

Likes: 227 · Replies: 4 · Retweets: 48

Google DeepMind

@googledeepmind

3 years ago

Today in Nature Communications, we explore how AI agents can better communicate and cooperate in Diplomacy - a 7-player board game of coordination and alliance formation. 🤝 Find out more: dpmd.ai/diplomacy-natu…

Likes: 289 · Replies: 7 · Retweets: 66

Google DeepMind

@googledeepmind

3 years ago

Using negotiation algorithms, agents can form contracts regarding their next move & outperform others without this ability. When agents may deviate from past contracts, we show that sanctioning peers who break promises fosters more truthful communication: dpmd.ai/diplomacy-tw

Likes: 56 · Replies: 1 · Retweets: 4

Feryal

@feryalmp

3 years ago

I’m super excited to share our work on AdA: An Adaptive Agent capable of hypothesis-driven exploration which solves challenging unseen tasks with just a handful of experience, at a similar timescale to humans. sites.google.com/corp/view/adap… See the thread for more details 👇 [1/N]

Likes: 1.1K · Replies: 22 · Retweets: 256

Google DeepMind

@googledeepmind

3 years ago

Football players can tackle, get up, kick and chase a ball in one seamless motion. How could robots master these motor skills? ⚽ We trained AI agents to demonstrate these agile behaviours using end-to-end reinforcement learning. Find out more: dpmd.ai/41N3CsM

Likes: 2.2K · Replies: 84 · Retweets: 632

Demis Hassabis

@demishassabis

2 years ago

Our latest work in Nature today: #AlphaDev discovered a new faster sorting algorithm that we open sourced to the main C++ library for all developers to use. This is just the beginning of AI being used to find many more efficiencies in code in future dpmd.ai/alphadev-tw

Likes: 3.3K · Replies: 84 · Retweets: 739

Google DeepMind

@googledeepmind

2 years ago

How can AI work better with human experts in areas like medicine where safety is paramount? 🩺 With Google AI, we propose CoDoC: a model that learns when predictive AI is offering correct information - and when it's better to defer to a clinician. dpmd.ai/3NTA5bb

Likes: 395 · Replies: 15 · Retweets: 102

Petar Veličković

@petarv_93

2 years ago

⚽🌐🕸️🤖 arXiv:2310.10553 arxiv.org/abs/2310.10553

Likes: 194 · Replies: 4 · Retweets: 26

Marco Jiralerspong

@marcojira

2 years ago

Tired of using FID for evaluating generative models? Come to our #NeurIPS2023 poster on FLS, a new complete metric for generative models that also penalizes overfitting! neurips.cc/virtual/2023/p… github.com/marcojira/fls Joey Bose Ian Gemp Chongli Qin Yoram Bachrach Gauthier Gidel

Likes: 27 · Replies: 1 · Retweets: 5

David Stutz

@davidstutz92

2 years ago

Student researcher positions at Google DeepMind are now open for applications until Dec 15 – see our careers webpage. Also a good opportunity to re-share my article of how I prepared for my internship back in 2019: davidstutz.de/how-i-prepared…

Likes: 413 · Replies: 4 · Retweets: 76

Oleg Ostroumov

@olegostroumov

2 years ago

I just published the story of how I created the world’s first No-Limit Holdem poker solver and made $500k by age 23 medium.com/@olegostroumov… I had to keep the story secret since 2013, but now you can read how I went from near broke to reshaping world's toughest poker games

Likes: 233 · Replies: 14 · Retweets: 32

Ian Gemp

@drimgemp

2 years ago

What do haggling, debate, and convincing your kids to go to bed all have in common with Poker? With #LLMs, we map them all onto the framework of #gametheory; we then generate conversational strategies using the same methods that beat top Poker pros. arxiv.org/abs/2402.01704

Likes: 31 · Replies: 5 · Retweets: 9

Roberta Raileanu

@robertarail

9 months ago

Super excited to share 🧠MLGym 🦾 – the first Gym environment for AI Research Agents 🤖🔬 We introduce MLGym and MLGym-Bench, a new framework and benchmark for evaluating and developing LLM agents on AI research tasks. The key contributions of our work are: 🕹️ Enables the

Likes: 481 · Replies: 14 · Retweets: 117

Gabriel Synnaeve

@syhw

9 months ago

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution arxiv.org/abs/2502.18449 by Yuxiang Wei Sida Wang and the whole team! Get started with your favorite model here github.com/facebookresear…

Likes: 119 · Replies: 1 · Retweets: 29

Alexander Holden Miller

@alex_h_miller

7 months ago

Come join us! We have a crack team across US + UK (Yoram Bachrach) working on agents that can do AI research. We're hiring a full-time PhD new grad Research Scientist based in New York. Ideal candidate has published on RL / reasoning with LLMs.

Likes: 20 · Replies: 2 · Retweets: 3

Jakob Foerster

@j_foerst

6 months ago

Hello World: My team at FAIR / AI at Meta (AI Research Agent) is looking to hire contractors across software engineering and ML. If you are interested and based in the UK, please fill in the following short EoI form: docs.google.com/forms/d/e/1FAI…

Likes: 111 · Replies: 3 · Retweets: 23