Yoram Bachrach (@yorambac)'s Twitter Profile
Yoram Bachrach

@yorambac

Research Scientist at Meta

ID: 12440102

Link: https://sites.google.com/view/yoram-bachrach
Joined: 19-01-2008 20:04:08

30 Tweets

1.1K Followers

2.2K Following

Google DeepMind (@googledeepmind)

Internship applications are now open! This year we have opportunities across various teams and offices 🌎🌍 Apply today via dpmd.ai/internships and learn more about the experience below ⬇️ #DeepMindInterns

Alexander Novikov (@sashavnovikov)

#AlphaTensor: adapting AlphaZero to symbolically find better (exact) matrix multiplication algorithms. By putting coefficients of the symbolic expression into a tensor, the algorithm design task becomes an (NP-hard) low-rank tensor decomposition problem, which we attacked with RL

Google DeepMind (@googledeepmind)

Introducing the Perception Test, a new multimodal benchmark using real-world videos to help evaluate the perception capabilities of a model: dpmd.ai/dm-perception-… 1/

Science Magazine (@sciencemagazine)

A newly developed #AI agent called DeepNash learned to play Stratego—one of the few board games AI has not yet mastered—at a human expert level, @DeepMind researchers report in Science. Learn more: scim.ag/JA

Google DeepMind (@googledeepmind)

Today in Nature Communications, we explore how AI agents can better communicate and cooperate in Diplomacy - a 7-player board game of coordination and alliance formation. 🤝 Find out more: dpmd.ai/diplomacy-natu…

Google DeepMind (@googledeepmind)

Using negotiation algorithms, agents can form contracts regarding their next move & outperform others without this ability. When agents may deviate from past contracts, we show that sanctioning peers who break promises fosters more truthful communication: dpmd.ai/diplomacy-tw

Feryal (@feryalmp)

I’m super excited to share our work on AdA: An Adaptive Agent capable of hypothesis-driven exploration which solves challenging unseen tasks with just a handful of experience, at a similar timescale to humans. sites.google.com/corp/view/adap… See the thread for more details 👇 [1/N]

Google DeepMind (@googledeepmind)

Football players can tackle, get up, kick and chase a ball in one seamless motion. How could robots master these motor skills? ⚽ We trained AI agents to demonstrate these agile behaviours using end-to-end reinforcement learning. Find out more: dpmd.ai/41N3CsM

Demis Hassabis (@demishassabis)

Our latest work in nature today: #AlphaDev discovered a new faster sorting algorithm that we open sourced to the main C++ library for all developers to use. This is just the beginning of AI being used to find many more efficiencies in code in future dpmd.ai/alphadev-tw

Google DeepMind (@googledeepmind)

How can AI work better with human experts in areas like medicine where safety is paramount? 🩺 With Google AI, we propose CoDoC: a model that learns when predictive AI is offering correct information - and when it's better to defer to a clinician. dpmd.ai/3NTA5bb

David Stutz (@davidstutz92)

Student researcher positions at Google DeepMind are now open for applications until Dec 15 – see our careers webpage. Also a good opportunity to re-share my article of how I prepared for my internship back in 2019: davidstutz.de/how-i-prepared…

Oleg Ostroumov (@olegostroumov)

I just published the story of how I created the world’s first No-Limit Holdem poker solver and made $500k by age 23 medium.com/@olegostroumov… I had to keep the story secret since 2013, but now you can read how I went from near broke to reshaping world's toughest poker games

Ian Gemp (@drimgemp)

What do haggling, debate, and convincing your kids to go to bed all have in common with Poker? With #LLMs, we map them all onto the framework of #gametheory; we then generate conversational strategies using the same methods that beat top Poker pros. arxiv.org/abs/2402.01704

Roberta Raileanu (@robertarail)

Super excited to share 🧠MLGym 🦾 – the first Gym environment for AI Research Agents 🤖🔬 We introduce MLGym and MLGym-Bench, a new framework and benchmark for evaluating and developing LLM agents on AI research tasks. The key contributions of our work are: 🕹️ Enables the

Gabriel Synnaeve (@syhw)

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution arxiv.org/abs/2502.18449 by Yuxiang Wei Sida Wang and the whole team! Get started with your favorite model here github.com/facebookresear…

Alexander Holden Miller (@alex_h_miller)

Come join us! We have a crack team across US + UK (Yoram Bachrach) working on agents that can do AI research. We're hiring a full-time PhD new grad Research Scientist based in New York. Ideal candidate has published on RL / reasoning with LLMs.

Jakob Foerster (@j_foerst)

Hello World: My team at FAIR / AI at Meta (AI Research Agent) is looking to hire contractors across software engineering and ML. If you are interested and based in the UK, please fill in the following short EoI form: docs.google.com/forms/d/e/1FAI…