Samuel Schmidgall (@srschmidgall) 's Twitter Profile
Samuel Schmidgall

@srschmidgall

PhD @JohnsHopkins // student researcher @Google @GoogleDeepmind // prev intern @AMD @Stanford

ID: 1323123590020169728

Link: https://samuelschmidgall.github.io/ · Joined: 02-11-2020 04:43:47

520 Tweets

2.2K Followers

413 Following

Google DeepMind (@googledeepmind) 's Twitter Profile Photo

We’re helping robots self-improve with the power of LLMs. 🤖 Introducing the Summarize, Analyze, Synthesize (SAS) prompt, which analyzes how robots perform tasks based on their previous actions and then suggests ways for them to get better, demonstrated through table tennis. 🏓

Khaled Saab (@khaledsaab11) 's Twitter Profile Photo

Gemini powers our multimodal health research! 💙 In our new paper on multimodal AMIE, we're pushing conversational diagnostic AI beyond text to handle images such as skin photos, ECGs, and clinical docs, which provide crucial context in healthcare. Blog: goo.gle/42D0QcB

Sundar Pichai (@sundarpichai) 's Twitter Profile Photo

What a finish! Gemini 2.5 Pro just completed Pokémon Blue! Special thanks to Joel Z for creating and running the livestream, and to everyone who cheered Gem on along the way.

Google DeepMind (@googledeepmind) 's Twitter Profile Photo

Introducing AlphaEvolve: a Gemini-powered coding agent for algorithm discovery. It’s able to:
🔘 Design faster matrix multiplication algorithms
🔘 Find new solutions to open math problems
🔘 Make data centers, chip design and AI training more efficient across Google. 🧵

Max Fu (@letian_fu) 's Twitter Profile Photo

Tired of teleoperating your robots? We built a way to scale robot datasets without teleop, dynamic simulation, or even robot hardware. Just one smartphone scan + one human hand demo video → thousands of diverse robot trajectories, ready to train diffusion policies and VLA models.

Google DeepMind (@googledeepmind) 's Twitter Profile Photo

Deep Think in 2.5 Pro has landed. 🤯 It’s a new enhanced reasoning mode using our research in parallel thinking techniques, meaning it explores multiple hypotheses before responding. This enables it to handle incredibly complex math and coding problems more effectively.
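The core idea of "exploring multiple hypotheses before responding" can be sketched as best-of-N selection: sample several candidate answers in parallel, score each, and keep the best. This is a toy illustration only, not DeepMind's actual Deep Think method; `explore_hypotheses`, the generator, and the scoring function are all made up for the example.

```python
import random

def explore_hypotheses(generate, score, n=8, seed=0):
    """Toy 'parallel thinking': draw n candidate answers,
    score each one, and return the highest-scoring candidate.
    (Illustrative sketch only -- not the Deep Think algorithm.)"""
    rng = random.Random(seed)
    candidates = [generate(rng) for _ in range(n)]
    return max(candidates, key=score)

# Toy task: guess the integer closest to a hidden answer, 42.
best = explore_hypotheses(
    generate=lambda rng: rng.randint(0, 100),  # one "hypothesis"
    score=lambda x: -abs(x - 42),              # closer is better
)
```

With more samples (`n`), the best candidate tends to improve — the same intuition behind spending extra inference-time compute on hard problems.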

Google DeepMind (@googledeepmind) 's Twitter Profile Photo

We’ve developed Gemini Diffusion: our state-of-the-art text diffusion model. Instead of predicting text directly, it learns to generate outputs by refining noise, step-by-step. This helps it excel at coding and math, where it can iterate over solutions quickly. #GoogleIO
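The "refining noise, step-by-step" idea can be shown with a toy numeric sketch: start from pure noise and repeatedly nudge it toward the data. Real text diffusion uses a learned denoiser over token representations; the `toy_diffusion` function and its linear blending schedule below are invented for illustration only.

```python
import random

def toy_diffusion(target, steps=10, seed=0):
    """Toy sketch of diffusion-style generation: begin with pure
    Gaussian noise and refine it step by step toward the target.
    (Illustrative only; Gemini Diffusion's actual denoiser is a
    learned model over text, not this linear interpolation.)"""
    rng = random.Random(seed)
    x = [rng.gauss(0, 1) for _ in target]  # start from pure noise
    for t in range(steps):
        alpha = (t + 1) / steps            # refinement schedule: 0 -> 1
        # an idealized "perfect denoiser" simply blends toward the data
        x = [(1 - alpha) * xi + alpha * ti for xi, ti in zip(x, target)]
    return x

out = toy_diffusion([1.0, -2.0, 0.5])  # converges to the target
```

Because every step revises the whole output at once (rather than emitting tokens left to right), diffusion models can iterate over entire solutions quickly — the property the tweet highlights for coding and math.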

Google Health (@googlehealth) 's Twitter Profile Photo

Exciting news from #GoogleIO! We’re introducing MedGemma, our most capable open model for multimodal medical text and image comprehension built on Gemma 3.

Shan Chen (@shan23chen) 's Twitter Profile Photo

Designing a hard but useful benchmark has always been a passion of mine. Here we present MedBrowseComp, a deep research + computer use benchmark that is easy to verify (like BrowseComp from OpenAI) but still very expandable 💊! Project page: moreirap12.github.io/mbc-browse-app/ 1/n

Samuel Schmidgall (@srschmidgall) 's Twitter Profile Photo

🏥🤖 Surgical robots are expected to reach higher levels of autonomy thanks to increasingly capable AI. Check out our new article in IEEE Spectrum, which discusses the history and future of autonomous robotic surgery!

merve (@mervenoyann) 's Twitter Profile Photo

Google released MedGemma on I/O'25 👏
> 4B and 27B instruction fine-tuned vision LMs and a 4B pre-trained vision LM for medicine
> available with transformers from the get-go 🤗
they also released a cool demo for scan reading ⤵️

Derya Unutmaz, MD (@deryatr_) 's Twitter Profile Photo

Benchmarks for AI agents in medicine are critically important! This new Deep Research benchmark looks very promising, similar to the recently announced HealthBench from OpenAI. Such benchmarks will significantly accelerate the development of AI doctors in the very near future!

Danielle Bitterman, MD (@dbittermanmd) 's Twitter Profile Photo

Agents are all the rage and we need to measure and track their abilities in the medical domain. Enter MedBrowseComp, the 1st benchmark to assess agents' abilities to reason, navigate the web, and search for verifiable med info! More👇 moreirap12.github.io/mbc-browse-app/

Alec Stapp (@alecstapp) 's Twitter Profile Photo

Reminder: Only 14% of US residents are immigrants. But immigrants are responsible for 36% of aggregate innovation. Two-thirds of this contribution is due to making their native-born collaborators better.

DAIR.AI (@dair_ai) 's Twitter Profile Photo

8. MedBrowseComp — a new benchmark designed to evaluate LLM agents’ ability to perform complex, multi-hop medical fact-finding by browsing real-world, domain-specific web resources. x.com/shan23chen/sta…

Andrew Ng (@andrewyng) 's Twitter Profile Photo

I am alarmed by the proposed cuts to U.S. funding for basic research, and the impact this would have for U.S. competitiveness in AI and other areas. Funding research that is openly shared benefits the whole world, but the nation it benefits most is the one where the research is

Google DeepMind (@googledeepmind) 's Twitter Profile Photo

Introducing MedGemma, our most capable open model for multimodal medical text and image comprehension. 🩻 MedGemma is available now as part of Health AI Developer Foundations → goo.gle/medgemma

Michael Moor (@michael_d_moor) 's Twitter Profile Photo

🧵1/ ✨New preprint ✨ LLMs are getting better at answering medical questions. However, they still struggle to spot and fix errors in their own reasoning. That’s a big problem in medicine, where stakes are high and mistakes at any step could be critical. To address this issue,
