
Michela Paganini
@wondermicky
Staff Research Scientist @DeepMind | LLMs, Evals & Model Understanding | Previously: @facebookAI | @Yale Physics PhD | @CERN | @BerkeleyLab | @UCBerkeley
ID: 112717746
http://mickypaganini.github.io 09-02-2010 13:37:28
4.4K Tweets
6.6K Followers
1.1K Following

Introducing FACTS Grounding. A new benchmark we’re launching with Google DeepMind to evaluate LLMs’ factual accuracy on over 1700 tasks. 🧠📐


We’ve partnered with Google DeepMind to publish a leaderboard of models on this new factuality benchmark. Check it out at: kaggle.com/facts-leaderbo…

Breaking news from Text-to-Image Arena! 🖼️✨ Google DeepMind’s Imagen 3 debuts at #1, surpassing Recraft-v3 with a remarkable +70-point lead! Congrats to the Google Imagen team for setting a new bar! Try the best text2image at LMArena and cast your vote! More analysis👇



📢 Join Google DeepMind's DEER team & shape the future of #ResponsibleAI! We're hiring a #ResearchScientist (Fixed Term Contract, 12 months) to tackle fairness in multi-modal AI. Make a real-world impact! Apply by Feb 7th: boards.greenhouse.io/deepmind/jobs/… Alicia Parrish

New paper alert from Google DeepMind! 🚨 We've put LLMs to the test as writing co-pilots – how good are they really at helping us write? LLMs are increasingly used for open-ended tasks like writing assistance, but how do we assess their effectiveness? 🤔 arxiv.org/abs/2503.19711

🚨 I’m hosting a Student Researcher at Google DeepMind! Join us on the Autonomous Assistants team (led by Edward Grefenstette) to explore multi-agent communication: how agents learn to interact, coordinate, and solve tasks together. DM me for details!

Meet Stitch by Google Labs, the easiest and fastest product to generate great designs and UIs. 🧵 stitch.withgoogle.com