Ruben Villegas (@rubenevillegas)'s Twitter Profile
Ruben Villegas

@rubenevillegas

Generative AI @GoogleDeepMind.
🤖 Generative Models of Video #Veo2 #Veo #Phenaki
💼 Past: Character Animation @AdobeResearch

ID: 746345000

Link: https://rubenvillegas.me
Joined: 09-08-2012 02:05:00

1.1K Tweets

1.1K Followers

337 Following

Mohammad Saffar (@msaffar3)

I think it is time to revisit scaling laws and incorporate FLOPs as a separate variable. Parameter and token counts alone do not give a full view as we can change FLOPs independently. Legendary approaches like 2-simplicial attn paper proposed by @Happylemon56775 and the

Sander Dieleman (@sedielem)

Hello #ICML2025👋, anyone up for a diffusion circle? We'll just sit down somewhere and talk shop. 🕒Join us at 3PM on Thursday July 17. We'll meet here (see photo, near the west building's west entrance), and venture out from there to find a good spot to sit. Tell your friends!

Tiange Luo (@tiangeluo)

Introducing Visual Test-time Scaling for GUI Agent Grounding (ICCV'25, completed prior to the release of OpenAI-O3). When "thinking with images", the key challenge is designing the actions in pixel space. We can zoom into regions of varying sizes and shapes, apply image

Ben Poole (@poolio)

Dude, follow @GDMGreenfield. They produce the most incredible generations from our models, and always have crazy cool tips and tricks.

Lajanugen Logeswaran (@lajanugen)

🚀 Check out Tiange's RegionFocus method — a visual test-time scaling technique that boosts GUI agents' grounding abilities! 🔥 Combined with Qwen2.5-72B, it achieves state-of-the-art results on the Screenspot-Pro benchmark. 💻 Code github.com/tiangeluo/Regi…

Ethan Mollick (@emollick)

"[video game] as a community theater production" may be one of the most delightful Veo 3 Fast prompts. Please enjoy, in order: GTA, Pokemon, Mario Kart, The Witcher 3, Stardew Valley, Tetris, Mortal Kombat, The Sims, & Death Stranding(!) Yes, the whole prompt was the one above.

Thang Luong (@lmthang)

Very excited to share that an advanced version of Gemini Deep Think is the first to have achieved gold-medal level in the International Mathematical Olympiad! 🏆, solving five out of six problems perfectly, as verified by the IMO organizers! It’s been a wild run to lead this

Demis Hassabis (@demishassabis)

Official results are in - Gemini achieved gold-medal level in the International Mathematical Olympiad! 🏆 An advanced version was able to solve 5 out of 6 problems. Incredible progress - huge congrats to Thang Luong and the team! deepmind.google/discover/blog/…

Demis Hassabis (@demishassabis)

Btw as an aside, we didn’t announce on Friday because we respected the IMO Board's original request that all AI labs share their results only after the official results had been verified by independent experts & the students had rightly received the acclamation they deserved

Jeff Dean (@jeffdean)

We're all excited to see our advanced Gemini Deep Think model achieve a gold-medal level performance in the recent IMO, solving 5 of the 6 problems perfectly (35 of 42 points)! 🥇 Unlike our entry last year, which first converted the problems to a formal proof language (Lean),

Y Combinator (@ycombinator)

Chelsea Finn on building general-purpose robotics, and bringing intelligence into the physical world. At AI Startup School in San Francisco. 00:00 - General Purpose Robots 00:11 - Challenges in Robotics Applications 00:57 - Physical Intelligence: A New Approach

Demis Hassabis (@demishassabis)

You know what's cool... a quadrillion tokens. We processed almost 1,000,000,000,000,000 tokens last month, more than double the amount from May. 📈

Aäron van den Oord (@avdnoord)

We updated our Imagen 4 models and Ultra is tied for #1 on the lmarena leaderboard! The models are available in Google AI Studio and the Gemini API - try them out and let us know what you think.

Oliver Wang (@oliver_wang2)

We're back (tied) at #1! Use the model today on Google AI Studio aistudio-preprod.corp.google.com/prompts/new_im… and the Gemini API. Stay tuned, more exciting things are on the way!

Sander Dieleman (@sedielem)

Transformers haven't changed much since 2017, but there have been some innovations over the years. This is an excellent summary of architectural differences in recent LLMs. Nice diagrams too! 👏 It would be great to see something like this for diffusion Transformers as well 🤔