JB Alayrac (@jalayrac) Twitter Tweets • TwiCopy

Demis Hassabis

7 months ago

The Gemini team cooked hard with Gemini 2.5 Pro, it's an awesome model that continues to lead lmarena.ai - huge congrats to the team! Try it for yourself in the Google Gemini App now. Can't wait for you all to see what else we've been cooking 👀

thumb_up_off_alt2,2K

chat_bubble_outline143

repeat202

shareShare

Vlad Feinberg

@feinbergvlad

7 months ago

Recently had the pleasure of lecturing back at Princeton in a grad seminar. I took the opportunity to cover how scaling laws have evolved since their inception, leaning heavily on great external content from my colleagues Sebastian Borgeaud JB Alayrac Jacob Austin . Content in thread

thumb_up_off_alt818

chat_bubble_outline5

repeat109

shareShare

Ani Baddepudi

@anibaddepudi

6 months ago

although the vision leaderboard doesn't capture every vision use case, 60+ elo points reflects the significant step in core vision capabilities like transcription, spatial understanding, reading charts/diagrams & many more. Still a lot more to do, but 2.5 Pro is the best vision

thumb_up_off_alt165

chat_bubble_outline7

repeat8

shareShare

Mat Velloso

@matvelloso

6 months ago

Worth reading this. The video understanding capabilities of Gemini are fantastic.

thumb_up_off_alt8

chat_bubble_outline1

repeat2

shareShare

Demis Hassabis

@demishassabis

6 months ago

Gemini 2.5 Pro is incredible at video understanding, try posting a YouTube link into AI studio ai.dev and asking it questions about the video. You will be amazed!

thumb_up_off_alt2,2K

chat_bubble_outline91

repeat204

shareShare

JB Alayrac

@jalayrac

6 months ago

A lot of work went to make Gemini 2.5 SOTA at video understanding, check out this 🧵 for more details! Looking back at where we were a year ago, the progress really feels phenomenal! So many things to unlock and enable from video 🎥 and we are only getting started!

thumb_up_off_alt149

chat_bubble_outline5

repeat12

shareShare

Tobias Weyand

@0xtob

6 months ago

Gemini 2.5 Pro sets the state of the art on our newly released Minerva video reasoning benchmark by scoring 63.5%. 📜 Paper: arxiv.org/abs/2505.00681… 📊 Dataset: github.com/google-deepmin…

thumb_up_off_alt19

chat_bubble_outline0

repeat3

shareShare

Ani Baddepudi

@anibaddepudi

6 months ago

The Gemini 2.5 models are magical for analyzing sports video. We asked Gemini to find Draymond's defensive plays from a highlights reel, which requires the model to: - reason “over pixels” to identify defensive plays - identify players in the video using its world knowledge -

thumb_up_off_alt274

chat_bubble_outline6

repeat34

shareShare

Demis Hassabis

@demishassabis

6 months ago

cooking up something tasty for tomorrow...

thumb_up_off_alt5,5K

chat_bubble_outline419

repeat298

shareShare

Fei Xia

@xf1280

6 months ago

Excited that our work on Gemini Robotics and Gemini spatial understanding have just been featured on #GoogleIO stage! I believe that a frontier model possessing strong real-world understanding capabilities represents the ultimate path to embodied AGI, and we are making rapid

thumb_up_off_alt160

chat_bubble_outline7

repeat18

shareShare

Gabriel Barth-Maron

@gbarthmaron

6 months ago

People want Veo 3 so we are giving access to Pro subscribers in 71 countries as of...now!

thumb_up_off_alt22

chat_bubble_outline1

repeat3

shareShare

Antoine Yang

@antoineyang2

5 months ago

By popular request, you can now specify frames per second (fps), as well as start and end times, for videos in AI Studio ⏩

thumb_up_off_alt29

chat_bubble_outline3

repeat4

shareShare

Visual Geometry Group (VGG)

@oxford_vgg

5 months ago

Many Congratulations to Jianyuan #CVPR2025 2025, Minghao Chen, Nikita Karaev, Andrea Vedaldi, Christian Rupprecht and David Novotny for winning the Best Paper Award @CVPR for "VGGT: Visual Geometry Grounded Transformer" 🥇🎉 🙌🙌 #CVPR2025!!!!!!

Many Congratulations to <a href="/jianyuan_wang/">Jianyuan<a href="/CVPR/">#CVPR2025</a> 2025</a>, <a href="/MinghaoChen23/">Minghao Chen</a>, <a href="/n_karaev/">Nikita Karaev</a>, Andrea Vedaldi, Christian Rupprecht and <a href="/davnov134/">David Novotny</a> for winning the Best Paper Award @CVPR for "VGGT: Visual Geometry Grounded Transformer" 🥇🎉 🙌🙌 #CVPR2025!!!!!!

thumb_up_off_alt474

chat_bubble_outline17

repeat65

shareShare

elie

@eliebakouch

5 months ago

Pre-training is not dead

thumb_up_off_alt310

chat_bubble_outline9

repeat19

shareShare

Antoine Yang

@antoineyang2

5 months ago

The newly generally available Gemini 2.5 Flash and Pro are even better at video understanding than the versions we shared in the blog a month ago, see more details in the tech report 😀

thumb_up_off_alt105

chat_bubble_outline2

repeat19

shareShare

Ani Baddepudi

@anibaddepudi

5 months ago

You can now sample at higher frame rates (default 1 FPS), and specify start and end times for videos in the Gemini API! We’ve been blown away by all the ways developers are using Gemini to process videos, and see a ton of devs manually clipping and slowing down videos to use

thumb_up_off_alt62

chat_bubble_outline7

repeat6

shareShare

Logan Kilpatrick

@officiallogank

5 months ago

We just shipped video FPS support in the Gemini API, so you can dynamically customize how many frames per second you want the model to see, unlocking lots of interesting new video use cases! 📹

thumb_up_off_alt811

chat_bubble_outline57

repeat47

shareShare

Ani Baddepudi

@anibaddepudi

3 months ago

gemini's still the only frontier model that supports native video input (and is amazing at it!) incredible amount of real-world utility given how much of the world's information is increasingly in video

thumb_up_off_alt303

chat_bubble_outline22

repeat17

shareShare