Machel Reid (@machelreid)'s Twitter Profile
Machel Reid

@machelreid

research scientist @googledeepmind ♊️

ID: 807327556072402945

Website: http://machelreid.github.io | Joined: 09-12-2016 20:54:23

964 Tweets

2.2K Followers

1.1K Following

Logan Kilpatrick (@officiallogank)

Say hello to Grounding with Google Search, available in the Gemini API + Google AI Studio! You can now access real-time, fresh, up-to-date information from Google Search when building with Gemini by enabling the Grounding tool. developers.googleblog.com/en/gemini-api-…
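
A minimal sketch of what enabling the Grounding tool might look like with the google-genai Python SDK. The model id, the placeholder API key, and the exact config field names are assumptions based on the public Gemini API docs, not details from the tweet.

```python
# Sketch: enabling Google Search grounding via the google-genai Python SDK.
# Model id and config shape are assumptions; adjust to the model/SDK version
# you are actually using.
from google import genai
from google.genai import types

client = genai.Client(api_key="YOUR_API_KEY")  # placeholder key

response = client.models.generate_content(
    model="gemini-2.0-flash",  # assumed model id
    contents="What happened in AI news this week?",
    config=types.GenerateContentConfig(
        # The Grounding tool lets the model pull fresh results from Google Search.
        tools=[types.Tool(google_search=types.GoogleSearch())],
    ),
)

print(response.text)
```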

Noam Shazeer (@noamshazeer)

We’ve been *thinking* about how to improve model reasoning and explainability. Introducing Gemini 2.0 Flash Thinking, an experimental model trained to think out loud, leading to stronger reasoning performance. Excited to get this first model into the hands of developers to try.

Jack Rae (@jack_w_rae)

We released Gemini 2.0 Flash Thinking today! ⚡️🤔 It's a small step towards improved reasoning via inference-time compute, built on top of our small and mighty 2.0 Flash!

Graham Neubig (@gneubig)

I'm pretty amazed by how easy it is to localize frontend apps with AI agents. I had a meeting with people in Japan, and wanted to create a prototype of our app in Japanese. In about 8 hours, OpenHands generated 6200 lines of code and now our app is localized in 10 languages.

Sebastian Ruder (@seb_ruder)

A new year, a new challenge. I recently joined AI at Meta to improve evaluation and benchmarking of LLMs. I'm excited to push on making LLMs more useful and accessible, via open-sourcing data/models and real-world applications. I'll continue to be based in Berlin.

koray kavukcuoglu (@koraykv)

1/ Today we are releasing Gemini 2.5 Pro Experimental, our newest Gemini model with integrated “thinking” and significant performance gains. Very proud of the whole team! 🧵

Oriol Vinyals (@oriolvinyalsml)

Introducing Gemini 2.5 Pro Experimental! 🎉 Our newest Gemini model has stellar performance across math and science benchmarks. It’s an incredible model for coding and complex reasoning, and it’s #1 on the lmarena.ai leaderboard by a drastic 40 ELO margin. Only a handful of

Logan Kilpatrick (@officiallogank)

Gemini 2.5 Flash is here, our first unified reasoning model with thinking budgets. 🔥 It’s on the Pareto frontier and punches above its price and size!! developers.googleblog.com/en/start-build…

Google AI Developers (@googleaidevs)

⚡Gemini 2.5 Flash is now in Preview. Available on Google AI Studio, it’s our first fully hybrid reasoning model that lets you toggle thinking or set budgets for the optimal quality/cost/latency mix. Maintain 2.0 Flash speed + improved perf even when thinking is off. →
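
A rough sketch of the toggle described here, assuming the google-genai SDK's ThinkingConfig; the preview model id is an assumption and may differ from what is currently served.

```python
# Sketch: toggling thinking on Gemini 2.5 Flash via a thinking budget.
# Field names follow the google-genai SDK; the model id is an assumed
# preview identifier.
from google import genai
from google.genai import types

client = genai.Client(api_key="YOUR_API_KEY")  # placeholder key

prompt = "Explain the trade-off between latency and answer quality for reasoning models."

# Thinking off: a budget of 0 keeps roughly 2.0-Flash-style speed and cost.
fast = client.models.generate_content(
    model="gemini-2.5-flash-preview-04-17",  # assumed preview model id
    contents=prompt,
    config=types.GenerateContentConfig(
        thinking_config=types.ThinkingConfig(thinking_budget=0),
    ),
)

# Thinking on: allow up to 8k tokens of internal reasoning before answering.
thoughtful = client.models.generate_content(
    model="gemini-2.5-flash-preview-04-17",
    contents=prompt,
    config=types.GenerateContentConfig(
        thinking_config=types.ThinkingConfig(thinking_budget=8192),
    ),
)

print(fast.text)
print(thoughtful.text)
```

The budget is the knob for the quality/cost/latency mix the tweet mentions: set it low (or to zero) for cheap, fast responses, and raise it when the task warrants more deliberation.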

Melvin Johnson (@melvinjohnsonp)

Excited to introduce Gemini 2.5 Flash, our most cost-efficient thinking model. We are once again at the frontier here. Pretty good, well-rounded performance.

Demis Hassabis (@demishassabis)

We've just given our most powerful workhorse model a big upgrade to Gemini 2.5 Flash. You can try it now in preview on ai.dev - yet another Gemini data point on the cost-performance Pareto frontier!

Nathan Lambert (@natolambert)

Gemini 2.5 Pro shipping a granular thinking budget that works could actually be a pretty big deal. The glorious slider that comes before the model just knows how much thinking to do. Helps limit overthinking, helps collect good data on "how hard users think the model is"

Sundar Pichai (@sundarpichai)

Our latest Gemini 2.5 Pro update is now in preview. It’s better at coding, reasoning, science + math, shows improved performance across key benchmarks (AIDER Polyglot, GPQA, HLE to name a few), and leads lmarena.ai with a 24pt Elo score jump since the previous version. We also

Melvin Johnson (@melvinjohnsonp)

Our latest update to Gemini 2.5 Pro is here. It's SoTA on GPQA Diamond, AIDER and HLE. The team has also worked hard to improve the model on style, persona and creativity. We're excited to see what you build with it. Please let us know any feedback as we're eternally cooking.

Paul Gauthier (@paulgauthier)

Gemini 2.5 Pro 06-05 has set a new SOTA on the aider polyglot coding benchmark, scoring 83% with 32k thinking tokens. The default thinking mode, where Gemini self-determines the thinking budget, scored 79%. Full leaderboard: aider.chat/docs/leaderboa…
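
Aider runs its own benchmark harness, so the following is only an illustration of the two settings being compared: a fixed 32k thinking budget versus leaving the budget unset so Gemini self-determines it. It assumes the google-genai SDK's ThinkingConfig and a preview model id.

```python
# Sketch: the two thinking settings compared in the aider polyglot results,
# expressed as plain Gemini API calls. Not aider's actual harness; the model id
# and the placeholder task are assumptions.
from google import genai
from google.genai import types

client = genai.Client(api_key="YOUR_API_KEY")  # placeholder key

task = "Rewrite this function without the nested loops: ..."  # placeholder task

# Fixed budget: up to 32k tokens of thinking (the configuration that scored 83%).
fixed = types.GenerateContentConfig(
    thinking_config=types.ThinkingConfig(thinking_budget=32768),
)

# Default mode: no explicit budget, the model decides how much to think (scored 79%).
default = types.GenerateContentConfig()

for cfg in (fixed, default):
    response = client.models.generate_content(
        model="gemini-2.5-pro-preview-06-05",  # assumed preview model id
        contents=task,
        config=cfg,
    )
    print(response.text)
```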
