Archit Sharma

@archit_sharma97

RL, post-training, reasoning research @GoogleDeepMind. prev: @Stanford @Google Brain @IITKanpur @MILAMontreal.

ID: 3269288695

Joined: 05-07-2015 17:03:56

306 Tweets

4.4K Followers

353 Following

Yoonho Lee

@yoonholeee

10 months ago

Excited to share our new work on test-time alignment! We introduce HyRe, a fast way to adapt large models (like LLM reward models) to new user preferences without extra training. Paper: arxiv.org/abs/2412.08812

Likes: 265

Replies: 4

Retweets: 47

alphaXiv

@askalphaxiv

9 months ago

We used Gemini 2 Flash to build Cursor for arXiv papers. Highlight any section of a paper to ask questions and “@” other papers to quickly add to context and compare results, benchmarks, etc.

Likes: 1.1K

Replies: 27

Retweets: 176

Anikait Singh

@anikait_singh_

8 months ago

Personalization in LLMs is crucial for meeting diverse user needs, yet collecting real-world preferences at scale remains a significant challenge. Introducing FSPO, a simple framework leveraging synthetic preference data to adapt to new users with meta-learning for open-ended QA! 🧵

Likes: 133

Replies: 1

Retweets: 11

Jeff Dean

@jeffdean

8 months ago

🥁Introducing Gemini 2.5, our most intelligent model with impressive capabilities in advanced reasoning and coding. Now integrating thinking capabilities, 2.5 Pro Experimental is our most performant Gemini model yet. It’s #1 on lmarena.ai (formerly lmsys.org) leaderboard. 🥇

Likes: 2.2K

Replies: 94

Retweets: 314

lmarena.ai (formerly lmsys.org)

@lmarena_ai

8 months ago

BREAKING: Gemini 2.5 Pro is now #1 on the Arena leaderboard - the largest score jump ever (+40 pts vs Grok-3/GPT-4.5)! 🏆 Tested under codename "nebula"🌌, Gemini 2.5 Pro ranked #1🥇 across ALL categories and UNIQUELY #1 in Math, Creative Writing, Instruction Following, Longer

Likes: 2.2K

Replies: 75

Retweets: 421

Jack Rae

@jack_w_rae

8 months ago

Today we are launching 2.5 Pro! I think it's the best model in the world. State-of-the-art reasoning and great vibes (+39 ELO gap on lmsys!). 2.5 Pro improves in coding, STEM, multimodal, instruction following, and lots more. Available in AI Studio & the Gemini App!

Likes: 474

Replies: 7

Retweets: 37

Sheryl Hsu

@sherylhsu02

7 months ago

Presenting this @iclr_conf Saturday 3-5:30 Hall 2B/3 poster 540. Come say hi!!

Likes: 40

Replies: 1

Retweets: 5

Archit Sharma

@archit_sharma97

6 months ago

when i finished grad school, part of me was hoping that i would no longer be working on results right up to deadline…excited about tomorrow!

Likes: 133

Replies: 5

Retweets: 1

Google DeepMind

@googledeepmind

6 months ago

Deep Think in 2.5 Pro has landed. 🤯 It’s a new enhanced reasoning mode using our research in parallel thinking techniques - meaning it explores multiple hypotheses before responding. This enables it to handle incredibly complex math and coding problems more effectively.

Likes: 3.3K

Replies: 68

Retweets: 429

Joe Stanton

@joe_stant

6 months ago

🚀🤔 Huge effort from our world class Research, Inference & Deployment teams

Likes: 298

Replies: 13

Retweets: 5

Garrett Bingham

@gjb_ai

6 months ago

Gemini 2.5 Pro Deep Think is an SVG artist!
Prompt: "Draw a SVG of a Pelican riding a bicycle"
Left: Gemini 2.5 Pro
Right: Gemini 2.5 Pro Deep Think
Credit: simonwillison.net/2024/Oct/25/pe…

Likes: 18

Replies: 1

Retweets: 1