Archit Sharma (@archit_sharma97)'s Twitter Profile
Archit Sharma

@archit_sharma97

RL, post-training, reasoning research @GoogleDeepMind. prev: @Stanford @Google Brain @IITKanpur @MILAMontreal.

ID: 3269288695

Joined: 05-07-2015 17:03:56

306 Tweets

4.4K Followers

353 Following

Yoonho Lee (@yoonholeee)

Excited to share our new work on test-time alignment! We introduce HyRe, a fast way to adapt large models (like LLM reward models) to new user preferences without extra training. Paper: arxiv.org/abs/2412.08812

alphaXiv (@askalphaxiv)

We used Gemini 2 Flash to build Cursor for arXiv papers. Highlight any section of a paper to ask questions and “@” other papers to quickly add to context and compare results, benchmarks, etc.

Anikait Singh (@anikait_singh_)

Personalization in LLMs is crucial for meeting diverse user needs, yet collecting real-world preferences at scale remains a significant challenge. Introducing FSPO, a simple framework leveraging synthetic preference data to adapt new users with meta-learning for open-ended QA! 🧵

Jeff Dean (@jeffdean)

🥁Introducing Gemini 2.5, our most intelligent model with impressive capabilities in advanced reasoning and coding. Now integrating thinking capabilities, 2.5 Pro Experimental is our most performant Gemini model yet. It’s #1 on lmarena.ai (formerly lmsys.org) leaderboard. 🥇

lmarena.ai (formerly lmsys.org) (@lmarena_ai)

BREAKING: Gemini 2.5 Pro is now #1 on the Arena leaderboard - the largest score jump ever (+40 pts vs Grok-3/GPT-4.5)! 🏆 Tested under codename "nebula"🌌, Gemini 2.5 Pro ranked #1🥇 across ALL categories and UNIQUELY #1 in Math, Creative Writing, Instruction Following, Longer

Jack Rae (@jack_w_rae)

Today we are launching 2.5 Pro! I think it's the best model in the world. State-of-the-art reasoning and great vibes (+39 ELO gap on lmsys!) 2.5 Pro improves in coding, stem, multimodal, instruction following, and lots more. Available in AI Studio & the Gemini App!

Archit Sharma (@archit_sharma97)

when i finished grad school, part of me was hoping that i would no longer be working on results right up to deadline…excited about tomorrow!

Google DeepMind (@googledeepmind)

Deep Think in 2.5 Pro has landed. 🤯 It’s a new enhanced reasoning mode using our research in parallel thinking techniques - meaning it explores multiple hypotheses before responding. This enables it to handle incredibly complex math and coding problems more effectively.

Garrett Bingham (@gjb_ai)

Gemini 2.5 Pro Deep Think is an SVG artist!

Prompt: "Draw a SVG of a Pelican riding a bicycle"
Left: Gemini 2.5 Pro
Right: Gemini 2.5 Pro Deep Think

Credit: simonwillison.net/2024/Oct/25/pe…