Fangyu Liu (@hardy_qr) Twitter Tweets • TwiCopy

Fangyu Liu

@hardy_qr

Research Scientist @GoogleDeepMind working on Gemini♊ pretraining. PhD @CambridgeLTL. BMath @UWaterloo. From 成都🐼.
Opinions my own.

ID: 4923880075

Link: http://fangyuliu.me/about
Date: 18-02-2016 04:12:15

229 Tweets

1.1K Followers

1.1K Following

lmarena.ai (formerly lmsys.org)

@lmarena_ai

a year ago

Massive News from Chatbot Arena🔥 Google DeepMind's latest Gemini (Exp 1114), tested with 6K+ community votes over the past week, now ranks joint #1 overall with an impressive 40+ score leap — matching 4o-latest in and surpassing o1-preview! It also claims #1 on Vision

Likes: 1.1K · Replies: 59 · Retweets: 307

Kaushik Shivakumar

@19kaushiks

a year ago

Super excited for native image out to be released. Had the opportunity to work with a brilliant team to take this from idea to product over the past year. First going to early access partners, then more widely in early 2025. We'll be sharing some cool demos throughout the day

Likes: 108 · Replies: 2 · Retweets: 6

Fangyu Liu

@hardy_qr

a year ago

It's cool to see capabilities compounding. Progress at one front eventually accelerates progress at other fronts: ultra long-context, MM-in/out, reasoning/planning, agency, ... And it's all just one model!

Likes: 13 · Replies: 0 · Retweets: 0

Robert Riachi

@robertriachi

a year ago

A simple yet powerful example of the new Gemini 2.0 Flash's native multimodal input + output. Precise conversational editing & reasoning! Next step, Chess!

Likes: 418 · Replies: 23 · Retweets: 48

adi

@adonis_singh

a year ago

'massive organic church of gemini' - gemini 2.0 flash

Likes: 149 · Replies: 9 · Retweets: 8

Jeff Dean

@jeffdean

10 months ago

Introducing Gemini 2.0 Flash Thinking, an experimental model that explicitly shows its thoughts. Built on 2.0 Flash’s speed and performance, this model is trained to use thoughts to strengthen its reasoning. And we see promising results when we increase inference time

Likes: 3.3K · Replies: 129 · Retweets: 488

Fangyu Liu

@hardy_qr

10 months ago

What's your Final Answer?

Likes: 2 · Replies: 0 · Retweets: 0

Fangyu Liu

@hardy_qr

10 months ago

A good thinker doesn't necessarily have to underperform in other tasks 😉

Likes: 21 · Replies: 0 · Retweets: 1

Fangyu Liu

@hardy_qr

10 months ago

James made incredible contributions to the thinking models. Smart agents are only distillations of other smart agents.

Likes: 31 · Replies: 1 · Retweets: 0

Jack Rae

@jack_w_rae

10 months ago

Appreciate Aidan McLaughlin looking into the thinking model results. Originally scores looked weak as the response was plucked from the thought content versus output. We are looking into ways of making thinking output less confusing for people running evals. This is why we 🚢, to

Likes: 105 · Replies: 5 · Retweets: 9

Fangyu Liu

@hardy_qr

10 months ago

Felix was someone we all looked up to in the lab. I'm really sad.

Likes: 20 · Replies: 0 · Retweets: 0

Fangyu Liu

@hardy_qr

9 months ago

Happy to see people like our hyperfitting paper. We are presenting it at ICLR 2025 in Singapore later this year 🇸🇬

Likes: 50 · Replies: 2 · Retweets: 4

Machine Learning Street Talk

@mlstreettalk

9 months ago

Coding using Cursor 0.45 with the Google DeepMind (new) gemini-2.0-flash-thinking-exp model seems like the biggest step up in genai coding since Claude Sonnet 3.5 came out last June. This is unreal... forget about R1 folks - check out this new Gemini model! 🤯

Likes: 1.1K · Replies: 53 · Retweets: 131

Mostafa Dehghani

@m__dehghani

8 months ago

Anyone who has been in this room knows that it’s never just another day in here! This space has seen the extremes of chaos and genius! ...and we ship! developers.googleblog.com/en/experiment-… Happy Wednesday everyone!

Likes: 209 · Replies: 10 · Retweets: 29

lmarena.ai (formerly lmsys.org)

@lmarena_ai

7 months ago

BREAKING: Gemini 2.5 Pro is now #1 on the Arena leaderboard - the largest score jump ever (+40 pts vs Grok-3/GPT-4.5)! 🏆 Tested under codename "nebula"🌌, Gemini 2.5 Pro ranked #1🥇 across ALL categories and UNIQUELY #1 in Math, Creative Writing, Instruction Following, Longer

Likes: 2.2K · Replies: 75 · Retweets: 421

Ankesh Anand

@ankesh_anand

7 months ago

📈📈📈

Likes: 353 · Replies: 12 · Retweets: 28