Wei-Lin Chiang (@infwinston) Twitter Tweets • TwiCopy

Aydin Senkut

7 months ago

Super excited to partner w Ion Stoica, Anastasios Nikolas Angelopoulos & Wei-Lin Chiang and be part of lmarena.ai (formerly lmsys.org) mega seed round as it has rapidly transcended from an ambitious academic project to a critical cornerstone of AI model evaluations, loyally used by all the major players. In Google IO

Wei-Lin Chiang

@infwinston

7 months ago

Super excited to share the NEW LMArena is now live. Huge shoutout to the team for all the hard work! Check it out and let us know your feedback.

Robert Weber

@robertnweber

7 months ago

Huge launch from the LMArena team today 🎉 They’ve rebuilt the platform from the ground up with faster UI, mobile support, multimodal evals, and a cleaner leaderboard experience. All open and community-driven. And they're hiring! jobs.ashbyhq.com/lmarena

lmarena.ai (formerly lmsys.org)

@lmarena_ai

7 months ago

Breaking: Claude Opus 4 jumps to #1 in WebDev Arena!

A strong comeback from Anthropic - Opus 4 and Sonnet 4 now on top of the chart, surpassing previous Claude 3.7 and matching Gemini 2.5 Pro.

Massive congrats to Anthropic🔥

Laude Ventures

@laudeventures

7 months ago

Proud to support Anastasios Nikolas Angelopoulos, Wei-Lin Chiang, and Ion Stoica as lmarena.ai lays the groundwork for a more rigorous and accountable AI ecosystem. newblog.lmarena.ai/new-lmarena/

Anjney Midha 🇺🇸

@anjneymidha

7 months ago

1/ Humanity doesn't need more AI benchmarks. We need real-time, real-world, continuous testing of AI systems. I sat down with Ion Stoica, Wei-Lin Chiang, and Anastasios Nikolas Angelopoulos to unpack what lmarena.ai is building and why it's critical for AI reliability

a16z

@a16z

7 months ago

The future of AI evaluation: real-world feedback, from real users. lmarena.ai makes that possible: models tested side by side, in public, and voted on by the people who use them. Hear how it started — and why human preference is the foundation of reliable AI in the full

lmarena.ai (formerly lmsys.org)

@lmarena_ai

7 months ago

Exciting news: OpenAI’s GPT-Image-1 takes the #1 spot in the Text-to-Image Arena! 🖼️🏆

➤ Outperforms Google’s Imagen-3.0 by 50+ points
➤ Major leap over DALL·E 3

Huge congrats to OpenAI! 👏

johnnn

@johnnnavent

7 months ago

Hello friends, at lmarena.ai we're on the lookout for designers who are obsessed with craft, comfortable with complexity and have a track record of creating interfaces people love. If you're interested in helping us mold this product and brand into something special, send me a

Andy Konwinski

@andykonwinski

6 months ago

Today, I’m launching a deeply personal project. I’m betting $100M that we can help computer scientists create more upside impact for humanity. Built for and by researchers, including Jeff Dean & Joelle Pineau on the board, Laude Institute catalyzes research with real-world impact.

Wei-Lin Chiang

@infwinston

6 months ago

Image Edit Arena is now live!!

lmarena.ai (formerly lmsys.org)

@lmarena_ai

5 months ago

🚨 Breaking News: Grok 4's result is now live! With 4k+ community votes, xAI’s Grok-4 tied for #3 overall in Text Arena — a huge leap from Grok-3. It scores Top-3 across all categories (#1 in Math, #2 in Coding, #3 in Hard Prompts). Detailed analysis in the thread 🧵

lmarena.ai (formerly lmsys.org)

@lmarena_ai

5 months ago

We’re delivering a bundle of polish to the LMArena experience, most of them inspired directly by your feedback 💬 Here’s a look at what’s new👇

Wei-Lin Chiang

@infwinston

5 months ago

Congrats Kimi.ai team!

lmarena.ai (formerly lmsys.org)

@lmarena_ai

5 months ago

🧵 Top 10 Open Models by Provider

Though proprietary models often top the charts, open models are also paired in battle mode, and ranked on our public leaderboards. Here are the top 10 when stacked by top open model by provider.
- #1 Kimi K2 (Modified MIT) Kimi.ai
- #2

Clayton Thorrez

@cthorrez

5 months ago

A story in 3 parts: :D

lmarena.ai (formerly lmsys.org)

@lmarena_ai

5 months ago

Exciting Text-to-Image leaderboard update!

Two new Imagen 4.0 models from Google DeepMind just dropped:
🥇 Imagen 4.0 Ultra (v2) ties at #1 with OpenAI’s GPT-Image-1
🥉 Imagen 4.0 (v2) lands strong at #3

Congrats to the Google Imagen team!

Aäron van den Oord

@avdnoord

5 months ago

We updated our Imagen 4 models and Ultra is tied for #1 on the lmarena leaderboard! The models are available in Google AI Studio and the Gemini API - try them out and let us know what you think.

lmarena.ai (formerly lmsys.org)

@lmarena_ai

5 months ago

We've been busy lately: new arenas, new models, and new methodologies! So we've created a changelog page where you can track all the updates we make to the leaderboards. In addition to the new Search Arena, and new models like the latest Imagen 4, Grok 4, Kimi K2, Seedream 3 and

Clayton Thorrez

@cthorrez

5 months ago

Been a fun first 2 weeks :)
