May 2026 LM Arena Leaderboard Update: Ernie 5.1 Holds Chinese #1, DeepSeek V4 Pro Rises to 23rd

Leaderboard Results at a Glance

The early May 2026 LM Arena text model leaderboard update reveals a noteworthy trend: the collective rise of Chinese models in the global arena.

Key Rankings

Global Rank	Model	Company	Country	Change
13	Ernie 5.1 Preview	Baidu	🇨🇳	Stable
16	GPT-5.5	OpenAI	🇺🇸	↓
22	mimo-v2.5-pro	Xiaomi	🇨🇳	↑
23	DeepSeek V4 Pro	DeepSeek	🇨🇳	↑

Data Interpretation

Ernie 5.1 Preview holds Chinese #1: At global rank 13, Baidu’s Ernie 5.1 Preview remains the highest-ranked Chinese model and holds steady inside the global top 15.

GPT-5.5 is no longer “solidly holding”: Rank 16 represents a decline. Evaluators note that it can no longer firmly hold its position, suggesting that the advantage of OpenAI’s flagship model is narrowing as Chinese models catch up.

Xiaomi and DeepSeek’s upward momentum: Both models moved up in this update and now sit at ranks 22 and 23, six and seven places behind GPT-5.5 at rank 16. That proximity was unthinkable just six months ago.
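The rank gaps implied by the Key Rankings table can be checked with a few lines of arithmetic. The sketch below is only an illustration: the names `RANKINGS` and `rank_gaps` are made up for this example and are not part of any LM Arena tooling, and the rows are copied by hand from the table above.

```python
# Rows copied from the Key Rankings table: (global rank, model, company).
RANKINGS = [
    (13, "Ernie 5.1 Preview", "Baidu"),
    (16, "GPT-5.5", "OpenAI"),
    (22, "mimo-v2.5-pro", "Xiaomi"),
    (23, "DeepSeek V4 Pro", "DeepSeek"),
]


def rank_gaps(rankings, reference_model):
    """Return {model: rank - reference rank}.

    A negative value means the model is ranked above the reference,
    a positive value means it is ranked below.
    """
    ref_rank = next(rank for rank, model, _ in rankings if model == reference_model)
    return {
        model: rank - ref_rank
        for rank, model, _ in rankings
        if model != reference_model
    }


gaps = rank_gaps(RANKINGS, "GPT-5.5")
for model, gap in gaps.items():
    position = "above" if gap < 0 else "below"
    print(f"{model}: {abs(gap)} places {position} GPT-5.5")
```

Measured this way, only Ernie 5.1 Preview sits above GPT-5.5, while the Xiaomi and DeepSeek models trail it by single-digit margins.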

Landscape Judgment

Chinese Models: From “Catching Up” to “Running Alongside”

Current Chinese model distribution in LM Arena:

Global 10-15: Ernie 5.1 Preview (Chinese ceiling)
Global 20-25: mimo-v2.5-pro, DeepSeek V4 Pro (catching-up group)
Global 25-35: GLM-5.1, Kimi K2.6, Qwen 3.6 (main force)

Chinese models have collectively moved up 5-10 positions compared to six months ago.

US Models: Advantage is Shrinking

GPT-5.5’s decline signals:

Chinese models are improving faster than US models are iterating
Price is beginning to influence voting behavior
“US models only” is no longer the optimal strategy

Actionable Recommendations

Decision Dimension	US Models	Chinese Models
Absolute Performance	Still leading	Gap rapidly closing
Price	Higher	Significantly lower
Compliance Risk	Exists	None
Local Support	Limited	Comprehensive

For China-market businesses, Chinese models’ advantages in compliance, cost, and localization can already compensate for minor performance gaps.

