Compare AI providers across score, stability, momentum, and market share. Rankings derived from composite scores of 300 tracked models, updated hourly.
| # | Provider | Models | Avg Score | Best Model | Top Rank | 24h Avg | 7d Avg | Stability% | Free |
|---|---|---|---|---|---|---|---|---|---|
| 1 | Xiaomi | 3 | 84.2 | MiMo-V2-Omni(85.0) | #21 | +5.7 | +18.7 | 33% | 0 |
| 2 | ByteDance | 5 | 80.5 | Seed-2.0-Lite(85.0) | #24 | -1.8 | +4.0 | 20% | 0 |
| 3 | xAI | 10 | 78.7 | Grok 4.1 Fast(86.9) | #15 | +6.5 | -0.7 | 60% | 0 |
| 4 | Kuaishou | 1 | 77.4 | KAT-Coder-Pro V1(77.4) | #89 | +7.0 | -12.0 | 0% | 0 |
| 5 | Anthropic | 13 | 77.3 | Claude Opus 4.6(92.1) | #6 | -5.0 | -0.1 | 54% | 0 |
| 6 | reka | 1 | 77.2 | Reka Edge(77.2) | #93 | -4.0 | +208.0 | 0% | 0 |
| 7 | Cursor | 2 | 76.4 | Composer 2(76.4) | #101 | +9.0 | +6.5 | 50% | 0 |
| 8 | StepFun | 2 | 75.7 | Step 3.5 Flash (free)(78.2) | #85 | +5.0 | -10.0 | 50% | 1 |
| 9 | Moonshot AI | 4 | 73.8 | Kimi K2.5(85.0) | #31 | +4.0 | +4.5 | 0% | 0 |
| 10 | Meituan | 1 | 72.6 | LongCat Flash Chat(72.6) | #138 | -23.0 | -3.0 | 100% | 0 |
| 11 | Upstage | 1 | 72.5 | Solar Pro 3(72.5) | #139 | +1.0 | -19.0 | 0% | 0 |
| 12 | OpenAI | 60 | 72.4 | GPT-5.4 Pro(94.0) | #1 | +1.4 | +1.9 | 55% | 2 |
| 13 | MiniMax | 8 | 72.2 | MiniMax M2.5 (free)(83.4) | #51 | -3.9 | -11.0 | 38% | 1 |
| 14 | Tencent | 1 | 72.1 | Hunyuan A13B Instruct(72.1) | #142 | -19.0 | -29.0 | 0% | 0 |
| 15 | DeepSeek | 11 | 71.4 | R1 0528(77.6) | #88 | +8.6 | -3.1 | 55% | 0 |
| 16 | AI21 Labs | 1 | 71.0 | Jamba Large 1.7(71.0) | #151 | +3.0 | +7.0 | 0% | 0 |
| 17 | Alibaba | 50 | 70.9 | Qwen3.5-9B(85.0) | #25 | -0.9 | -0.6 | 38% | 3 |
| 18 | NVIDIA | 11 | 70.5 | Nemotron 3 Super (free)(84.1) | #47 | +2.3 | +3.8 | 27% | 4 |
| 19 | Inception | 3 | 70.5 | Mercury 2(81.3) | #69 | +7.0 | -1.3 | 33% | 0 |
| 20 | 23 | 69.2 | Gemini 3 Flash Preview(89.4) | #11 | +2.0 | -2.9 | 39% | 5 | |
| 21 | Baidu | 5 | 68.5 | ERNIE 4.5 VL 28B A3B(74.8) | #112 | -3.0 | +0.8 | 20% | 0 |
| 22 | deepcogito | 1 | 66.7 | Cogito v2.1 671B(66.7) | #175 | +10.0 | +15.0 | 0% | 0 |
| 23 | essentialai | 1 | 64.8 | Rnj 1 Instruct(64.8) | #188 | +8.0 | -6.0 | 0% | 0 |
| 24 | Writer | 1 | 64.7 | Palmyra X5(64.7) | #190 | -10.0 | -11.0 | 0% | 0 |
| 25 | arcee-ai | 7 | 64.7 | Trinity Mini(82.4) | #59 | -5.0 | -1.1 | 57% | 2 |
| 26 | Perplexity | 5 | 63.7 | Sonar Pro Search(85.0) | #39 | -4.8 | -9.8 | 40% | 0 |
| 27 | Amazon | 5 | 63.5 | Nova Premier 1.0(77.8) | #87 | -7.8 | +1.8 | 40% | 0 |
| 28 | aion-labs | 3 | 60.7 | Aion-2.0(69.2) | #159 | 0 | -2.3 | 33% | 0 |
| 29 | Allen AI | 4 | 60.1 | Olmo 3 32B Think(66.3) | #176 | -2.0 | +6.3 | 50% | 0 |
| 30 | Mistral AI | 25 | 59.7 | Mistral Small 4(79.4) | #79 | -4.1 | +3.6 | 48% | 1 |
| 31 | IBM | 1 | 55.1 | Granite 4.0 Micro(55.1) | #244 | +4.0 | +6.0 | 0% | 0 |
| 32 | Liquid AI | 5 | 54.4 | LFM2.5-1.2B-Thinking (free)(59.0) | #226 | -1.2 | -6.8 | 40% | 2 |
| 33 | Cohere | 4 | 49.9 | Command A(59.9) | #220 | +2.0 | +1.3 | 75% | 0 |
| 34 | Meta | 14 | 49.2 | Llama 4 Maverick(76.6) | #99 | -0.5 | -0.1 | 71% | 2 |
| 35 | Windsurf | 1 | 49.1 | SWE-1.5(49.1) | #263 | -2.0 | +1.0 | 100% | 0 |
| 36 | eleutherai | 1 | 47.3 | Llemma 7b(47.3) | #266 | 0 | +3.0 | 100% | 0 |
| 37 | Microsoft | 2 | 45.8 | Phi 4(59.5) | #222 | +2.5 | -1.5 | 100% | 0 |
| 38 | Vercel | 1 | 38.8 | autofixer-01(38.8) | #287 | -2.0 | +3.0 | 100% | 0 |
| 39 | Inflection | 2 | 36.6 | Inflection 3 Productivity(36.6) | #290 | +0.5 | 0 | 100% | 0 |
| 40 | JetBrains | 1 | 32.4 | Mellum(32.4) | #294 | +5.0 | 0 | 100% | 0 |
Providers with at least 2 models, ranked by percentage of models in a "stable" state. Higher stability means more consistent performance over time.
Distribution of tracked models across providers. Shows each provider's share of the total 300 models in the leaderboard.
| Provider | Models | Share | % |
|---|---|---|---|
| OpenAI | 60 | 20.0% | |
| Alibaba | 50 | 16.7% | |
| Mistral AI | 25 | 8.3% | |
| 23 | 7.7% | ||
| Meta | 14 | 4.7% | |
| Anthropic | 13 | 4.3% | |
| DeepSeek | 11 | 3.7% | |
| NVIDIA | 11 | 3.7% | |
| xAI | 10 | 3.3% | |
| MiniMax | 8 | 2.7% | |
| arcee-ai | 7 | 2.3% | |
| ByteDance | 5 | 1.7% | |
| Baidu | 5 | 1.7% | |
| Perplexity | 5 | 1.7% | |
| Amazon | 5 | 1.7% | |
| Liquid AI | 5 | 1.7% | |
| Moonshot AI | 4 | 1.3% | |
| Allen AI | 4 | 1.3% | |
| Cohere | 4 | 1.3% | |
| Xiaomi | 3 | 1.0% | |
| Inception | 3 | 1.0% | |
| aion-labs | 3 | 1.0% | |
| Cursor | 2 | 0.7% | |
| StepFun | 2 | 0.7% | |
| Microsoft | 2 | 0.7% | |
| Inflection | 2 | 0.7% | |
| Kuaishou | 1 | 0.3% | |
| reka | 1 | 0.3% | |
| Meituan | 1 | 0.3% | |
| Upstage | 1 | 0.3% | |
| Tencent | 1 | 0.3% | |
| AI21 Labs | 1 | 0.3% | |
| deepcogito | 1 | 0.3% | |
| essentialai | 1 | 0.3% | |
| Writer | 1 | 0.3% | |
| IBM | 1 | 0.3% | |
| Windsurf | 1 | 0.3% | |
| eleutherai | 1 | 0.3% | |
| Vercel | 1 | 0.3% | |
| JetBrains | 1 | 0.3% |
Providers are compared across multiple dimensions: average composite score across all models, best individual model score, stability rate (percentage of models in a stable state), 24-hour and 7-day momentum, model count, and availability of free models. Data is derived from hourly-updated rankings of 290+ tracked models.
The stability percentage represents the fraction of a provider's models that are in a "stable" ranking state. A higher percentage means the provider's models consistently maintain their positions over time, suggesting reliable and predictable performance across their lineup.
Market concentration is shown as each provider's share of the total tracked models. The visual bar chart and percentage table reveal how models are distributed among providers, helping identify whether the market is dominated by a few large players or spread across many competitors.