Best AI Models 2026

The definitive ranking of the top AI models in 2026. Our composite scoring system evaluates 343+ models across performance benchmarks, pricing, context window, capabilities, and recency. Rankings update hourly with live data.

Top 10 AI Models Overall

Claude Fable 5 by Anthropic

Claude Fable 5 is a Mythos-class model from Anthropic, built for autonomous knowledge work and coding. It supports text, image, and file inputs with text output, with reasoning support and...

97 pts | Context: 1M | Output: $50.00/M | 6/7 capabilities

Claude Opus 4.7 (Fast) by Anthropic

Fast-mode variant of [Opus 4.7](/anthropic/claude-opus-4.7) - identical capabilities with higher output speed at premium 6x pricing. Learn more in Anthropic's docs: https://platform.claude.com/docs/en/build-with-claude/fast-mode

95 pts | Context: 1M | Output: $150.00/M | 6/7 capabilities

Claude Opus 4.7 by Anthropic

Opus 4.7 is the next generation of Anthropic's Opus family, built for long-running, asynchronous agents. Building on the coding and agentic strengths of Opus 4.6, it delivers stronger performance on...

95 pts | Context: 1M | Output: $25.00/M | 6/7 capabilities

Claude Opus 4.8 (Fast) by Anthropic

Fast-mode variant of [Opus 4.8](/anthropic/claude-opus-4.8) - identical capabilities with higher output speed at 2x pricing relative to regular Opus 4.8. Learn more in Anthropic's docs: https://platform.claude.com/docs/en/build-with-claude/fast-mode

94 pts | Context: 1M | Output: $50.00/M | 6/7 capabilities

Claude Opus 4.8 by Anthropic

Claude Opus 4.8 is Anthropic's most capable generally available model in the Opus family. It supports text, image, and file inputs with text output, with reasoning support and a 1M-token...

94 pts | Context: 1M | Output: $25.00/M | 6/7 capabilities

GPT-5.5 by OpenAI

GPT-5.5 is OpenAI’s frontier model designed for complex professional workloads, building on GPT-5.4 with stronger reasoning, higher reliability, and improved token efficiency on hard tasks. It features a 1M+ token...

92 pts | Context: 1.1M | Output: $30.00/M | 6/7 capabilities

Gemini 3.1 Pro Preview Custom Tools by Google

Gemini 3.1 Pro Preview Custom Tools is a variant of Gemini 3.1 Pro that improves tool selection behavior by preventing overuse of a general bash tool when more efficient third-party...

92 pts | Context: 1.0M | Output: $12.00/M | 6/7 capabilities

Gemini 3.1 Pro Preview by Google

Gemini 3.1 Pro Preview is Google’s frontier reasoning model, delivering enhanced software engineering performance, improved agentic reliability, and more efficient token usage across complex workflows. Building on the multimodal foundation...

92 pts | Context: 1.0M | Output: $12.00/M | 6/7 capabilities

GPT-5.4 Pro by OpenAI

GPT-5.4 Pro is OpenAI's most advanced model, building on GPT-5.4's unified architecture with enhanced reasoning capabilities for complex, high-stakes tasks. It features a 1M+ token context window (922K input, 128K...

92 pts | Context: 1.1M | Output: $180.00/M | 6/7 capabilities

GPT-5.4 by OpenAI

GPT-5.4 is OpenAI’s latest frontier model, unifying the Codex and GPT lines into a single system. It features a 1M+ token context window (922K input, 128K output) with support for...

92 pts | Context: 1.1M | Output: $15.00/M | 6/7 capabilities

Best in Category

Our top picks across different use cases and requirements for 2026.

Best for Coding: Top composite score

Claude Fable 5

Anthropic

97 composite score

1M context / $50.00/M output

Best Free: No API costs

Gemma 4 31B (free)

Google

80 composite score

262K context / Free output

Best Open Source: Weights available

DeepSeek V4 Pro

DeepSeek

86 composite score

1.0M context / $0.87/M output

Best Budget: Under $1/M tokens

DeepSeek V4 Pro

DeepSeek

86 composite score

1.0M context / $0.87/M output

Best for Reasoning: Chain-of-thought

Claude Fable 5

Anthropic

97 composite score

1M context / $50.00/M output

Best for Agents: Tools + JSON + streaming

Claude Fable 5

Anthropic

97 composite score

1M context / $50.00/M output
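
The category picks above can be read as "highest composite score among models that pass a filter". The following is a minimal sketch of that idea, not the site's actual selection logic: the `Model` class, the `best` helper, and the exact predicates are illustrative assumptions, and only three rows from this page are included.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    name: str
    score: int
    output_price: float  # USD per 1M output tokens; 0.0 means free


# Three rows copied from this page (not the full catalogue)
MODELS = [
    Model("Claude Fable 5", 97, 50.00),
    Model("DeepSeek V4 Pro", 86, 0.87),
    Model("Gemma 4 31B (free)", 80, 0.0),
]


def best(models, predicate):
    """Return the highest-scoring model that satisfies predicate, or None."""
    candidates = [m for m in models if predicate(m)]
    return max(candidates, key=lambda m: m.score, default=None)


# Assumed filters: "free" means zero output price, "budget" means under $1/M
best_free = best(MODELS, lambda m: m.output_price == 0.0)
best_budget = best(MODELS, lambda m: m.output_price < 1.00)
print(best_free.name, "|", best_budget.name)
```

On these three rows the sketch selects Gemma 4 31B (free) as the best free model and DeepSeek V4 Pro as the best budget model, matching the picks listed above.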

Full Top 30 Rankings

Top 30 AI Models by Composite Score

#	Model	Provider	Score	Context	Output $/1M	Reasoning	Tools
1	Claude Fable 5	Anthropic	97	1M	$50.00
2	Claude Opus 4.7 (Fast)	Anthropic	95	1M	$150.00
3	Claude Opus 4.7	Anthropic	95	1M	$25.00
4	Claude Opus 4.8 (Fast)	Anthropic	94	1M	$50.00
5	Claude Opus 4.8	Anthropic	94	1M	$25.00
6	GPT-5.5	OpenAI	92	1.1M	$30.00
7	Gemini 3.1 Pro Preview Custom Tools	Google	92	1.0M	$12.00
8	Gemini 3.1 Pro Preview	Google	92	1.0M	$12.00
9	GPT-5.4 Pro	OpenAI	92	1.1M	$180.00
10	GPT-5.4	OpenAI	92	1.1M	$15.00
11	GPT-5.5 Pro	OpenAI	90	1.1M	$180.00
12	GPT-5.2-Codex	OpenAI	90	400K	$14.00
13	GPT-5.2 Pro	OpenAI	90	400K	$168.00
14	GPT-5.2	OpenAI	90	400K	$14.00
15	Claude Opus 4.6 (Fast)	Anthropic	90	1M	$150.00
16	Claude Opus 4.6	Anthropic	90	1M	$25.00
17	Grok 4.20	xAI	88	2M	$2.50
18	GPT-5.3-Codex	OpenAI	88	400K	$14.00
19	GPT-5 Pro	OpenAI	88	400K	$120.00
20	GPT-5 Codex	OpenAI	88	400K	$10.00
21	GPT-5	OpenAI	88	400K	$10.00
22	Gemini 3 Flash Preview	Google	88	1.0M	$3.00
23	Grok 4.20 Multi-Agent	xAI	87	2M	$2.50		—
24	GPT-5.1-Codex-Max	OpenAI	87	400K	$10.00
25	GPT-5.1	OpenAI	87	400K	$10.00
26	GPT-5.1-Codex	OpenAI	87	400K	$10.00
27	GPT-5.1-Codex-Mini	OpenAI	87	400K	$2.00
28	GPT-5.3 Chat	OpenAI	87	128K	$14.00	—
29	o3 Deep Research	OpenAI	86	200K	$40.00
30	o3 Pro	OpenAI	86	200K	$80.00

New AI Models Released in 2026

119 models have been released in 2026 so far. Here are the latest arrivals.

2026 Model Releases

Model	Provider	Released	Score	Output $/1M
Fugu Ultra	sakana	Jun 24	—	$30.00
Nano Banana 2 (Gemini 3.1 Flash Image)	Google	Jun 18	—	$3.00
Nano Banana Pro (Gemini 3 Pro Image)	Google	Jun 18	—	$12.00
North Mini Code (free)	Cohere	Jun 17	—	Free
GLM 5.2	Zhipu AI	Jun 16	—	$3.00
Kimi K2.7 Code	Moonshot AI	Jun 12	—	$3.50
Claude Fable Latest	~anthropic	Jun 9	—	$50.00
Claude Fable 5	Anthropic	Jun 9	97	$50.00
Nemotron 3.5 Content Safety (free)	NVIDIA	Jun 4	—	Free
Nemotron 3 Ultra (free)	NVIDIA	Jun 4	—	Free
Nemotron 3 Ultra	NVIDIA	Jun 4	—	$2.20
Qwen3.7 Plus	Alibaba	Jun 3	—	$1.28
MiniMax M3	MiniMax	May 31	—	$1.20
Step 3.7 Flash	StepFun	May 28	—	$1.15
Claude Opus 4.8 (Fast)	Anthropic	May 27	94	$50.00
Claude Opus 4.8	Anthropic	May 27	94	$25.00
Qwen3.7 Max	Alibaba	May 21	—	$3.75
Grok Build 0.1	xAI	May 20	—	$2.00
Gemini 3.5 Flash	Google	May 19	—	$9.00
Claude Opus 4.7 (Fast)	Anthropic	May 12	95	$150.00

How We Rank AI Models

Composite Score (0-100)

Every model receives a score from 0 to 100. Benchmark performance on MMLU, GPQA, HumanEval, SWE-bench, and 15+ other standardized evaluations accounts for 90% of the score; capabilities and context window serve as tiebreakers for the remaining 10%.
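
A minimal sketch of how a 90/10 weighting could be combined, assuming every input has already been normalized to a 0-100 scale. Only the two weights come from this page. The function name, the equal-weight averaging of benchmarks, and the even split of the tiebreaker between capabilities and context window are illustrative assumptions, since the exact formula is not stated here.

```python
from statistics import mean

BENCHMARK_WEIGHT = 0.90  # stated in the methodology
TIEBREAK_WEIGHT = 0.10   # stated in the methodology


def composite_score(benchmarks, capability_pct, context_pct):
    """Combine normalized inputs (each 0-100) into a rounded 0-100 score.

    Assumptions (not from the source): benchmarks are averaged with equal
    weight, and the tiebreaker is the mean of the capability and context terms.
    """
    benchmark_term = mean(benchmarks)
    tiebreak_term = mean([capability_pct, context_pct])
    score = BENCHMARK_WEIGHT * benchmark_term + TIEBREAK_WEIGHT * tiebreak_term
    return round(score)


# Hypothetical inputs, for illustration only
print(composite_score([96.0, 98.0, 97.0], capability_pct=6 / 7 * 100, context_pct=100.0))
```

With a 90% weight on benchmarks, the tiebreaker term can move the result by at most 10 points, so it mostly separates models whose benchmark averages are close.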

Live Data Pipeline

Rankings update hourly from live API data. We track pricing changes, new model releases, and capability updates across all major providers. No stale benchmarks or manual curation.

Capability Assessment

We evaluate 7 core capabilities: vision, function calling, streaming, JSON mode, reasoning, web search, and image output. Models that support more capabilities score higher on versatility.
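
The "6/7 capabilities" label shown on the model cards can be thought of as a simple count over these seven flags. A minimal sketch, assuming capabilities are stored as a set of string identifiers; the snake_case names and the `capability_summary` helper are hypothetical, not the site's schema.

```python
# The seven core capabilities named above, as assumed identifiers
CORE_CAPABILITIES = (
    "vision", "function_calling", "streaming", "json_mode",
    "reasoning", "web_search", "image_output",
)


def capability_summary(supported):
    """Return (count, label) for the core capabilities a model supports.

    Names outside CORE_CAPABILITIES are ignored.
    """
    count = sum(1 for cap in CORE_CAPABILITIES if cap in supported)
    return count, f"{count}/{len(CORE_CAPABILITIES)} capabilities"


# Hypothetical model that supports everything except image output
example = {"vision", "function_calling", "streaming", "json_mode", "reasoning", "web_search"}
print(capability_summary(example)[1])
```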

Pricing & Value

Price is not the only factor. We balance cost against capability to surface the best value at every price point, from free open-source models to premium frontier models.

Provider Overview

Which AI providers dominate the top 30 in 2026.

Providers with Models in Top 30

Provider	In Top 30	Total Models	Best Rank	Top Model
OpenAI	18	64	#6	GPT-5.5
Anthropic	7	15	#1	Claude Fable 5
Google	3	30	#7	Gemini 3.1 Pro Preview Custom Tools
xAI	2	4	#17	Grok 4.20
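
The per-provider counts and best ranks in this table can be derived from a ranked list with a small aggregation. A minimal sketch, assuming rows are available as (rank, provider) pairs; the `provider_overview` helper is hypothetical, and the sample uses only the first ten rows of the Top 30 table, so its counts are smaller than the Top 30 figures above.

```python
from collections import Counter

# (rank, provider) pairs for the first ten rows of the Top 30 table on this page
TOP_ROWS = [
    (1, "Anthropic"), (2, "Anthropic"), (3, "Anthropic"), (4, "Anthropic"),
    (5, "Anthropic"), (6, "OpenAI"), (7, "Google"), (8, "Google"),
    (9, "OpenAI"), (10, "OpenAI"),
]


def provider_overview(rows):
    """Return {provider: (models_in_list, best_rank)} for ranked rows."""
    counts = Counter(provider for _, provider in rows)
    best_rank = {}
    for rank, provider in rows:
        best_rank[provider] = min(rank, best_rank.get(provider, rank))
    return {p: (counts[p], best_rank[p]) for p in counts}


# Print providers ordered by their best rank
overview = provider_overview(TOP_ROWS)
for provider, (n, rank) in sorted(overview.items(), key=lambda kv: kv[1][1]):
    print(f"{provider}: {n} models, best rank #{rank}")
```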

Frequently Asked Questions

The best AI model depends on your use case. For coding, models with strong SWE-bench scores lead. For general reasoning, models with high Arena Elo ratings excel. For budget-friendly options, open-source models offer strong performance at low or no cost. Our leaderboard ranks all 343+ models across multiple dimensions.

We use a composite scoring system that weights benchmark performance at 90%, drawing on MMLU, GPQA, HumanEval, SWE-bench, and 15+ other standardized evaluations, with capabilities and context window as tiebreakers (10%). Benchmark results therefore drive the ranking, and the tiebreakers separate models that score similarly.

Check our coding leaderboard for the latest rankings. Top coding models are evaluated on SWE-bench, HumanEval, and real-world coding tasks. The ranking updates hourly as new models are released and benchmarks are refreshed.