AI Model Pricing History

Track how AI model API pricing changes over time. Covering 343+ models across 60 providers, with weekly snapshots since 2026-02-16. For the dispersion view of how tightly frontier prices cluster, see the LMC Price Convergence Index.

Models Tracked

343

Price Drops

Price Increases

Stable Pricing

357

Average Pricing by Provider (Current)

Average input and output cost per 1M tokens across each provider's paid models.

Provider	Models	Avg Input $/M	Avg Output $/M
IBM	2	$0.034	$0.106
Liquid AI	1	$0.030	$0.120
rekaai	2	$0.100	$0.150
Meta	10	$0.151	$0.285
poolside	2	$0.150	$0.300
Microsoft	3	$0.257	$0.370
Tencent	2	$0.102	$0.390
inclusionai	3	$0.053	$0.427
Allen AI	1	$0.150	$0.500
Xiaomi	2	$0.270	$0.575
Upstage	1	$0.150	$0.600
StepFun	2	$0.145	$0.725
arcee-ai	4	$0.386	$0.737
Inception	1	$0.250	$0.750
NVIDIA	4	$0.260	$0.813
DeepSeek	11	$0.355	$0.960
ByteDance	5	$0.155	$0.980
MiniMax	8	$0.267	$1.19
Kuaishou	1	$0.300	$1.20
Baidu	1	$0.420	$1.25

Recent Price Drops (49 models)

Models that have decreased in price since tracking began.

Model	Provider	Previous Output $/M	Current Output $/M	Change
Ling-2.6-flash	inclusionai	$0.240	$0.030	-87.5%
MiMo-V2.5	Xiaomi	$2.00	$0.280	-86.0%
Qwen3 235B A22B Thinking 2507	Alibaba	$0.600	$0.100	-83.3%
Ling-2.6-1T	inclusionai	$2.50	$0.625	-75.0%
MiMo-V2.5-Pro	Xiaomi	$3.00	$0.870	-71.0%
DeepSeek V3.2 Speciale	DeepSeek	$1.20	$0.431	-64.1%
Grok 4.20	xAI	$6.00	$2.50	-58.3%
Grok 4.20 Multi-Agent	xAI	$6.00	$2.50	-58.3%
Llama Guard 3 8B	Meta	$0.060	$0.030	-50.0%
Qwen3.7 Max	Alibaba	$7.50	$3.75	-50.0%
Llama 3.1 8B Instruct	Meta	$0.050	$0.030	-40.0%
DeepSeek V4 Flash	DeepSeek	$0.280	$0.180	-35.7%
Qwen3 30B A3B Instruct 2507	Alibaba	$0.300	$0.193	-35.6%
Qwen3 Next 80B A3B Thinking	Alibaba	$1.20	$0.780	-35.0%
Qwen3 Max	Alibaba	$6.00	$3.90	-35.0%
Qwen-Plus	Alibaba	$1.20	$0.780	-35.0%
Qwen3.5-Flash	Alibaba	$0.400	$0.260	-35.0%
Qwen VL Max	Alibaba	$3.20	$2.08	-35.0%
Kimi K2.6	Moonshot AI	$4.66	$3.41	-26.7%
MiniMax M2.5	MiniMax	$1.20	$0.900	-25.0%
Mistral Nemo	Mistral AI	$0.040	$0.030	-25.0%
Qwen3.5 Plus 2026-04-20	Alibaba	$2.40	$1.80	-25.0%
Qwen3.6 Flash	Alibaba	$1.50	$1.13	-25.0%
GLM 5.2	Zhipu AI	$4.00	$3.00	-25.0%
Qwen3.5-35B-A3B	Alibaba	$1.30	$1.00	-23.1%

Recent Price Increases (40 models)

Model	Provider	Previous Output $/M	Current Output $/M	Change
Llama 3.2 11B Vision Instruct	Meta	$0.049	$0.345	+604.1%
Qwen2.5 Coder 32B Instruct	Alibaba	$0.200	$1.00	+400.0%
Llama 3 8B Instruct	Meta	$0.040	$0.140	+250.0%
Google Gemini Flash Latest	~google	$3.00	$9.00	+200.0%
Gemma 3n 4B	Google	$0.040	$0.120	+200.0%
Qwen3 Coder 480B A35B	Alibaba	$1.00	$1.80	+80.0%
Qwen3 30B A3B	Alibaba	$0.280	$0.500	+78.6%
QwQ 32B	Alibaba	$0.400	$0.580	+45.0%
Qwen2.5 VL 72B Instruct	Alibaba	$0.800	$1.00	+25.0%
Kimi K2 Thinking	Moonshot AI	$2.00	$2.50	+25.0%
Kimi K2 0905	Moonshot AI	$2.00	$2.50	+25.0%
Gemma 3 4B	Google	$0.080	$0.100	+25.0%
DeepSeek V3.1 Terminus	DeepSeek	$0.790	$0.950	+20.3%
Qwen3 30B A3B Thinking 2507	Alibaba	$0.340	$0.400	+17.6%
Qwen3 32B	Alibaba	$0.240	$0.280	+16.7%
Gemma 3 12B	Google	$0.130	$0.150	+15.4%
Mistral Small 3.2 24B	Mistral AI	$0.180	$0.200	+11.1%
Gemma 3 27B	Google	$0.150	$0.160	+6.7%
Qwen3 Coder Next	Alibaba	$0.750	$0.800	+6.7%
DeepSeek V3.1	DeepSeek	$0.750	$0.790	+5.3%
Qwen3.5 397B A17B	Alibaba	$2.34	$2.45	+4.7%
Kimi K2 0711	Moonshot AI	$2.20	$2.30	+4.5%
Qwen3.6 35B A3B	Alibaba	$0.965	$1.00	+3.6%
MiMo-V2-Flash	Xiaomi	$0.290	$0.300	+3.4%
Qwen2.5 72B Instruct	Alibaba	$0.390	$0.400	+2.6%

AI API Pricing Trends

AI model API pricing has been on a consistent downward trajectory since 2023. OpenAI's GPT-4 launched at $60/M output tokens; today, models with comparable capability cost under $5/M. This represents a 90%+ price reduction in under three years.

Key pricing trends observed across the industry:

Race to the bottom: Competition between OpenAI, Anthropic, Google, and open-source providers drives continuous price cuts.
Tiered pricing: Providers now offer multiple tiers (mini/flash models) at 10-50x lower cost than flagship models.
Free tiers expanding: Google Gemini Flash, DeepSeek, and many open-source models available at zero cost through various providers.
Caching discounts: Prompt caching can reduce input costs by 50-90% for repeated context.

Explore More

Current Pricing Cost Calculator Cheapest Models API Pricing Guide Pricing Trends Tracker

How Benchmarks Work|LLM Parameters|Compare Models|Subscription Calculator|Cost Optimization

Frequently Asked Questions

Every Sunday at 23:00 UTC a cron job snapshots the full paid model catalog (currently 343 models across 60 providers) and writes a flat JSON file under data/weekly-snapshots/. The first snapshot in the table above was captured on 2026-02-16. Each row of every table on this page is sourced from those raw weekly files - no smoothing, no fills, no retroactive edits.

Most large providers adjust list pricing two to four times a year, typically alongside a new model generation. Mid-generation cuts of 50 to 90 percent are routine once a newer flagship ships at the same quality tier. The Recent Price Drops table above only counts changes greater than one percent, so cosmetic rounding does not show up as movement.

A handful of providers run experimental promotional pricing, particularly during model launches, and revert later. We treat every snapshot as authoritative for the week it was captured rather than smoothing those reversals away, so a temporary cut followed by a reversion shows as a drop in one week and an increase in another. This is intentional: it preserves the audit trail.

Google and DeepSeek have driven the steepest cuts in the dataset above, both by releasing flagship-tier reasoning models at fractions of incumbent pricing and by maintaining free tiers on Gemini Flash variants. Anthropic and OpenAI tend to cut older SKUs at the moment a new generation launches rather than adjusting the active flagship.

See /trackers/price-convergence-index. That page covers the LMC Price Convergence Index, which measures how tightly the frontier-tier price distribution clusters around its median in log space, with bias-corrected confidence intervals and a per-provider variance contribution breakdown. This page covers the per-model price trail: who moved when, and by how much.

Provider

Models

Avg Input $/M

Avg Output $/M

IBM

$0.034

$0.106

Liquid AI

$0.030

$0.120

rekaai

$0.100

$0.150

Meta

$0.151

$0.285

poolside

$0.150

$0.300

Microsoft

$0.257

$0.370

Tencent

$0.102

$0.390

inclusionai

$0.053

$0.427

Allen AI

$0.150

$0.500

Xiaomi

$0.270

$0.575

Upstage

$0.150

$0.600

StepFun

$0.145

$0.725

arcee-ai

$0.386

$0.737

Inception

$0.250

$0.750

NVIDIA

$0.260

$0.813

DeepSeek

$0.355

$0.960

ByteDance

$0.155

$0.980

MiniMax

$0.267

$1.19

Kuaishou

$0.300

$1.20

Baidu

$0.420

$1.25

Model

Provider

Previous Output $/M

Current Output $/M

Change

Ling-2.6-flash

inclusionai

$0.240

$0.030

-87.5%

MiMo-V2.5

Xiaomi

$2.00

$0.280

-86.0%

Qwen3 235B A22B Thinking 2507

Alibaba

$0.600

$0.100

-83.3%

Ling-2.6-1T

inclusionai

$2.50

$0.625

-75.0%

MiMo-V2.5-Pro

Xiaomi

$3.00

$0.870

-71.0%

DeepSeek V3.2 Speciale

DeepSeek

$1.20

$0.431

-64.1%

Grok 4.20

xAI

$6.00

$2.50

-58.3%

Grok 4.20 Multi-Agent

xAI

$6.00

$2.50

-58.3%

Llama Guard 3 8B

Meta

$0.060

$0.030

-50.0%

Qwen3.7 Max

Alibaba

$7.50

$3.75

-50.0%

Llama 3.1 8B Instruct

Meta

$0.050

$0.030

-40.0%

DeepSeek V4 Flash

DeepSeek

$0.280

$0.180

-35.7%

Qwen3 30B A3B Instruct 2507

Alibaba

$0.300

$0.193

-35.6%

Qwen3 Next 80B A3B Thinking

Alibaba

$1.20

$0.780

-35.0%

Qwen3 Max

Alibaba

$6.00

$3.90

-35.0%

Qwen-Plus

Alibaba

$1.20

$0.780

-35.0%

Qwen3.5-Flash

Alibaba

$0.400

$0.260

-35.0%

Qwen VL Max

Alibaba

$3.20

$2.08

-35.0%

Kimi K2.6

Moonshot AI

$4.66

$3.41

-26.7%

MiniMax M2.5

MiniMax

$1.20

$0.900

-25.0%

Mistral Nemo

Mistral AI

$0.040

$0.030

-25.0%

Qwen3.5 Plus 2026-04-20

Alibaba

$2.40

$1.80

-25.0%

Qwen3.6 Flash

Alibaba

$1.50

$1.13

-25.0%

GLM 5.2

Zhipu AI

$4.00

$3.00

-25.0%

Qwen3.5-35B-A3B

Alibaba

$1.30

$1.00

-23.1%

Model

Provider

Previous Output $/M

Current Output $/M

Change

Llama 3.2 11B Vision Instruct

Meta

$0.049

$0.345

+604.1%

Qwen2.5 Coder 32B Instruct

Alibaba

$0.200

$1.00

+400.0%

Llama 3 8B Instruct

Meta

$0.040

$0.140

+250.0%

Google Gemini Flash Latest

~google

$3.00

$9.00

+200.0%

Gemma 3n 4B

Google

$0.040

$0.120

+200.0%

Qwen3 Coder 480B A35B

Alibaba

$1.00

$1.80

+80.0%

Qwen3 30B A3B

Alibaba

$0.280

$0.500

+78.6%

QwQ 32B

Alibaba

$0.400

$0.580

+45.0%

Qwen2.5 VL 72B Instruct

Alibaba

$0.800

$1.00

+25.0%

Kimi K2 Thinking

Moonshot AI

$2.00

$2.50

+25.0%

Kimi K2 0905

Moonshot AI

$2.00

$2.50

+25.0%

Gemma 3 4B

Google

$0.080

$0.100

+25.0%

DeepSeek V3.1 Terminus

DeepSeek

$0.790

$0.950

+20.3%

Qwen3 30B A3B Thinking 2507

Alibaba

$0.340

$0.400

+17.6%

Qwen3 32B

Alibaba

$0.240

$0.280

+16.7%

Gemma 3 12B

Google

$0.130

$0.150

+15.4%

Mistral Small 3.2 24B

Mistral AI

$0.180

$0.200

+11.1%

Gemma 3 27B

Google

$0.150

$0.160

+6.7%

Qwen3 Coder Next

Alibaba

$0.750

$0.800

+6.7%

DeepSeek V3.1

DeepSeek

$0.750

$0.790

+5.3%

Qwen3.5 397B A17B

Alibaba

$2.34

$2.45

+4.7%

Kimi K2 0711

Moonshot AI

$2.20

$2.30

+4.5%

Qwen3.6 35B A3B

Alibaba

$0.965

$1.00

+3.6%

MiMo-V2-Flash

Xiaomi

$0.290

$0.300

+3.4%

Qwen2.5 72B Instruct

Alibaba

$0.390

$0.400

+2.6%

AI API Pricing Trends

Key pricing trends observed across the industry:

Race to the bottom: Competition between OpenAI, Anthropic, Google, and open-source providers drives continuous price cuts.
Tiered pricing: Providers now offer multiple tiers (mini/flash models) at 10-50x lower cost than flagship models.
Free tiers expanding: Google Gemini Flash, DeepSeek, and many open-source models available at zero cost through various providers.
Caching discounts: Prompt caching can reduce input costs by 50-90% for repeated context.