Track how AI model API pricing changes over time. Covering 367+ models across 59 providers, with weekly snapshots since 2026-02-16. For the dispersion view of how tightly frontier prices cluster, see the LMC Price Convergence Index.
Average input and output cost per 1M tokens across each provider's paid models.
| Provider | Models | Avg Input $/M | Avg Output $/M |
|---|---|---|---|
| IBM | 2 | $0.034 | $0.105 |
| Liquid AI | 1 | $0.030 | $0.120 |
| essentialai | 1 | $0.150 | $0.150 |
| rekaai | 2 | $0.100 | $0.150 |
| Meta | 12 | $0.190 | $0.287 |
| StepFun | 1 | $0.100 | $0.300 |
| NVIDIA | 4 | $0.070 | $0.302 |
| Microsoft | 3 | $0.255 | $0.370 |
| Tencent | 2 | $0.103 | $0.415 |
| Allen AI | 1 | $0.150 | $0.500 |
| Upstage | 1 | $0.150 | $0.600 |
| Baidu | 5 | $0.196 | $0.694 |
| Inception | 1 | $0.250 | $0.750 |
| DeepSeek | 13 | $0.347 | $0.882 |
| ByteDance | 5 | $0.155 | $0.980 |
| arcee-ai | 7 | $0.392 | $0.990 |
| Kuaishou | 1 | $0.300 | $1.20 |
| deepcogito | 1 | $1.25 | $1.25 |
| MiniMax | 7 | $0.271 | $1.26 |
| Alibaba | 50 | $0.258 | $1.34 |
Models that have decreased in price since tracking began.
| Model | Provider | Previous Output $/M | Current Output $/M | Change |
|---|---|---|---|---|
| DeepSeek V3.2 Speciale | DeepSeek | $1.20 | $0.431 | -64.1% |
| Grok 4.20 | xAI | $6.00 | $2.50 | -58.3% |
| Llama Guard 3 8B | Meta | $0.060 | $0.030 | -50.0% |
| Qwen3 Next 80B A3B Thinking | Alibaba | $1.20 | $0.780 | -35.0% |
| Qwen3 Max | Alibaba | $6.00 | $3.90 | -35.0% |
| Qwen-Plus | Alibaba | $1.20 | $0.780 | -35.0% |
| Qwen3.5-Flash | Alibaba | $0.400 | $0.260 | -35.0% |
| Qwen VL Max | Alibaba | $3.20 | $2.08 | -35.0% |
| Mistral Nemo | Mistral AI | $0.040 | $0.030 | -25.0% |
| Kimi K2.6 | Moonshot AI | $4.66 | $3.50 | -24.8% |
| Qwen3.5-35B-A3B | Alibaba | $1.30 | $1.00 | -23.1% |
| Gemma 4 26B A4B | $0.400 | $0.330 | -17.5% | |
| GLM 5 | Zhipu AI | $2.30 | $1.92 | -16.5% |
| Nemotron 3 Super | NVIDIA | $0.500 | $0.450 | -10.0% |
| Kimi K2.5 | Moonshot AI | $2.20 | $2.00 | -9.1% |
| Qwen2.5 VL 72B Instruct | Alibaba | $0.800 | $0.750 | -6.2% |
| DeepSeek V3.2 | DeepSeek | $0.400 | $0.378 | -5.5% |
| gpt-oss-120b | OpenAI | $0.190 | $0.180 | -5.3% |
| Gemma 4 31B | $0.400 | $0.380 | -5.0% | |
| MiniMax M2.5 | MiniMax | $1.20 | $1.15 | -4.2% |
| Qwen3.5-9B | Alibaba | $0.150 | $0.150 | 0.0% |
| Qwen3.6 35B A3B | Alibaba | $0.965 | $1.00 | 3.6% |
| Qwen3 Coder Next | Alibaba | $0.750 | $0.800 | 6.7% |
| Model | Provider | Previous Output $/M | Current Output $/M | Change |
|---|---|---|---|---|
| Llama 3.2 11B Vision Instruct | Meta | $0.049 | $0.245 | +400.0% |
| Qwen2.5 Coder 32B Instruct | Alibaba | $0.200 | $1.00 | +400.0% |
| Gemma 3n 4B | $0.040 | $0.120 | +200.0% | |
| Qwen3 235B A22B Thinking 2507 | Alibaba | $0.600 | $1.50 | +149.2% |
| Qwen3 Coder 480B A35B | Alibaba | $1.00 | $1.80 | +80.0% |
| Qwen3 30B A3B | Alibaba | $0.280 | $0.450 | +60.7% |
| QwQ 32B | Alibaba | $0.400 | $0.580 | +45.0% |
| Kimi K2 Thinking | Moonshot AI | $2.00 | $2.50 | +25.0% |
| DeepSeek V3.1 Terminus | DeepSeek | $0.790 | $0.950 | +20.3% |
| Qwen3 30B A3B Thinking 2507 | Alibaba | $0.340 | $0.400 | +17.6% |
| Qwen3 32B | Alibaba | $0.240 | $0.280 | +16.7% |
| GLM 5.1 | Zhipu AI | $3.15 | $3.50 | +11.1% |
| Mistral Small 3.2 24B | Mistral AI | $0.180 | $0.200 | +11.1% |
| Gemma 3 27B | $0.150 | $0.160 | +6.7% | |
| Qwen3 Coder Next | Alibaba | $0.750 | $0.800 | +6.7% |
| Kimi K2 0711 | Moonshot AI | $2.20 | $2.30 | +4.5% |
| Qwen3.6 35B A3B | Alibaba | $0.965 | $1.00 | +3.6% |
| MiMo-V2-Flash | Xiaomi | $0.290 | $0.300 | +3.4% |
| Qwen2.5 72B Instruct | Alibaba | $0.390 | $0.400 | +2.6% |
| MoonshotAI Kimi Latest | ~moonshotai | $3.49 | $3.50 | +0.3% |
| MiniMax M2.1 | MiniMax | $0.950 | $0.950 | +0.0% |
| R1 0528 | DeepSeek | $2.15 | $2.15 | +0.0% |
| Llama 3 8B Instruct | Meta | $0.040 | $0.040 | +0.0% |
| Phi 4 | Microsoft | $0.140 | $0.140 | +0.0% |
| GLM 4.7 | Zhipu AI | $1.75 | $1.75 | +0.0% |
AI model API pricing has been on a consistent downward trajectory since 2023. OpenAI's GPT-4 launched at $60/M output tokens; today, models with comparable capability cost under $5/M. This represents a 90%+ price reduction in under three years.
Key pricing trends observed across the industry:
Every Sunday at 23:00 UTC a cron job snapshots the full paid model catalog (currently 367 models across 59 providers) and writes a flat JSON file under data/weekly-snapshots/. The first snapshot in the table above was captured on 2026-02-16. Each row of every table on this page is sourced from those raw weekly files - no smoothing, no fills, no retroactive edits.
Most large providers adjust list pricing two to four times a year, typically alongside a new model generation. Mid-generation cuts of 50 to 90 percent are routine once a newer flagship ships at the same quality tier. The Recent Price Drops table above only counts changes greater than one percent, so cosmetic rounding does not show up as movement.
A handful of providers run experimental promotional pricing, particularly during model launches, and revert later. We treat every snapshot as authoritative for the week it was captured rather than smoothing those reversals away, so a temporary cut followed by a reversion shows as a drop in one week and an increase in another. This is intentional: it preserves the audit trail.
Google and DeepSeek have driven the steepest cuts in the dataset above, both by releasing flagship-tier reasoning models at fractions of incumbent pricing and by maintaining free tiers on Gemini Flash variants. Anthropic and OpenAI tend to cut older SKUs at the moment a new generation launches rather than adjusting the active flagship.
See /trackers/price-convergence-index. That page covers the LMC Price Convergence Index, which measures how tightly the frontier-tier price distribution clusters around its median in log space, with bias-corrected confidence intervals and a per-provider variance contribution breakdown. This page covers the per-model price trail: who moved when, and by how much.