A comprehensive comparison of AI API pricing across 343+ models from 60+ providers. Understand token costs, compare price tiers, and estimate real-world expenses for LLM API usage in 2026.
How 343 models break down across pricing tiers (based on output cost per 1M tokens).
| Provider | Models | Avg Input $/1M | Avg Output $/1M |
|---|---|---|---|
| TII | 7 | Free | Free |
| Liquid AI | 3 | $0.010 | $0.040 |
| IBM | 2 | $0.034 | $0.106 |
| poolside | 4 | $0.075 | $0.150 |
| rekaai | 2 | $0.100 | $0.150 |
| Meta | 12 | $0.126 | $0.238 |
| NVIDIA | 11 | $0.095 | $0.295 |
| Microsoft | 3 | $0.257 | $0.370 |
| Tencent | 2 | $0.102 | $0.390 |
| inclusionai | 3 | $0.053 | $0.427 |
| HUMAIN | 4 | $0.450 | $0.450 |
| Xiaomi | 2 | $0.270 | $0.575 |
| StepFun | 2 | $0.145 | $0.725 |
| arcee-ai | 4 | $0.386 | $0.737 |
| DeepSeek | 11 | $0.355 | $0.960 |
| ByteDance | 5 | $0.155 | $0.980 |
| MiniMax | 9 | $0.237 | $1.06 |
| Alibaba | 49 | $0.266 | $1.28 |
| Mistral AI | 19 | $0.573 | $1.93 |
| Zhipu AI | 12 | $0.621 | $2.14 |
| xAI | 4 | $1.19 | $2.38 |
| Moonshot AI | 6 | $0.591 | $2.71 |
| aion-labs | 3 | $1.83 | $3.67 |
| Amazon | 5 | $0.739 | $3.72 |
| Cohere | 5 | $1.04 | $4.15 |
| Cursor | 2 | $1.00 | $5.00 |
| Perplexity | 5 | $2.20 | $9.40 |
| Inflection | 2 | $2.50 | $10.00 |
| ~google | 2 | $1.75 | $10.50 |
| ~openai | 2 | $2.88 | $17.25 |
| ~anthropic | 4 | $4.75 | $23.75 |
| Anthropic | 15 | $9.35 | $46.75 |
| OpenAI | 64 | $6.70 | $655.05 |
| Google | 30 | $0.571 | $13003.56 |
| Stability AI | 2 | Free | $17500.00 |
| Kuaishou | 2 | $0.150 | $35000.60 |
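The very large averages at the bottom of this table appear to come from image and video models, whose listed output price runs to tens of thousands of dollars, being averaged together with token-priced models. A minimal sketch, assuming a plain arithmetic mean with "Free" counted as $0 (an assumption, though it reproduces the Kuaishou and Stability AI rows from the per-model prices listed further down), illustrates the effect. The helper name `average_price` is hypothetical.

```python
from statistics import mean

def average_price(prices):
    """Arithmetic mean of per-model prices in $ per 1M tokens.

    Assumption: a "Free" entry is represented as 0.0.
    """
    return mean(prices)

# Output prices copied from the per-model table.
kuaishou_output = [1.20, 70000.00]    # KAT-Coder-Pro V2, Kling 1.6
stability_output = [0.00, 35000.00]   # Stable Video Diffusion (Free), Stable Diffusion 3.5

print(f"Kuaishou avg output:     ${average_price(kuaishou_output):.2f}")
print(f"Stability AI avg output: ${average_price(stability_output):.2f}")
```

Read this way, a provider average says little about what a typical text model from that provider costs; the per-model rows are the safer reference.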
104 scored models fall under $1/1M output tokens with an average composite score of 53. The top-scoring budget model is DeepSeek V4 Pro at score 86 for just $0.870/1M output.
35 scored models cost $15 or more per 1M output tokens, with an average score of 74. Premium models therefore score about 40% higher on average than budget models (74 versus 53), but cost at least 15 times more per output token. The trade-off is meaningful for high-stakes tasks such as code generation and complex reasoning.
Key insight: Price and quality correlate, but diminishing returns are significant. Many models in the $1 to $5 range deliver 80 to 90% of the performance of premium models at a fraction of the cost. For most production workloads, mid-range models offer the best balance.
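The size of the trade-off can be checked with a few lines of arithmetic. This is only a sketch: the scores (53 and 74) and the tier boundaries ($1 and $15 per 1M output tokens) come from the text above, while the function names are hypothetical.

```python
def relative_gain(baseline: float, improved: float) -> float:
    """Fractional improvement of `improved` over `baseline`."""
    return (improved - baseline) / baseline

def price_ratio(cheaper: float, pricier: float) -> float:
    """How many times more expensive `pricier` is than `cheaper`."""
    return pricier / cheaper

# Average composite scores for the budget and premium tiers.
score_gain = relative_gain(53, 74)

# Tier boundaries: budget is under $1, premium is $15 or more,
# so any premium model costs at least this multiple of any budget model.
min_ratio = price_ratio(1.00, 15.00)

print(f"score gain: {score_gain:.1%}, price ratio: at least {min_ratio:.0f}x")
```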
| Model | Provider | Input $/1M | Output $/1M |
|---|---|---|---|
| North Mini Code (free) | Cohere | Free | Free |
| Nemotron 3.5 Content Safety (free) | NVIDIA | Free | Free |
| Nemotron 3 Ultra (free) | NVIDIA | Free | Free |
| Nemotron 3 Nano Omni (free) | NVIDIA | Free | Free |
| Laguna XS.2 (free) | poolside | Free | Free |
| Laguna M.1 (free) | poolside | Free | Free |
| Gemma 4 26B A4B (free) | Google | Free | Free |
| Gemma 4 31B (free) | Google | Free | Free |
| Lyria 3 Pro Preview | Google | Free | Free |
| Lyria 3 Clip Preview | Google | Free | Free |
| Nemotron 3 Super (free) | NVIDIA | Free | Free |
| LFM2.5-1.2B-Thinking (free) | Liquid AI | Free | Free |
| LFM2.5-1.2B-Instruct (free) | Liquid AI | Free | Free |
| Nemotron 3 Nano 30B A3B (free) | NVIDIA | Free | Free |
| Nemotron Nano 12B 2 VL (free) | NVIDIA | Free | Free |
| Qwen3 Next 80B A3B Instruct (free) | Alibaba | Free | Free |
| Nemotron Nano 9B V2 (free) | NVIDIA | Free | Free |
| gpt-oss-120b (free) | OpenAI | Free | Free |
| gpt-oss-20b (free) | OpenAI | Free | Free |
| Qwen3 Coder 480B A35B (free) | Alibaba | Free | Free |
| Llama 3.3 70B Instruct (free) | Meta | Free | Free |
| Llama 3.2 3B Instruct (free) | Meta | Free | Free |
| Midjourney v6.1 | Midjourney | Free | Free |
| Adobe Firefly 3 | Adobe | Free | Free |
| Leonardo Phoenix | Leonardo AI | Free | Free |
| Sora | OpenAI | Free | Free |
| Runway Gen-3 Alpha | Runway | Free | Free |
| Pika 2.0 | Pika | Free | Free |
| MiniMax Video-01 | MiniMax | Free | Free |
| Luma Dream Machine | Luma AI | Free | Free |
| Stable Video Diffusion | Stability AI | Free | Free |
| Wan 2.1 T2V | Wan AI | Free | Free |
| LTX-Video 2 | Lightricks | Free | Free |
| SWE-1.5 | Windsurf | Free | Free |
| autofixer-01 | Vercel | Free | Free |
| Mellum | JetBrains | Free | Free |
| ALLaM 7B Instruct (preview) | HUMAIN | Free | Free |
| ALLaM 2 7B Instruct | HUMAIN | Free | Free |
| ALLaM 34B | HUMAIN | Free | Free |
| Falcon-H1-Arabic 34B Instruct | TII | Free | Free |
| Falcon-H1-Arabic 7B Instruct | TII | Free | Free |
| Falcon-H1-Arabic 3B Instruct | TII | Free | Free |
| Falcon Arabic 7B Instruct | TII | Free | Free |
| Falcon3 10B Instruct | TII | Free | Free |
| Falcon3 7B Instruct | TII | Free | Free |
| Falcon Mamba 7B Instruct | TII | Free | Free |
| Ling-2.6-flash | inclusionai | $0.010 | $0.030 |
| Llama 3.1 8B Instruct | Meta | $0.020 | $0.030 |
| Mistral Nemo | Mistral AI | $0.020 | $0.030 |
| Mistral Small 3 | Mistral AI | $0.050 | $0.080 |
| Granite 4.1 8B | IBM | $0.050 | $0.100 |
| Reka Edge | rekaai | $0.100 | $0.100 |
| Ministral 3 3B 2512 | Mistral AI | $0.100 | $0.100 |
| Qwen3 235B A22B Thinking 2507 | Alibaba | $0.100 | $0.100 |
| Qwen3 235B A22B Instruct 2507 | Alibaba | $0.090 | $0.100 |
| Gemma 3 4B | Google | $0.050 | $0.100 |
| Qwen2.5 7B Instruct | Alibaba | $0.040 | $0.100 |
| Granite 4.0 Micro | IBM | $0.017 | $0.112 |
| LFM2-24B-A2B | Liquid AI | $0.030 | $0.120 |
| Gemma 3n 4B | Google | $0.060 | $0.120 |
| gpt-oss-20b | OpenAI | $0.029 | $0.140 |
| Phi 4 | Microsoft | $0.070 | $0.140 |
| Nova Micro 1.0 | Amazon | $0.035 | $0.140 |
| Llama 3 8B Instruct | Meta | $0.140 | $0.140 |
| Qwen3.5-9B | Alibaba | $0.100 | $0.150 |
| Ministral 3 8B 2512 | Mistral AI | $0.150 | $0.150 |
| Trinity Mini | arcee-ai | $0.045 | $0.150 |
| Gemma 3 12B | Google | $0.050 | $0.150 |
| Command R7B (12-2024) | Cohere | $0.037 | $0.150 |
| Gemma 3 27B | Google | $0.080 | $0.160 |
| DeepSeek V4 Flash | DeepSeek | $0.090 | $0.180 |
| gpt-oss-120b | OpenAI | $0.039 | $0.180 |
| Llama Guard 4 12B | Meta | $0.180 | $0.180 |
| Qwen3 30B A3B Instruct 2507 | Alibaba | $0.048 | $0.193 |
| Laguna XS.2 | poolside | $0.100 | $0.200 |
| Nemotron 3 Nano 30B A3B | NVIDIA | $0.050 | $0.200 |
| Ministral 3 14B 2512 | Mistral AI | $0.200 | $0.200 |
| UI-TARS 7B | ByteDance | $0.100 | $0.200 |
| Mistral Small 3.2 24B | Mistral AI | $0.075 | $0.200 |
| Reka Flash 3 | rekaai | $0.100 | $0.200 |
| Llama 3.2 1B Instruct | Meta | $0.027 | $0.201 |
| Hy3 preview | Tencent | $0.063 | $0.210 |
| Qwen3 14B | Alibaba | $0.100 | $0.240 |
| Nova Lite 1.0 | Amazon | $0.060 | $0.240 |
| Qwen3.5-Flash | Alibaba | $0.065 | $0.260 |
| Qwen3 Coder 30B A3B Instruct | Alibaba | $0.070 | $0.270 |
| MiMo-V2.5 | Xiaomi | $0.105 | $0.280 |
| Qwen3 32B | Alibaba | $0.080 | $0.280 |
| Step 3.5 Flash | StepFun | $0.090 | $0.300 |
| Seed 1.6 Flash | ByteDance | $0.075 | $0.300 |
| Voxtral Small 24B 2507 | Mistral AI | $0.100 | $0.300 |
| gpt-oss-safeguard-20b | OpenAI | $0.075 | $0.300 |
| Llama 4 Scout | Meta | $0.100 | $0.300 |
| Llama 3.3 70B Instruct | Meta | $0.100 | $0.320 |
| Gemma 4 26B A4B | Google | $0.060 | $0.330 |
| Llama 3.2 3B Instruct | Meta | $0.051 | $0.335 |
| DeepSeek V3.2 | DeepSeek | $0.229 | $0.343 |
| Llama 3.2 11B Vision Instruct | Meta | $0.345 | $0.345 |
| Gemma 4 31B | Google | $0.120 | $0.350 |
| Phi 4 Mini Instruct | Microsoft | $0.080 | $0.350 |
| Laguna M.1 | poolside | $0.200 | $0.400 |
| Seed-2.0-Mini | ByteDance | $0.100 | $0.400 |
| GLM 4.7 Flash | Zhipu AI | $0.060 | $0.400 |
| Llama 3.3 Nemotron Super 49B V1.5 | NVIDIA | $0.400 | $0.400 |
| Gemini 2.5 Flash Lite Preview 09-2025 | Google | $0.100 | $0.400 |
| Qwen3 30B A3B Thinking 2507 | Alibaba | $0.080 | $0.400 |
| GPT-5 Nano | OpenAI | $0.050 | $0.400 |
| Gemini 2.5 Flash Lite | Google | $0.100 | $0.400 |
| Qwen3 8B | Alibaba | $0.050 | $0.400 |
| GPT-4.1 Nano | OpenAI | $0.100 | $0.400 |
| Qwen2.5 72B Instruct | Alibaba | $0.360 | $0.400 |
| Llama 3.1 70B Instruct | Meta | $0.400 | $0.400 |
| DeepSeek V3.2 Exp | DeepSeek | $0.270 | $0.410 |
| Qwen3 VL 32B Instruct | Alibaba | $0.104 | $0.416 |
| Nemotron 3 Super | NVIDIA | $0.090 | $0.450 |
| Olmo 3 32B Think | Allen AI | $0.150 | $0.500 |
| Qwen3 VL 8B Instruct | Alibaba | $0.080 | $0.500 |
| Qwen3 30B A3B | Alibaba | $0.120 | $0.500 |
| Qwen3 VL 30B A3B Instruct | Alibaba | $0.130 | $0.520 |
| Mistral Small 3.1 24B | Mistral AI | $0.351 | $0.555 |
| Hunyuan A13B Instruct | Tencent | $0.140 | $0.570 |
| Mistral Small 4 | Mistral AI | $0.150 | $0.600 |
| Solar Pro 3 | Upstage | $0.150 | $0.600 |
| Llama 4 Maverick | Meta | $0.150 | $0.600 |
| GPT-4o-mini Search Preview | OpenAI | $0.150 | $0.600 |
| Saba | Mistral AI | $0.200 | $0.600 |
| Command R (08-2024) | Cohere | $0.150 | $0.600 |
| GPT-4o-mini (2024-07-18) | OpenAI | $0.150 | $0.600 |
| GPT-4o-mini | OpenAI | $0.150 | $0.600 |
| WizardLM-2 8x22B | Microsoft | $0.620 | $0.620 |
| Ring-2.6-1T | inclusionai | $0.075 | $0.625 |
| Ling-2.6-1T | inclusionai | $0.075 | $0.625 |
| Gemma 2 27B | Google | $0.650 | $0.650 |
| Mercury 2 | Inception | $0.250 | $0.750 |
| DeepSeek V3 0324 | DeepSeek | $0.200 | $0.770 |
| Qwen3 Next 80B A3B Thinking | Alibaba | $0.098 | $0.780 |
| Qwen Plus 0728 (thinking) | Alibaba | $0.260 | $0.780 |
| Qwen Plus 0728 | Alibaba | $0.260 | $0.780 |
| Qwen-Plus | Alibaba | $0.260 | $0.780 |
| DeepSeek V3.1 | DeepSeek | $0.210 | $0.790 |
| Trinity Large Thinking | arcee-ai | $0.250 | $0.800 |
| Qwen3 Coder Next | Alibaba | $0.110 | $0.800 |
| Coder Large | arcee-ai | $0.500 | $0.800 |
| R1 Distill Llama 70B | DeepSeek | $0.800 | $0.800 |
| DeepSeek V3 | DeepSeek | $0.200 | $0.800 |
| GLM 4.5 Air | Zhipu AI | $0.130 | $0.850 |
| DeepSeek V4 Pro | DeepSeek | $0.435 | $0.870 |
| MiMo-V2.5-Pro | Xiaomi | $0.435 | $0.870 |
| Qwen3 VL 235B A22B Instruct | Alibaba | $0.200 | $0.880 |
| MiniMax M2.5 | MiniMax | $0.150 | $0.900 |
| GLM 4.6V | Zhipu AI | $0.300 | $0.900 |
| Codestral 2508 | Mistral AI | $0.300 | $0.900 |
| MiniMax M2.1 | MiniMax | $0.290 | $0.950 |
| DeepSeek V3.1 Terminus | DeepSeek | $0.270 | $0.950 |
| MiniMax M2.7 | MiniMax | $0.240 | $0.960 |
| Qwen3 Coder Flash | Alibaba | $0.195 | $0.975 |
| Qwen3.6 35B A3B | Alibaba | $0.140 | $1.00 |
| Qwen3.5-35B-A3B | Alibaba | $0.140 | $1.00 |
| MiniMax M2 | MiniMax | $0.255 | $1.00 |
| Qwen2.5 VL 72B Instruct | Alibaba | $0.800 | $1.00 |
| Sonar | Perplexity | $1.00 | $1.00 |
| Qwen2.5 Coder 32B Instruct | Alibaba | $0.660 | $1.00 |
| Qwen3 Next 80B A3B Instruct | Alibaba | $0.090 | $1.10 |
| MiniMax-01 | MiniMax | $0.200 | $1.10 |
| Qwen3.6 Flash | Alibaba | $0.188 | $1.13 |
| Step 3.7 Flash | StepFun | $0.200 | $1.15 |
| MiniMax M3 | MiniMax | $0.300 | $1.20 |
| KAT-Coder-Pro V2 | Kuaishou | $0.300 | $1.20 |
| MiniMax M2-her | MiniMax | $0.300 | $1.20 |
| Virtuoso Large | arcee-ai | $0.750 | $1.20 |
| GPT-5.4 Nano | OpenAI | $0.200 | $1.25 |
| Cogito v2.1 671B | deepcogito | $1.25 | $1.25 |
| ERNIE 4.5 VL 424B A47B | Baidu | $0.420 | $1.25 |
| Claude 3 Haiku | Anthropic | $0.250 | $1.25 |
| Qwen3.7 Plus | Alibaba | $0.320 | $1.28 |
| Qwen3 VL 8B Thinking | Alibaba | $0.117 | $1.36 |
| Aion-1.0-Mini | aion-labs | $0.700 | $1.40 |
| Perceptron Mk1 | perceptron | $0.150 | $1.50 |
| Gemini 3.1 Flash Lite | Google | $0.250 | $1.50 |
| Gemini 3.1 Flash Lite Preview | Google | $0.250 | $1.50 |
| Mistral Large 3 2512 | Mistral AI | $0.500 | $1.50 |
| GPT-3.5 Turbo | OpenAI | $0.500 | $1.50 |
| Qwen3.5-27B | Alibaba | $0.195 | $1.56 |
| Qwen3.5 Plus 2026-02-15 | Alibaba | $0.260 | $1.56 |
| Qwen3 VL 30B A3B Thinking | Alibaba | $0.130 | $1.56 |
| Aion-2.0 | aion-labs | $0.800 | $1.60 |
| GPT-4.1 Mini | OpenAI | $0.400 | $1.60 |
| GLM 4.6 | Zhipu AI | $0.430 | $1.74 |
| GLM 4.7 | Zhipu AI | $0.400 | $1.75 |
| Qwen3.5 Plus 2026-04-20 | Alibaba | $0.300 | $1.80 |
| GLM 4.5V | Zhipu AI | $0.600 | $1.80 |
| Qwen3 Coder 480B A35B | Alibaba | $0.220 | $1.80 |
| ALLaM 1 13B Instruct | HUMAIN | $1.80 | $1.80 |
| Qwen3 235B A22B | Alibaba | $0.455 | $1.82 |
| GLM 5 | Zhipu AI | $0.600 | $1.92 |
| Qwen3.6 Plus | Alibaba | $0.325 | $1.95 |
| Grok Build 0.1 | xAI | $1.00 | $2.00 |
| Seed-2.0-Lite | ByteDance | $0.250 | $2.00 |
| Seed 1.6 | ByteDance | $0.250 | $2.00 |
| Devstral 2 2512 | Mistral AI | $0.400 | $2.00 |
| GPT-5.1-Codex-Mini | OpenAI | $0.250 | $2.00 |
| GPT-5 Image Mini | OpenAI | $2.50 | $2.00 |
| Mistral Medium 3.1 | Mistral AI | $0.400 | $2.00 |
| GPT-5 Mini | OpenAI | $0.250 | $2.00 |
| Mistral Medium 3 | Mistral AI | $0.400 | $2.00 |
| GPT-3.5 Turbo (older v0613) | OpenAI | $1.00 | $2.00 |
| GPT-3.5 Turbo Instruct | OpenAI | $1.50 | $2.00 |
| Kimi K2.5 | Moonshot AI | $0.375 | $2.02 |
| Qwen3.5-122B-A10B | Alibaba | $0.260 | $2.08 |
| R1 0528 | DeepSeek | $0.500 | $2.15 |
| Nemotron 3 Ultra | NVIDIA | $0.500 | $2.20 |
| GLM 4.5 | Zhipu AI | $0.600 | $2.20 |
| MiniMax M1 | MiniMax | $0.400 | $2.20 |
| Kimi K2 0711 | Moonshot AI | $0.570 | $2.30 |
| GPT Audio Mini | OpenAI | $0.600 | $2.40 |
| Qwen3.5 397B A17B | Alibaba | $0.385 | $2.45 |
| Grok 4.3 | xAI | $1.25 | $2.50 |
| Grok 4.20 Multi-Agent | xAI | $1.25 | $2.50 |
| Grok 4.20 | xAI | $1.25 | $2.50 |
| Nova 2 Lite | Amazon | $0.300 | $2.50 |
| Kimi K2 Thinking | Moonshot AI | $0.600 | $2.50 |
| Nano Banana (Gemini 2.5 Flash Image) | Google | $0.300 | $2.50 |
| Kimi K2 0905 | Moonshot AI | $0.600 | $2.50 |
| Gemini 2.5 Flash | Google | $0.300 | $2.50 |
| R1 | DeepSeek | $0.700 | $2.50 |
| Composer 2 | Cursor | $0.500 | $2.50 |
| Qwen3 VL 235B A22B Thinking | Alibaba | $0.260 | $2.60 |
| Nano Banana 2 (Gemini 3.1 Flash Image) | Google | $0.500 | $3.00 |
| GLM 5.2 | Zhipu AI | $0.950 | $3.00 |
| Nano Banana 2 (Gemini 3.1 Flash Image Preview) | Google | $0.500 | $3.00 |
| Gemini 3 Flash Preview | Google | $0.500 | $3.00 |
| GLM 5.1 | Zhipu AI | $0.980 | $3.08 |
| Qwen3.6 27B | Alibaba | $0.289 | $3.17 |
| Nova Pro 1.0 | Amazon | $0.800 | $3.20 |
| Qwen3 Coder Plus | Alibaba | $0.650 | $3.25 |
| MoonshotAI Kimi Latest | ~moonshotai | $0.660 | $3.41 |
| Kimi K2.6 | Moonshot AI | $0.660 | $3.41 |
| Kimi K2.7 Code | Moonshot AI | $0.740 | $3.50 |
| Qwen3.7 Max | Alibaba | $1.25 | $3.75 |
| Qwen3 Max Thinking | Alibaba | $0.780 | $3.90 |
| Qwen3 Max | Alibaba | $0.780 | $3.90 |
| GLM 5V Turbo | Zhipu AI | $1.20 | $4.00 |
| GLM 5 Turbo | Zhipu AI | $1.20 | $4.00 |
| GPT-3.5 Turbo 16k | OpenAI | $3.00 | $4.00 |
| o4 Mini High | OpenAI | $1.10 | $4.40 |
| o4 Mini | OpenAI | $1.10 | $4.40 |
| o3 Mini High | OpenAI | $1.10 | $4.40 |
| o3 Mini | OpenAI | $1.10 | $4.40 |
| OpenAI GPT Mini Latest | ~openai | $0.750 | $4.50 |
| GPT-5.4 Mini | OpenAI | $0.750 | $4.50 |
| Anthropic Claude Haiku Latest | ~anthropic | $1.00 | $5.00 |
| Claude Haiku 4.5 | Anthropic | $1.00 | $5.00 |
| Palmyra X5 | Writer | $0.600 | $6.00 |
| Mistral Large 2407 | Mistral AI | $2.00 | $6.00 |
| Mixtral 8x22B Instruct | Mistral AI | $2.00 | $6.00 |
| Mistral Large | Mistral AI | $2.00 | $6.00 |
| Qwen3.6 Max Preview | Alibaba | $1.04 | $6.24 |
| Mistral Medium 3.5 | Mistral AI | $1.50 | $7.50 |
| Composer 2 Fast | Cursor | $1.50 | $7.50 |
| o4 Mini Deep Research | OpenAI | $2.00 | $8.00 |
| Jamba Large 1.7 | AI21 Labs | $2.00 | $8.00 |
| o3 | OpenAI | $2.00 | $8.00 |
| GPT-4.1 | OpenAI | $2.00 | $8.00 |
| Sonar Reasoning Pro | Perplexity | $2.00 | $8.00 |
| Sonar Deep Research | Perplexity | $2.00 | $8.00 |
| Aion-1.0 | aion-labs | $4.00 | $8.00 |
| Gemini 3.5 Flash | Google | $1.50 | $9.00 |
| Google Gemini Flash Latest | ~google | $1.50 | $9.00 |
| GPT Audio | OpenAI | $2.50 | $10.00 |
| GPT-5.1-Codex-Max | OpenAI | $1.25 | $10.00 |
| GPT-5.1 | OpenAI | $1.25 | $10.00 |
| GPT-5.1 Chat | OpenAI | $1.25 | $10.00 |
| GPT-5.1-Codex | OpenAI | $1.25 | $10.00 |
| GPT-5 Image | OpenAI | $10.00 | $10.00 |
| GPT-5 Codex | OpenAI | $1.25 | $10.00 |
| GPT-5 Chat | OpenAI | $1.25 | $10.00 |
| GPT-5 | OpenAI | $1.25 | $10.00 |
| Gemini 2.5 Pro | Google | $1.25 | $10.00 |
| Gemini 2.5 Pro Preview 06-05 | Google | $1.25 | $10.00 |
| Gemini 2.5 Pro Preview 05-06 | Google | $1.25 | $10.00 |
| Command A | Cohere | $2.50 | $10.00 |
| GPT-4o Search Preview | OpenAI | $2.50 | $10.00 |
| GPT-4o (2024-11-20) | OpenAI | $2.50 | $10.00 |
| Inflection 3 Productivity | Inflection | $2.50 | $10.00 |
| Inflection 3 Pi | Inflection | $2.50 | $10.00 |
| Command R+ (08-2024) | Cohere | $2.50 | $10.00 |
| GPT-4o (2024-08-06) | OpenAI | $2.50 | $10.00 |
| GPT-4o | OpenAI | $2.50 | $10.00 |
| Nano Banana Pro (Gemini 3 Pro Image) | Google | $2.00 | $12.00 |
| Google Gemini Pro Latest | ~google | $2.00 | $12.00 |
| Gemini 3.1 Pro Preview Custom Tools | Google | $2.00 | $12.00 |
| Gemini 3.1 Pro Preview | Google | $2.00 | $12.00 |
| Nano Banana Pro (Gemini 3 Pro Image Preview) | Google | $2.00 | $12.00 |
| Nova Premier 1.0 | Amazon | $2.50 | $12.50 |
| GPT-5.3 Chat | OpenAI | $1.75 | $14.00 |
| GPT-5.3-Codex | OpenAI | $1.75 | $14.00 |
| GPT-5.2-Codex | OpenAI | $1.75 | $14.00 |
| GPT-5.2 Chat | OpenAI | $1.75 | $14.00 |
| GPT-5.2 | OpenAI | $1.75 | $14.00 |
| Anthropic Claude Sonnet Latest | ~anthropic | $3.00 | $15.00 |
| GPT-5.4 Image 2 | OpenAI | $8.00 | $15.00 |
| GPT-5.4 | OpenAI | $2.50 | $15.00 |
| Claude Sonnet 4.6 | Anthropic | $3.00 | $15.00 |
| Sonar Pro Search | Perplexity | $3.00 | $15.00 |
| Claude Sonnet 4.5 | Anthropic | $3.00 | $15.00 |
| Claude Sonnet 4 | Anthropic | $3.00 | $15.00 |
| Sonar Pro | Perplexity | $3.00 | $15.00 |
| GPT-4o (2024-05-13) | OpenAI | $5.00 | $15.00 |
| Claude Opus 4.8 | Anthropic | $5.00 | $25.00 |
| Claude Opus Latest | ~anthropic | $5.00 | $25.00 |
| Claude Opus 4.7 | Anthropic | $5.00 | $25.00 |
| Claude Opus 4.6 | Anthropic | $5.00 | $25.00 |
| Claude Opus 4.5 | Anthropic | $5.00 | $25.00 |
| Fugu Ultra | sakana | $5.00 | $30.00 |
| GPT Chat Latest | OpenAI | $5.00 | $30.00 |
| OpenAI GPT Latest | ~openai | $5.00 | $30.00 |
| GPT-5.5 | OpenAI | $5.00 | $30.00 |
| GPT-4 Turbo | OpenAI | $10.00 | $30.00 |
| GPT-4 Turbo Preview | OpenAI | $10.00 | $30.00 |
| o3 Deep Research | OpenAI | $10.00 | $40.00 |
| Claude Fable Latest | ~anthropic | $10.00 | $50.00 |
| Claude Fable 5 | Anthropic | $10.00 | $50.00 |
| Claude Opus 4.8 (Fast) | Anthropic | $10.00 | $50.00 |
| o1 | OpenAI | $15.00 | $60.00 |
| GPT-4 | OpenAI | $30.00 | $60.00 |
| Claude Opus 4.1 | Anthropic | $15.00 | $75.00 |
| Claude Opus 4 | Anthropic | $15.00 | $75.00 |
| o3 Pro | OpenAI | $20.00 | $80.00 |
| GPT-5 Pro | OpenAI | $15.00 | $120.00 |
| Claude Opus 4.7 (Fast) | Anthropic | $30.00 | $150.00 |
| Claude Opus 4.6 (Fast) | Anthropic | $30.00 | $150.00 |
| GPT-5.2 Pro | OpenAI | $21.00 | $168.00 |
| GPT-5.5 Pro | OpenAI | $30.00 | $180.00 |
| GPT-5.4 Pro | OpenAI | $30.00 | $180.00 |
| o1-pro | OpenAI | $150.00 | $600.00 |
| Stable Diffusion 3.5 | Stability AI | Free | $35000.00 |
| DALL-E 3 | OpenAI | Free | $40000.00 |
| Recraft V3 | Recraft | Free | $40000.00 |
| Imagen 3 | Google | Free | $40000.00 |
| FLUX.1 Pro | Black Forest Labs | Free | $50000.00 |
| Kling 1.6 | Kuaishou | Free | $70000.00 |
| Ideogram 2.0 | Ideogram | Free | $80000.00 |
| Veo 2 | Google | Free | $350000.00 |
AI APIs charge per token, not per request. A token is roughly 3/4 of a word in English. For example, the sentence "Hello, how are you?" is about 6 tokens. Prices are quoted per million tokens (1M tokens is approximately 750,000 words).
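As a rough sketch of that rule of thumb, the snippet below converts a word count into an estimated token count and prices it per million tokens. The 0.75 words-per-token ratio is the approximation stated above, not a real tokenizer, and the function names are hypothetical.

```python
WORDS_PER_TOKEN = 0.75  # rule of thumb for English; actual tokenizers vary by model

def estimate_tokens(word_count: int) -> int:
    """Approximate token count for an English text of `word_count` words."""
    return round(word_count / WORDS_PER_TOKEN)

def cost_usd(tokens: int, price_per_million: float) -> float:
    """Cost of `tokens` tokens at a price quoted in $ per 1M tokens."""
    return tokens / 1_000_000 * price_per_million

# A 1,500-word article is roughly 2,000 tokens.
article_tokens = estimate_tokens(1_500)

# Priced at $0.600 per 1M output tokens (the GPT-4o-mini row in the table).
print(article_tokens, f"${cost_usd(article_tokens, 0.600):.4f}")
```

For billing-accurate numbers, the provider's own tokenizer or the usage figures returned with each API response are the better source.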
Input tokens are what you send to the model (prompts, context, system instructions). Output tokens are what the model generates. Output tokens are typically 2 to 5 times more expensive than input tokens because they require more computation.
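The split between input and output pricing can be made concrete with a small sketch. The prices are taken from rows of the table above; the `Price` class and its method names are hypothetical, introduced only for illustration.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Price:
    """Prices in $ per 1M tokens."""
    input_per_m: float
    output_per_m: float

    def output_to_input_ratio(self) -> float:
        return self.output_per_m / self.input_per_m

    def request_cost(self, input_tokens: int, output_tokens: int) -> float:
        """Cost in $ of one request with the given token counts."""
        total = input_tokens * self.input_per_m + output_tokens * self.output_per_m
        return total / 1_000_000

# Rows copied from the pricing table.
PRICES = {
    "Grok 4.3": Price(1.25, 2.50),
    "GPT-4o-mini": Price(0.150, 0.600),
    "Claude Sonnet 4.6": Price(3.00, 15.00),
}

for model, price in PRICES.items():
    ratio = price.output_to_input_ratio()
    cost = price.request_cost(input_tokens=800, output_tokens=400)
    print(f"{model}: output is {ratio:.1f}x input, request costs ${cost:.5f}")
```

Because output tokens dominate the bill for these models, capping response length is often a more effective cost lever than trimming the prompt.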
Some providers offer batch pricing at 50% discount for non-time-sensitive workloads. Batch requests are queued and processed within a 24-hour window, making them ideal for data processing, content generation pipelines, and evaluation runs.
Providers like Anthropic and OpenAI offer prompt caching, which reduces input costs by up to 90% for repeated prefixes. If you send the same system prompt across requests, cached tokens are charged at a fraction of the standard rate.
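The effect of the batch and caching discounts can be sketched as follows. This assumes the best-case figures quoted above (50% off for batch, 90% off cached input tokens) and ignores any surcharge a provider may apply when a prefix is first written to the cache. The function names are hypothetical, and actual discount rates should be checked against each provider's documentation.

```python
BATCH_DISCOUNT = 0.50   # assumed flat discount for batch requests
CACHE_DISCOUNT = 0.90   # assumed best-case discount on cached input tokens

def batch_cost(standard_cost: float) -> float:
    """Cost in $ of a job after the batch discount."""
    return standard_cost * (1 - BATCH_DISCOUNT)

def cached_input_cost(prefix_tokens: int, fresh_tokens: int,
                      input_price_per_m: float,
                      cache_discount: float = CACHE_DISCOUNT) -> float:
    """Input cost in $ when `prefix_tokens` are served from the cache."""
    cached = prefix_tokens * input_price_per_m * (1 - cache_discount)
    fresh = fresh_tokens * input_price_per_m
    return (cached + fresh) / 1_000_000

# A 2,000-token system prompt plus 500 new tokens at $3.00 per 1M input tokens.
uncached = cached_input_cost(2_000, 500, 3.00, cache_discount=0.0)
cached = cached_input_cost(2_000, 500, 3.00)
print(f"uncached: ${uncached:.4f}, cached: ${cached:.4f}")
print(f"$10.00 job as batch: ${batch_cost(10.00):.2f}")
```

The caching saving scales with the share of each request that is a repeated prefix, so long shared system prompts benefit far more than short ones.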
Estimated daily costs for common use cases across different price points.
500 conversations/day, ~800 input + 400 output tokens each
100 articles/day, ~2,000 input + 1,500 output tokens each
1,000 completions/day, ~500 input + 200 output tokens each
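A minimal sketch of how such estimates can be computed for the three workloads above. The volumes and token counts come from the text and the prices from the pricing table; the workload labels and the `daily_cost` helper are hypothetical.

```python
# (requests per day, input tokens per request, output tokens per request)
WORKLOADS = {
    "conversations": (500, 800, 400),
    "articles": (100, 2_000, 1_500),
    "completions": (1_000, 500, 200),
}

# (input $/1M, output $/1M), copied from the pricing table.
PRICES = {
    "GPT-4o-mini": (0.150, 0.600),
    "DeepSeek V4 Pro": (0.435, 0.870),
    "Claude Sonnet 4.6": (3.00, 15.00),
}

def daily_cost(requests: int, input_tokens: int, output_tokens: int,
               input_price: float, output_price: float) -> float:
    """Daily cost in $ for a workload at the given per-1M-token prices."""
    per_request = input_tokens * input_price + output_tokens * output_price
    return requests * per_request / 1_000_000

for workload, shape in WORKLOADS.items():
    for model, price in PRICES.items():
        print(f"{workload:>13} on {model:<17}: ${daily_cost(*shape, *price):.3f}/day")
```

These figures exclude batch and caching discounts, retries, and any per-request fees, so they are best read as an upper-level estimate rather than a bill.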
DeepSeek offers the lowest prices for high-quality models, often 10-50x cheaper than OpenAI. Google Gemini Flash offers a generous free tier. For open-source models, Together AI and Groq provide competitive hosted inference pricing.
Prices vary dramatically: among token-priced models in the table above, they range from $0.01 per 1M tokens (budget models) to $600 per 1M output tokens (o1-pro, a premium reasoning model). The best value depends on your quality requirements. Mid-tier models like GPT-4o Mini and Claude 3.5 Haiku offer strong price-performance ratios.
Yes. AI API prices have dropped by more than 90% over the past two years. Competition, efficiency improvements, and open-source alternatives continue to drive prices down. Our pricing history page tracks these trends.