Compare context windows, output capacity, and cost efficiency across 367+ models. Data sourced live from upstream provider APIs.
Note: This page shows capacity specs and pricing aggregated from upstream provider APIs. Real-world latency and tokens-per-second vary by load, prompt length, and provider infrastructure. For speed benchmarks, see Artificial Analysis or the model provider's own documentation.
Largest Context
2M
Grok 4.20 Multi-Agent
Largest Output
1.0M
MiniMax-01
Cheapest (non-free)
$0.02/1M in
Granite 4.0 Micro
Best Output Ratio
100%
GPT-3.5 Turbo (older v0613)
| Model | Provider | Context Window | Max Output | Output Ratio | Input $/1M | Output $/1M | Efficiency (derived) | Capabilities |
|---|---|---|---|---|---|---|---|---|
| Grok 4.20 Multi-Agent | xAI | 2M | — | — | $2.00 | $6.00 | 40 | |
| Grok 4.20 | xAI | 2M | — | — | $1.25 | $2.50 | 48 | |
| Grok 4.1 Fast | xAI | 2M | 30K | 2% | $0.20 | $0.50 | 83 | |
| Grok 4 Fast | xAI | 2M | 30K | 2% | $0.20 | $0.50 | 83 | |
| OpenAI GPT Latest | ~openai | 1.1M | 128K | 12% | $5.00 | $30.00 | 28 | |
| GPT-5.5 Pro | OpenAI | 1.1M | 128K | 12% | $30.00 | $180.00 | 17 | |
| GPT-5.5 | OpenAI | 1.1M | 128K | 12% | $5.00 | $30.00 | 28 | |
| GPT-5.4 Pro | OpenAI | 1.1M | 128K | 12% | $30.00 | $180.00 | 17 | |
| GPT-5.4 | OpenAI | 1.1M | 128K | 12% | $2.50 | $15.00 | 36 | |
| Gemini 3.1 Flash Lite | 1.0M | 66K | 6% | $0.25 | $1.50 | 76 | ||
| Google Gemini Pro Latest | 1.0M | 66K | 6% | $2.00 | $12.00 | 39 | ||
| Google Gemini Flash Latest | 1.0M | 66K | 6% | $0.50 | $3.00 | 63 | ||
| DeepSeek V4 Pro | DeepSeek | 1.0M | 384K | 37% | $0.43 | $0.87 | 66 | |
| DeepSeek V4 Flash | DeepSeek | 1.0M | 384K | 37% | $0.14 | $0.28 | 84 | |
| MiMo-V2.5-Pro | Xiaomi | 1.0M | 16K | 2% | $1.00 | $3.00 | 50 | |
| MiMo-V2.5 | Xiaomi | 1.0M | 131K | 13% | $0.40 | $2.00 | 67 | |
| Lyria 3 Pro Preview | 1.0M | 66K | 6% | Free | Free | 100 | ||
| Lyria 3 Clip Preview | 1.0M | 66K | 6% | Free | Free | 100 | ||
| MiMo-V2-Pro | Xiaomi | 1.0M | 131K | 13% | $1.00 | $3.00 | 50 | |
| Gemini 3.1 Flash Lite Preview | 1.0M | 66K | 6% | $0.25 | $1.50 | 76 | ||
| Gemini 3.1 Pro Preview Custom Tools | 1.0M | 66K | 6% | $2.00 | $12.00 | 39 | ||
| Gemini 3.1 Pro Preview | 1.0M | 66K | 6% | $2.00 | $12.00 | 39 | ||
| Gemini 3 Flash Preview | 1.0M | 66K | 6% | $0.50 | $3.00 | 63 | ||
| Gemini 2.5 Flash Lite Preview 09-2025 | 1.0M | 66K | 6% | $0.10 | $0.40 | 88 | ||
| Gemini 2.5 Flash Lite | 1.0M | 66K | 6% | $0.10 | $0.40 | 88 | ||
| Gemini 2.5 Flash | 1.0M | 66K | 6% | $0.30 | $2.50 | 73 | ||
| Gemini 2.5 Pro | 1.0M | 66K | 6% | $1.25 | $10.00 | 46 | ||
| Gemini 2.5 Pro Preview 06-05 | 1.0M | 66K | 6% | $1.25 | $10.00 | 46 | ||
| Gemini 2.5 Pro Preview 05-06 | 1.0M | 66K | 6% | $1.25 | $10.00 | 46 | ||
| Llama 4 Maverick | Meta | 1.0M | 16K | 2% | $0.15 | $0.60 | 83 | |
| Gemini 2.0 Flash Lite | 1.0M | 8K | 1% | $0.07 | $0.30 | 91 | ||
| GPT-4.1 | OpenAI | 1.0M | — | — | $2.00 | $8.00 | 39 | |
| GPT-4.1 Mini | OpenAI | 1.0M | 33K | 3% | $0.40 | $1.60 | 67 | |
| GPT-4.1 Nano | OpenAI | 1.0M | 33K | 3% | $0.10 | $0.40 | 88 | |
| Palmyra X5 | Writer | 1.0M | 8K | 1% | $0.60 | $6.00 | 60 | |
| MiniMax-01 | MiniMax | 1.0M | 1.0M | 100% | $0.20 | $1.10 | 79 | |
| Grok 4.3 | xAI | 1M | — | — | $1.25 | $2.50 | 46 | |
| Anthropic Claude Sonnet Latest | ~anthropic | 1M | 128K | 13% | $3.00 | $15.00 | 33 | |
| Qwen3.5 Plus 2026-04-20 | Alibaba | 1M | 66K | 7% | $0.40 | $2.40 | 67 | |
| Qwen3.6 Flash | Alibaba | 1M | 66K | 7% | $0.25 | $1.50 | 75 | |
| Claude Opus Latest | ~anthropic | 1M | 128K | 13% | $5.00 | $25.00 | 28 | |
| Claude Opus 4.7 | Anthropic | 1M | 128K | 13% | $5.00 | $25.00 | 28 | |
| Claude Opus 4.6 (Fast) | Anthropic | 1M | 128K | 13% | $30.00 | $150.00 | 17 | |
| Qwen3.6 Plus | Alibaba | 1M | 66K | 7% | $0.33 | $1.95 | 71 | |
| Qwen3.5-Flash | Alibaba | 1M | 66K | 7% | $0.07 | $0.26 | 91 | |
| Claude Sonnet 4.6 | Anthropic | 1M | 128K | 13% | $3.00 | $15.00 | 33 | |
| Qwen3.5 Plus 2026-02-15 | Alibaba | 1M | 66K | 7% | $0.26 | $1.56 | 75 | |
| Claude Opus 4.6 | Anthropic | 1M | 128K | 13% | $5.00 | $25.00 | 28 | |
| Nova 2 Lite | Amazon | 1M | 66K | 7% | $0.30 | $2.50 | 72 | |
| Nova Premier 1.0 | Amazon | 1M | 32K | 3% | $2.50 | $12.50 | 35 | |
| Claude Sonnet 4.5 | Anthropic | 1M | 64K | 6% | $3.00 | $15.00 | 33 | |
| Qwen3 Coder Plus | Alibaba | 1M | 66K | 7% | $0.65 | $3.25 | 58 | |
| Qwen3 Coder Flash | Alibaba | 1M | 66K | 7% | $0.20 | $0.97 | 79 | |
| Qwen Plus 0728 (thinking) | Alibaba | 1M | 33K | 3% | $0.26 | $0.78 | 75 | |
| Qwen Plus 0728 | Alibaba | 1M | 33K | 3% | $0.26 | $0.78 | 75 | |
| MiniMax M1 | MiniMax | 1M | 40K | 4% | $0.40 | $2.20 | 67 | |
| Claude Sonnet 4 | Anthropic | 1M | 64K | 6% | $3.00 | $15.00 | 33 | |
| Gemini 2.0 Flash | 1M | 8K | 1% | $0.10 | $0.40 | 88 | ||
| Qwen-Plus | Alibaba | 1M | 33K | 3% | $0.26 | $0.78 | 75 | |
| GPT Chat Latest | OpenAI | 400K | 128K | 32% | $5.00 | $30.00 | 26 | |
| OpenAI GPT Mini Latest | ~openai | 400K | 128K | 32% | $0.75 | $4.50 | 51 | |
| GPT-5.4 Nano | OpenAI | 400K | 128K | 32% | $0.20 | $1.25 | 74 | |
| GPT-5.4 Mini | OpenAI | 400K | 128K | 32% | $0.75 | $4.50 | 51 | |
| GPT-5.3-Codex | OpenAI | 400K | 128K | 32% | $1.75 | $14.00 | 38 | |
| GPT-5.2-Codex | OpenAI | 400K | 128K | 32% | $1.75 | $14.00 | 38 | |
| GPT-5.2 Pro | OpenAI | 400K | 128K | 32% | $21.00 | $168.00 | 17 | |
| GPT-5.2 | OpenAI | 400K | 128K | 32% | $1.75 | $14.00 | 38 | |
| GPT-5.1-Codex-Max | OpenAI | 400K | 128K | 32% | $1.25 | $10.00 | 43 | |
| GPT-5.1 | OpenAI | 400K | 128K | 32% | $1.25 | $10.00 | 43 | |
| GPT-5.1-Codex | OpenAI | 400K | 128K | 32% | $1.25 | $10.00 | 43 | |
| GPT-5.1-Codex-Mini | OpenAI | 400K | 128K | 32% | $0.25 | $2.00 | 70 | |
| GPT-5 Pro | OpenAI | 400K | 128K | 32% | $15.00 | $120.00 | 19 | |
| GPT-5 Codex | OpenAI | 400K | 128K | 32% | $1.25 | $10.00 | 43 | |
| GPT-5 | OpenAI | 400K | 128K | 32% | $1.25 | $10.00 | 43 | |
| GPT-5 Mini | OpenAI | 400K | 128K | 32% | $0.25 | $2.00 | 70 | |
| GPT-5 Nano | OpenAI | 400K | — | — | $0.05 | $0.40 | 87 | |
| Llama 4 Scout | Meta | 328K | 16K | 5% | $0.08 | $0.30 | 82 | |
| Nova Lite 1.0 | Amazon | 300K | 5K | 2% | $0.06 | $0.24 | 84 | |
| Nova Pro 1.0 | Amazon | 300K | 5K | 2% | $0.80 | $3.20 | 49 | |
| Ring-2.6-1T (free) | inclusionai | 262K | 66K | 25% | Free | Free | 90 | |
| Mistral Medium 3.5 | Mistral AI | 262K | — | — | $1.50 | $7.50 | 39 | |
| MoonshotAI Kimi Latest | ~moonshotai | 262K | 16K | 6% | $0.75 | $3.50 | 50 | |
| Qwen3.6 35B A3B | Alibaba | 262K | 262K | 100% | $0.15 | $1.00 | 75 | |
| Qwen3.6 Max Preview | Alibaba | 262K | 66K | 25% | $1.04 | $6.24 | 44 | |
| Qwen3.6 27B | Alibaba | 262K | 82K | 31% | $0.32 | $3.20 | 64 | |
| Ling-2.6-1T | inclusionai | 262K | 33K | 13% | $0.30 | $2.50 | 65 | |
| Hy3 preview | Tencent | 262K | 262K | 100% | $0.07 | $0.26 | 82 | |
| Ling-2.6-flash | inclusionai | 262K | 33K | 13% | $0.08 | $0.24 | 81 | |
| Kimi K2.6 | Moonshot AI | 262K | 16K | 6% | $0.75 | $3.50 | 50 | |
| Gemma 4 26B A4B (free) | 262K | 33K | 13% | Free | Free | 90 | ||
| Gemma 4 26B A4B | 262K | — | — | $0.06 | $0.33 | 83 | ||
| Gemma 4 31B (free) | 262K | 33K | 13% | Free | Free | 90 | ||
| Gemma 4 31B | 262K | 16K | 6% | $0.13 | $0.38 | 77 | ||
| Trinity Large Thinking | arcee-ai | 262K | 262K | 100% | $0.22 | $0.85 | 70 | |
| MiMo-V2-Omni | Xiaomi | 262K | 66K | 25% | $0.40 | $2.00 | 61 | |
| Mistral Small 4 | Mistral AI | 262K | — | — | $0.15 | $0.60 | 75 | |
| Nemotron 3 Super (free) | NVIDIA | 262K | 262K | 100% | Free | Free | 90 | |
| Nemotron 3 Super | NVIDIA | 262K | — | — | $0.09 | $0.45 | 80 | |
| Seed-2.0-Lite | ByteDance | 262K | 131K | 50% | $0.25 | $2.00 | 68 | |
| Qwen3.5-9B | Alibaba | 262K | 82K | 31% | $0.04 | $0.15 | 85 | |
| Seed-2.0-Mini | ByteDance | 262K | 131K | 50% | $0.10 | $0.40 | 79 | |
| Qwen3.5-35B-A3B | Alibaba | 262K | 82K | 31% | $0.14 | $1.00 | 76 | |
| Qwen3.5-27B | Alibaba | 262K | 66K | 25% | $0.20 | $1.56 | 72 | |
| Qwen3.5-122B-A10B | Alibaba | 262K | 66K | 25% | $0.26 | $2.08 | 67 | |
| Qwen3.5 397B A17B | Alibaba | 262K | 66K | 25% | $0.39 | $2.34 | 61 | |
| Qwen3 Max Thinking | Alibaba | 262K | 33K | 13% | $0.78 | $3.90 | 49 | |
| Qwen3 Coder Next | Alibaba | 262K | 262K | 100% | $0.11 | $0.80 | 78 | |
| Step 3.5 Flash | StepFun | 262K | 66K | 25% | $0.10 | $0.30 | 79 | |
| Kimi K2.5 | Moonshot AI | 262K | 66K | 25% | $0.44 | $2.00 | 59 | |
| Seed 1.6 Flash | ByteDance | 262K | 33K | 13% | $0.07 | $0.30 | 81 | |
| Seed 1.6 | ByteDance | 262K | 33K | 13% | $0.25 | $2.00 | 68 | |
| MiMo-V2-Flash | Xiaomi | 262K | 66K | 25% | $0.10 | $0.30 | 79 | |
| Nemotron 3 Nano 30B A3B | NVIDIA | 262K | 228K | 87% | $0.05 | $0.20 | 84 | |
| Devstral 2 2512 | Mistral AI | 262K | — | — | $0.40 | $2.00 | 61 | |
| Ministral 3 14B 2512 | Mistral AI | 262K | — | — | $0.20 | $0.20 | 71 | |
| Ministral 3 8B 2512 | Mistral AI | 262K | — | — | $0.15 | $0.15 | 75 | |
| Mistral Large 3 2512 | Mistral AI | 262K | — | — | $0.50 | $1.50 | 57 | |
| Kimi K2 Thinking | Moonshot AI | 262K | 262K | 100% | $0.60 | $2.50 | 54 | |
| Qwen3 VL 235B A22B Instruct | Alibaba | 262K | 16K | 6% | $0.20 | $0.88 | 71 | |
| Qwen3 Max | Alibaba | 262K | 33K | 13% | $0.78 | $3.90 | 49 | |
| Qwen3 Next 80B A3B Instruct (free) | Alibaba | 262K | — | — | Free | Free | 90 | |
| Qwen3 Next 80B A3B Instruct | Alibaba | 262K | 16K | 6% | $0.09 | $1.10 | 80 | |
| Kimi K2 0905 | Moonshot AI | 262K | 262K | 100% | $0.40 | $2.00 | 61 | |
| Qwen3 30B A3B Instruct 2507 | Alibaba | 262K | 262K | 100% | $0.09 | $0.30 | 80 | |
| Qwen3 Coder 480B A35B | Alibaba | 262K | 66K | 25% | $0.22 | $1.80 | 70 | |
| Qwen3 235B A22B Instruct 2507 | Alibaba | 262K | 16K | 6% | $0.07 | $0.10 | 82 | |
| Falcon-H1-Arabic 34B Instruct | TII | 262K | 8K | 3% | Free | Free | 90 | |
| Falcon-H1-Arabic 7B Instruct | TII | 262K | 8K | 3% | Free | Free | 90 | |
| Qwen3 Coder 480B A35B (free) | Alibaba | 262K | 262K | 100% | Free | Free | 90 | |
| Nemotron 3 Nano Omni (free) | NVIDIA | 256K | 66K | 26% | Free | Free | 90 | |
| KAT-Coder-Pro V2 | Kuaishou | 256K | 80K | 31% | $0.30 | $1.20 | 65 | |
| Nemotron 3 Nano 30B A3B (free) | NVIDIA | 256K | — | — | Free | Free | 90 | |
| Grok Code Fast 1 | xAI | 256K | 10K | 4% | $0.20 | $1.50 | 71 | |
| Jamba Large 1.7 | AI21 Labs | 256K | 4K | 2% | $2.00 | $8.00 | 35 | |
| Codestral 2508 | Mistral AI | 256K | — | — | $0.30 | $0.90 | 65 | |
| Grok 4 | xAI | 256K | — | — | $3.00 | $15.00 | 30 | |
| Command A | Cohere | 256K | 8K | 3% | $2.50 | $10.00 | 32 | |
| GLM 4.6 | Zhipu AI | 205K | 205K | 100% | $0.39 | $1.90 | 60 | |
| GLM 5.1 | Zhipu AI | 203K | 66K | 32% | $1.05 | $3.50 | 43 | |
| GLM 5V Turbo | Zhipu AI | 203K | 131K | 65% | $1.20 | $4.00 | 41 | |
| GLM 5 Turbo | Zhipu AI | 203K | 131K | 65% | $1.20 | $4.00 | 41 | |
| GLM 5 | Zhipu AI | 203K | — | — | $0.60 | $1.92 | 53 | |
| GLM 4.7 Flash | Zhipu AI | 203K | 16K | 8% | $0.06 | $0.40 | 81 | |
| GLM 4.7 | Zhipu AI | 203K | 131K | 65% | $0.40 | $1.75 | 59 | |
| Anthropic Claude Haiku Latest | ~anthropic | 200K | 64K | 32% | $1.00 | $5.00 | 44 | |
| Claude Opus 4.5 | Anthropic | 200K | 64K | 32% | $5.00 | $25.00 | 25 | |
| Sonar Pro Search | Perplexity | 200K | 8K | 4% | $3.00 | $15.00 | 29 | |
| Claude Haiku 4.5 | Anthropic | 200K | 64K | 32% | $1.00 | $5.00 | 44 | |
| o3 Deep Research | OpenAI | 200K | 100K | 50% | $10.00 | $40.00 | 20 | |
| o4 Mini Deep Research | OpenAI | 200K | 100K | 50% | $2.00 | $8.00 | 34 | |
| Claude Opus 4.1 | Anthropic | 200K | 32K | 16% | $15.00 | $75.00 | 18 | |
| o3 Pro | OpenAI | 200K | 100K | 50% | $20.00 | $80.00 | 16 | |
| Claude Opus 4 | Anthropic | 200K | 32K | 16% | $15.00 | $75.00 | 18 | |
| o4 Mini High | OpenAI | 200K | 100K | 50% | $1.10 | $4.40 | 43 | |
| o3 | OpenAI | 200K | 100K | 50% | $2.00 | $8.00 | 34 | |
| o4 Mini | OpenAI | 200K | 100K | 50% | $1.10 | $4.40 | 43 | |
| o1-pro | OpenAI | 200K | 100K | 50% | $150.00 | $600.00 | 11 | |
| Sonar Pro | Perplexity | 200K | 8K | 4% | $3.00 | $15.00 | 29 | |
| Claude 3.7 Sonnet | Anthropic | 200K | 64K | 32% | $3.00 | $15.00 | 29 | |
| Claude 3.7 Sonnet (thinking) | Anthropic | 200K | 64K | 32% | $3.00 | $15.00 | 29 | |
| o3 Mini High | OpenAI | 200K | 100K | 50% | $1.10 | $4.40 | 43 | |
| o3 Mini | OpenAI | 200K | 100K | 50% | $1.10 | $4.40 | 43 | |
| o1 | OpenAI | 200K | 100K | 50% | $15.00 | $60.00 | 18 | |
| Claude 3.5 Haiku | Anthropic | 200K | 8K | 4% | $0.80 | $4.00 | 48 | |
| Claude 3 Haiku | Anthropic | 200K | 4K | 2% | $0.25 | $1.25 | 67 | |
| Composer 2 | Cursor | 200K | 66K | 33% | $0.50 | $2.50 | 56 | |
| Composer 2 Fast | Cursor | 200K | 66K | 33% | $1.50 | $7.50 | 38 | |
| MiniMax M2.7 | MiniMax | 197K | 131K | 67% | $0.30 | $1.20 | 64 | |
| MiniMax M2.5 (free) | MiniMax | 197K | 8K | 4% | Free | Free | 88 | |
| MiniMax M2.5 | MiniMax | 197K | 197K | 100% | $0.15 | $1.15 | 73 | |
| MiniMax M2.1 | MiniMax | 197K | 197K | 100% | $0.29 | $0.95 | 64 | |
| MiniMax M2 | MiniMax | 197K | 197K | 100% | $0.26 | $1.00 | 66 | |
| DeepSeek V3.2 Speciale | DeepSeek | 164K | 164K | 100% | $0.29 | $0.43 | 63 | |
| DeepSeek V3.2 Exp | DeepSeek | 164K | 66K | 40% | $0.27 | $0.41 | 64 | |
| DeepSeek V3.1 Terminus | DeepSeek | 164K | 33K | 20% | $0.27 | $0.95 | 64 | |
| R1 0528 | DeepSeek | 164K | 33K | 20% | $0.50 | $2.15 | 55 | |
| Llama Guard 4 12B | Meta | 164K | 16K | 10% | $0.18 | $0.18 | 70 | |
| DeepSeek V3 0324 | DeepSeek | 164K | 16K | 10% | $0.20 | $0.77 | 69 | |
| DeepSeek V3 | DeepSeek | 164K | 16K | 10% | $0.32 | $0.89 | 62 | |
| Qwen3 Coder 30B A3B Instruct | Alibaba | 160K | 33K | 20% | $0.07 | $0.27 | 79 | |
| CoBuddy (free) | Baidu | 131K | 66K | 50% | Free | Free | 85 | |
| Granite 4.1 8B | IBM | 131K | 131K | 100% | $0.05 | $0.10 | 79 | |
| Laguna XS.2 (free) | poolside | 131K | 8K | 6% | Free | Free | 85 | |
| Laguna M.1 (free) | poolside | 131K | 8K | 6% | Free | Free | 85 | |
| Aion-2.0 | aion-labs | 131K | 33K | 25% | $0.80 | $1.60 | 46 | |
| GLM 4.6V | Zhipu AI | 131K | 24K | 18% | $0.30 | $0.90 | 62 | |
| Ministral 3 3B 2512 | Mistral AI | 131K | — | — | $0.10 | $0.10 | 75 | |
| Trinity Mini | arcee-ai | 131K | 131K | 100% | $0.04 | $0.15 | 80 | |
| DeepSeek V3.2 | DeepSeek | 131K | 66K | 50% | $0.25 | $0.38 | 64 | |
| gpt-oss-safeguard-20b | OpenAI | 131K | 66K | 50% | $0.07 | $0.30 | 77 | |
| Qwen3 VL 32B Instruct | Alibaba | 131K | 33K | 25% | $0.10 | $0.42 | 74 | |
| Qwen3 VL 8B Thinking | Alibaba | 131K | 33K | 25% | $0.12 | $1.36 | 73 | |
| Qwen3 VL 8B Instruct | Alibaba | 131K | 33K | 25% | $0.08 | $0.50 | 77 | |
| Llama 3.3 Nemotron Super 49B V1.5 | NVIDIA | 131K | 16K | 13% | $0.10 | $0.40 | 75 | |
| ERNIE 4.5 21B A3B Thinking | Baidu | 131K | 66K | 50% | $0.07 | $0.28 | 77 | |
| Qwen3 VL 30B A3B Thinking | Alibaba | 131K | 33K | 25% | $0.13 | $1.56 | 72 | |
| Qwen3 VL 30B A3B Instruct | Alibaba | 131K | 33K | 25% | $0.13 | $0.52 | 72 | |
| Qwen3 VL 235B A22B Thinking | Alibaba | 131K | 33K | 25% | $0.26 | $2.60 | 64 | |
| Tongyi DeepResearch 30B A3B | Alibaba | 131K | 131K | 100% | $0.09 | $0.45 | 76 | |
| Qwen3 Next 80B A3B Thinking | Alibaba | 131K | 33K | 25% | $0.10 | $0.78 | 75 | |
| Nemotron Nano 9B V2 | NVIDIA | 131K | 16K | 13% | $0.04 | $0.16 | 80 | |
| Qwen3 30B A3B Thinking 2507 | Alibaba | 131K | 131K | 100% | $0.08 | $0.40 | 77 | |
| Mistral Medium 3.1 | Mistral AI | 131K | — | — | $0.40 | $2.00 | 57 | |
| gpt-oss-120b (free) | OpenAI | 131K | 131K | 100% | Free | Free | 85 | |
| gpt-oss-120b | OpenAI | 131K | — | — | $0.04 | $0.18 | 81 | |
| gpt-oss-20b (free) | OpenAI | 131K | 8K | 6% | Free | Free | 85 | |
| gpt-oss-20b | OpenAI | 131K | 131K | 100% | $0.03 | $0.14 | 82 | |
| GLM 4.5 | Zhipu AI | 131K | 98K | 75% | $0.60 | $2.20 | 51 | |
| GLM 4.5 Air (free) | Zhipu AI | 131K | 96K | 73% | Free | Free | 85 | |
| GLM 4.5 Air | Zhipu AI | 131K | 98K | 75% | $0.13 | $0.85 | 72 | |
| Qwen3 235B A22B Thinking 2507 | Alibaba | 131K | — | — | $0.15 | $1.50 | 71 | |
| Kimi K2 0711 | Moonshot AI | 131K | 33K | 25% | $0.57 | $2.30 | 51 | |
| Devstral Medium | Mistral AI | 131K | — | — | $0.40 | $2.00 | 57 | |
| Devstral Small 1.1 | Mistral AI | 131K | — | — | $0.10 | $0.30 | 75 | |
| Hunyuan A13B Instruct | Tencent | 131K | 131K | 100% | $0.14 | $0.57 | 71 | |
| Grok 3 Mini | xAI | 131K | — | — | $0.30 | $0.50 | 62 | |
| Grok 3 | xAI | 131K | — | — | $3.00 | $15.00 | 28 | |
| Mistral Medium 3 | Mistral AI | 131K | — | — | $0.40 | $2.00 | 57 | |
| Spotlight | arcee-ai | 131K | 66K | 50% | $0.18 | $0.18 | 69 | |
| Maestro Reasoning | arcee-ai | 131K | 32K | 24% | $0.90 | $3.30 | 44 | |
| Virtuoso Large | arcee-ai | 131K | 64K | 49% | $0.75 | $1.20 | 47 | |
| Qwen3 235B A22B | Alibaba | 131K | 8K | 6% | $0.45 | $1.82 | 55 | |
| Grok 3 Mini Beta | xAI | 131K | — | — | $0.30 | $0.50 | 62 | |
| Grok 3 Beta | xAI | 131K | — | — | $3.00 | $15.00 | 28 | |
| Gemma 3 4B | 131K | 16K | 13% | $0.04 | $0.08 | 80 | ||
| Gemma 3 12B | 131K | 16K | 13% | $0.04 | $0.13 | 80 | ||
| Gemma 3 27B | 131K | 16K | 13% | $0.08 | $0.16 | 77 | ||
| Llama Guard 3 8B | Meta | 131K | — | — | $0.48 | $0.03 | 54 | |
| Qwen VL Plus | Alibaba | 131K | 8K | 6% | $0.14 | $0.41 | 72 | |
| Aion-1.0 | aion-labs | 131K | 33K | 25% | $4.00 | $8.00 | 26 | |
| Aion-1.0-Mini | aion-labs | 131K | 33K | 25% | $0.70 | $1.40 | 48 | |
| Qwen VL Max | Alibaba | 131K | 33K | 25% | $0.52 | $2.08 | 53 | |
| Qwen-Turbo | Alibaba | 131K | 8K | 6% | $0.03 | $0.13 | 81 | |
| R1 Distill Llama 70B | DeepSeek | 131K | 16K | 13% | $0.70 | $0.80 | 48 | |
| Llama 3.3 70B Instruct | Meta | 131K | 16K | 13% | $0.10 | $0.32 | 75 | |
| Mistral Large 2411 | Mistral AI | 131K | — | — | $2.00 | $6.00 | 33 | |
| Mistral Large 2407 | Mistral AI | 131K | — | — | $2.00 | $6.00 | 33 | |
| Pixtral Large 2411 | Mistral AI | 131K | — | — | $2.00 | $6.00 | 33 | |
| Llama 3.2 3B Instruct (free) | Meta | 131K | — | — | Free | Free | 85 | |
| Llama 3.2 11B Vision Instruct | Meta | 131K | 16K | 13% | $0.24 | $0.24 | 65 | |
| Llama 3.1 70B Instruct | Meta | 131K | 16K | 13% | $0.40 | $0.40 | 57 | |
| Mistral Nemo | Mistral AI | 131K | — | — | $0.02 | $0.03 | 83 | |
| Falcon-H1-Arabic 3B Instruct | TII | 131K | 8K | 6% | Free | Free | 85 | |
| Trinity Large Preview | arcee-ai | 131K | — | — | $0.15 | $0.45 | 71 | |
| Granite 4.0 Micro | IBM | 131K | — | — | $0.02 | $0.11 | 83 | |
| Mercury 2 | Inception | 128K | 50K | 39% | $0.25 | $0.75 | 64 | |
| GPT-5.3 Chat | OpenAI | 128K | 16K | 13% | $1.75 | $14.00 | 34 | |
| Solar Pro 3 | Upstage | 128K | — | — | $0.15 | $0.60 | 71 | |
| GPT Audio | OpenAI | 128K | 16K | 13% | $2.50 | $10.00 | 30 | |
| GPT Audio Mini | OpenAI | 128K | 16K | 13% | $0.60 | $2.40 | 51 | |
| GPT-5.2 Chat | OpenAI | 128K | 32K | 25% | $1.75 | $14.00 | 34 | |
| Cogito v2.1 671B | deepcogito | 128K | — | — | $1.25 | $1.25 | 39 | |
| GPT-5.1 Chat | OpenAI | 128K | 16K | 13% | $1.25 | $10.00 | 39 | |
| Nemotron Nano 12B 2 VL (free) | NVIDIA | 128K | 128K | 100% | Free | Free | 85 | |
| Phi 4 Mini Instruct | Microsoft | 128K | 128K | 100% | $0.08 | $0.35 | 76 | |
| Nemotron Nano 9B V2 (free) | NVIDIA | 128K | — | — | Free | Free | 85 | |
| GPT-4o Audio | OpenAI | 128K | 16K | 13% | $2.50 | $10.00 | 30 | |
| GPT-5 Chat | OpenAI | 128K | 16K | 13% | $1.25 | $10.00 | 39 | |
| GLM 4 32B | Zhipu AI | 128K | — | — | $0.10 | $0.10 | 75 | |
| UI-TARS 7B | ByteDance | 128K | 2K | 2% | $0.10 | $0.20 | 75 | |
| Mistral Small 3.2 24B | Mistral AI | 128K | 16K | 13% | $0.07 | $0.20 | 77 | |
| Mistral Small 3.1 24B | Mistral AI | 128K | — | — | $0.35 | $0.56 | 59 | |
| GPT-4o-mini Search Preview | OpenAI | 128K | 16K | 13% | $0.15 | $0.60 | 71 | |
| GPT-4o Search Preview | OpenAI | 128K | 16K | 13% | $2.50 | $10.00 | 30 | |
| Sonar Reasoning Pro | Perplexity | 128K | — | — | $2.00 | $8.00 | 33 | |
| Sonar Deep Research | Perplexity | 128K | — | — | $2.00 | $8.00 | 33 | |
| Command R7B (12-2024) | Cohere | 128K | 4K | 3% | $0.04 | $0.15 | 81 | |
| Nova Micro 1.0 | Amazon | 128K | 5K | 4% | $0.04 | $0.14 | 81 | |
| GPT-4o (2024-11-20) | OpenAI | 128K | 16K | 13% | $2.50 | $10.00 | 30 | |
| Command R+ (08-2024) | Cohere | 128K | 4K | 3% | $2.50 | $10.00 | 30 | |
| Command R (08-2024) | Cohere | 128K | 4K | 3% | $0.15 | $0.60 | 71 | |
| GPT-4o (2024-08-06) | OpenAI | 128K | 16K | 13% | $2.50 | $10.00 | 30 | |
| GPT-4o-mini (2024-07-18) | OpenAI | 128K | 16K | 13% | $0.15 | $0.60 | 71 | |
| GPT-4o-mini | OpenAI | 128K | 16K | 13% | $0.15 | $0.60 | 71 | |
| GPT-4o (2024-05-13) | OpenAI | 128K | 4K | 3% | $5.00 | $15.00 | 24 | |
| GPT-4o | OpenAI | 128K | 16K | 13% | $2.50 | $10.00 | 30 | |
| GPT-4 Turbo | OpenAI | 128K | 4K | 3% | $10.00 | $30.00 | 19 | |
| Mistral Large | Mistral AI | 128K | — | — | $2.00 | $6.00 | 33 | |
| GPT-4 Turbo Preview | OpenAI | 128K | 4K | 3% | $10.00 | $30.00 | 19 | |
| GPT-4 Turbo (older v1106) | OpenAI | 128K | 4K | 3% | $10.00 | $30.00 | 19 | |
| Sonar | Perplexity | 127K | — | — | $1.00 | $1.00 | 42 | |
| ERNIE 4.5 VL 424B A47B | Baidu | 123K | 16K | 13% | $0.42 | $1.25 | 56 | |
| ERNIE 4.5 300B A47B | Baidu | 123K | 12K | 10% | $0.28 | $1.10 | 62 | |
| ERNIE 4.5 21B A3B | Baidu | 120K | 8K | 7% | $0.07 | $0.28 | 77 | |
| Llama 3.2 3B Instruct | Meta | 80K | — | — | $0.05 | $0.34 | 76 | |
| Qianfan-OCR-Fast (free) | Baidu | 66K | 29K | 44% | Free | Free | 80 | |
| MiniMax M2-her | MiniMax | 66K | 2K | 3% | $0.30 | $1.20 | 58 | |
| Olmo 3 32B Think | Allen AI | 66K | 66K | 100% | $0.15 | $0.50 | 67 | |
| GLM 4.5V | Zhipu AI | 66K | 16K | 25% | $0.60 | $1.80 | 48 | |
| Reka Flash 3 | rekaai | 66K | 66K | 100% | $0.10 | $0.20 | 70 | |
| Llama 3.3 70B Instruct (free) | Meta | 66K | — | — | Free | Free | 80 | |
| Mixtral 8x22B Instruct | Mistral AI | 66K | — | — | $2.00 | $6.00 | 31 | |
| WizardLM-2 8x22B | Microsoft | 66K | 8K | 12% | $0.62 | $0.62 | 47 | |
| R1 | DeepSeek | 64K | 16K | 25% | $0.70 | $2.50 | 45 | |
| Llama 3.2 1B Instruct | Meta | 60K | — | — | $0.03 | $0.20 | 76 | |
| Qwen3 30B A3B | Alibaba | 41K | 20K | 49% | $0.09 | $0.45 | 68 | |
| Qwen3 8B | Alibaba | 41K | 8K | 20% | $0.05 | $0.40 | 72 | |
| Qwen3 14B | Alibaba | 41K | 41K | 100% | $0.06 | $0.24 | 71 | |
| Qwen3 32B | Alibaba | 41K | 16K | 40% | $0.08 | $0.28 | 69 | |
| LFM2-24B-A2B | Liquid AI | 33K | — | — | $0.03 | $0.12 | 72 | |
| LFM2.5-1.2B-Thinking (free) | Liquid AI | 33K | — | — | Free | Free | 75 | |
| LFM2.5-1.2B-Instruct (free) | Liquid AI | 33K | — | — | Free | Free | 75 | |
| Rnj 1 Instruct | essentialai | 33K | — | — | $0.15 | $0.15 | 62 | |
| DeepSeek V3.1 | DeepSeek | 33K | 7K | 22% | $0.15 | $0.75 | 62 | |
| Gemma 3n 4B | 33K | — | — | $0.06 | $0.12 | 69 | ||
| Coder Large | arcee-ai | 33K | — | — | $0.50 | $0.80 | 47 | |
| Saba | Mistral AI | 33K | — | — | $0.20 | $0.60 | 59 | |
| Qwen-Max | Alibaba | 33K | 8K | 25% | $1.04 | $4.16 | 37 | |
| Mistral Small 3 | Mistral AI | 33K | 16K | 50% | $0.05 | $0.08 | 70 | |
| R1 Distill Qwen 32B | DeepSeek | 33K | 33K | 100% | $0.29 | $0.29 | 55 | |
| Qwen2.5 Coder 32B Instruct | Alibaba | 33K | — | — | $0.66 | $1.00 | 43 | |
| Qwen2.5 7B Instruct | Alibaba | 33K | 33K | 100% | $0.04 | $0.10 | 71 | |
| Qwen2.5 72B Instruct | Alibaba | 33K | 16K | 50% | $0.36 | $0.40 | 52 | |
| Falcon Arabic 7B Instruct | TII | 33K | 8K | 25% | Free | Free | 75 | |
| Falcon3 10B Instruct | TII | 33K | 8K | 25% | Free | Free | 75 | |
| Falcon3 7B Instruct | TII | 33K | 8K | 25% | Free | Free | 75 | |
| Falcon Mamba 7B Instruct | TII | 33K | 8K | 25% | Free | Free | 75 | |
| Qwen2.5 VL 72B Instruct | Alibaba | 32K | — | — | $0.25 | $0.75 | 57 | |
| ERNIE 4.5 VL 28B A3B | Baidu | 30K | 8K | 27% | $0.14 | $0.56 | 63 | |
| GPT-3.5 Turbo 16k | OpenAI | 16K | 4K | 25% | $3.00 | $4.00 | 23 | |
| GPT-3.5 Turbo | OpenAI | 16K | 4K | 25% | $0.50 | $1.50 | 44 | |
| Reka Edge | rekaai | 16K | 16K | 100% | $0.10 | $0.10 | 62 | |
| Phi 4 | Microsoft | 16K | 16K | 100% | $0.07 | $0.14 | 64 | |
| Llama 3.1 8B Instruct | Meta | 16K | 16K | 100% | $0.02 | $0.05 | 68 | |
| Gemma 2 27B | 8K | 2K | 25% | $0.65 | $0.65 | 38 | ||
| Llama 3 8B Instruct | Meta | 8K | 8K | 100% | $0.04 | $0.04 | 62 | |
| Llama 3 70B Instruct | Meta | 8K | 8K | 98% | $0.51 | $0.74 | 41 | |
| GPT-4 (older v0314) | OpenAI | 8K | 4K | 50% | $30.00 | $60.00 | 11 | |
| GPT-4 | OpenAI | 8K | 4K | 50% | $30.00 | $60.00 | 11 | |
| Inflection 3 Productivity | Inflection | 8K | 1K | 13% | $2.50 | $10.00 | 23 | |
| Inflection 3 Pi | Inflection | 8K | 1K | 13% | $2.50 | $10.00 | 23 | |
| ALLaM 7B Instruct (preview) | HUMAIN | 4K | 4K | 100% | Free | Free | 60 | |
| ALLaM 1 13B Instruct | HUMAIN | 4K | 4K | 100% | $1.80 | $1.80 | 24 | |
| ALLaM 2 7B Instruct | HUMAIN | 4K | 4K | 100% | Free | Free | 60 | |
| ALLaM 34B | HUMAIN | 4K | 4K | 100% | Free | Free | 60 | |
| GPT-3.5 Turbo (older v0613) | OpenAI | 4K | 4K | 100% | $1.00 | $2.00 | 30 | |
| GPT-3.5 Turbo Instruct | OpenAI | 4K | 4K | 100% | $1.50 | $2.00 | 26 | |
| Mistral 7B Instruct v0.1 | Mistral AI | 3K | — | — | $0.11 | $0.19 | 50 | |
| SWE-1.5 | Windsurf | — | — | — | Free | Free | 0 | |
| autofixer-01 | Vercel | — | — | — | Free | Free | 0 | |
| Mellum | JetBrains | — | — | — | Free | Free | 0 |
| Model | Provider | Input $/1M | Output $/1M | Capabilities |
|---|---|---|---|---|
| GPT-5 Image | OpenAI | $10.00 | $10.00 | |
| GPT-5.4 Image 2 | OpenAI | $8.00 | $15.00 | |
| GPT-5 Image Mini | OpenAI | $2.50 | $2.00 | |
| Nano Banana Pro (Gemini 3 Pro Image Preview) | $2.00 | $12.00 | ||
| Nano Banana 2 (Gemini 3.1 Flash Image Preview) | $0.50 | $3.00 | ||
| Nano Banana (Gemini 2.5 Flash Image) | $0.30 | $2.50 | ||
| Midjourney v6.1 | Midjourney | Free | Free | |
| DALL-E 3 | OpenAI | Free | $40000.00 | |
| Stable Diffusion 3.5 | Stability AI | Free | $35000.00 | |
| FLUX.1 Pro | Black Forest Labs | Free | $50000.00 | |
| Ideogram 2.0 | Ideogram | Free | $80000.00 | |
| Recraft V3 | Recraft | Free | $40000.00 | |
| Imagen 3 | Free | $40000.00 | ||
| Adobe Firefly 3 | Adobe | Free | Free | |
| Leonardo Phoenix | Leonardo AI | Free | Free |
AI speed is measured by time-to-first-token (TTFT) and tokens-per-second (TPS). TTFT measures how quickly the model starts responding. TPS measures how fast it generates output. Both matter for different use cases.
Speed varies by provider. Groq-hosted Llama achieves the fastest inference. Among major providers, Gemini Flash and GPT-4o Mini are consistently fast. Reasoning models like o3 and R1 are intentionally slower for better accuracy.
Smaller, faster models may sacrifice some quality. However, provider optimizations (quantization, speculative decoding) can speed up models without quality loss. The same model runs at different speeds on different providers.