分析 300 个AI模型的价格分布,以及哪些服务商提供最优惠的价格。对比输入与输出成本、价格层级,并在每个预算范围内找到评分最高的模型。
| 层级 | 数量 | 平均评分 |
|---|---|---|
| Free | 23 | 65 |
| Ultra-Budget | 109 | 65 |
| Budget | 81 | 69 |
| Mid-Range | 68 | 73 |
| Premium | 9 | 66 |
| Enterprise | 10 | 76 |
I/O比 = 输出价格 / 输入价格。3.0x 的比率意味着输出令牌的成本是输入令牌的3倍。
| 模型 | $/M In | $/M Out | 比率 |
|---|---|---|---|
| Qwen3 Next 80B A3B Instruct | $0.09 | $1.10 | 12.2x |
| Qwen3 VL 30B A3B Thinking | $0.13 | $1.56 | 12.0x |
| Qwen3 VL 8B Thinking | $0.12 | $1.36 | 11.7x |
| Palmyra X5 | $0.60 | $6.00 | 10.0x |
| Qwen3 VL 235B A22B Thinking | $0.26 | $2.60 | 10.0x |
| Qwen3 235B A22B Thinking 2507 | $0.15 | $1.50 | 10.0x |
| Nova 2 Lite | $0.30 | $2.50 | 8.3x |
| Gemini 2.5 Flash | $0.30 | $2.50 | 8.3x |
| Seed-2.0-Lite | $0.25 | $2.00 | 8.0x |
| GPT-5.3 Chat | $1.75 | $14.00 | 8.0x |
| 模型 | $/M In | $/M Out | 比率 |
|---|---|---|---|
| Reka Edge | $0.20 | $0.20 | 1.0x |
| Rnj 1 Instruct | $0.15 | $0.15 | 1.0x |
| Ministral 3 14B 2512 | $0.20 | $0.20 | 1.0x |
| Ministral 3 8B 2512 | $0.15 | $0.15 | 1.0x |
| Ministral 3 3B 2512 | $0.10 | $0.10 | 1.0x |
| Cogito v2.1 671B | $1.25 | $1.25 | 1.0x |
| Spotlight | $0.18 | $0.18 | 1.0x |
| Llama Guard 4 12B | $0.18 | $0.18 | 1.0x |
| Qwen2.5 VL 72B Instruct | $0.80 | $0.80 | 1.0x |
| R1 Distill Qwen 32B | $0.29 | $0.29 | 1.0x |
在每个平均成本阈值内评分最高的模型。
| 最高 $/1M | 模型 | 评分 | 实际成本 |
|---|---|---|---|
| $0.10 | Qwen3.5-9B | 85 | $0.10/1M |
| $0.50 | Grok 4.1 Fast | 87 | $0.35/1M |
| $1.00 | Grok 4.1 Fast | 87 | $0.35/1M |
| $2.00 | Gemini 3 Flash Preview | 89 | $1.75/1M |
| $5.00 | GPT-5.4 Mini | 93 | $2.63/1M |
| $10.00 | GPT-5.4 | 94 | $8.75/1M |
| $20.00 | GPT-5.4 | 94 | $8.75/1M |
| $50.00 | GPT-5.4 | 94 | $8.75/1M |
| 提供商 | 模型 | 平均成本 |
|---|---|---|
| Liquid AI | 5 | $0.02 |
| StepFun | 2 | $0.10 |
| Meta | 14 | $0.17 |
| Allen AI | 4 | $0.29 |
| NVIDIA | 11 | $0.33 |
| Microsoft | 2 | $0.36 |
| Baidu | 5 | $0.45 |
| Inception | 3 | $0.50 |
| ByteDance | 5 | $0.57 |
| arcee-ai | 7 | $0.57 |
| Alibaba | 50 | $0.63 |
| MiniMax | 8 | $0.67 |
| DeepSeek | 11 | $0.68 |
| Xiaomi | 3 | $1.13 |
| Mistral AI | 25 | $1.16 |
| Moonshot AI | 4 | $1.30 |
| 23 | $1.60 | |
| Amazon | 5 | $2.23 |
| aion-labs | 3 | $2.75 |
| Cursor | 2 | $3.00 |
| Cohere | 4 | $3.24 |
| xAI | 10 | $3.73 |
| Perplexity | 5 | $5.80 |
| Inflection | 2 | $6.25 |
| Anthropic | 13 | $14.55 |
| OpenAI | 60 | $18.26 |
We categorize models into tiers based on average cost per million tokens: Free (zero cost), Budget (under $1), Mid ($1-$10), Premium ($10-$50), and Enterprise (over $50). This tiering helps developers quickly find models that match their budget constraints.
The I/O ratio compares the cost of output tokens to input tokens. A ratio of 3x means output costs three times more than input. This matters because output-heavy workloads (like content generation) will cost more with high-ratio models, while input-heavy workloads (like analysis) are less affected.
Use the "Best Value at Each Price Point" table to find the highest-scoring model within your budget. Set your maximum cost threshold, and the table shows which model delivers the best quality at or below that price. You can also check the scatter chart to visually identify models with the best score-to-cost ratio.