300 个AI模型的性价比分析。找到每一美元带来最高性能的模型,按价格层级、服务商和质量与实惠的最佳平衡点进行分类。
| # | 模型 | 评分 |
|---|---|---|
| 1 | Gemma 4 31B (free)Google | 81 |
| 2 | MiniMax M2.5 (free)MiniMax | 78 |
| 3 | Gemma 4 26B A4B (free)Google | 73 |
| 4 | GLM 4.5 Air (free)Zhipu AI | 71 |
| 5 | Qwen3 Next 80B A3B Instruct (free)Alibaba | 67 |
| 6 | Llama 3.3 70B Instruct (free)Meta | 66 |
| 7 | Trinity Large Thinking (free)arcee-ai | 64 |
| 8 | gpt-oss-20b (free)OpenAI | 57 |
| 9 | Ring-2.6-1T (free)inclusionai | 40 |
| 10 | CoBuddy (free)Baidu | 40 |
| 11 | Nemotron 3 Nano Omni (free)NVIDIA | 40 |
| 12 | Laguna XS.2 (free)poolside | 40 |
| 13 | Laguna M.1 (free)poolside | 40 |
| 14 | Qianfan-OCR-Fast (free)Baidu | 40 |
| 15 | Lyria 3 Pro PreviewGoogle | 40 |
| 16 | Lyria 3 Clip PreviewGoogle | 40 |
| 17 | Nemotron 3 Super (free)NVIDIA | 40 |
| 18 | LFM2.5-1.2B-Thinking (free)Liquid AI | 40 |
| 19 | LFM2.5-1.2B-Instruct (free)Liquid AI | 40 |
| 20 | Nemotron 3 Nano 30B A3B (free)NVIDIA | 40 |
| 21 | Nemotron Nano 12B 2 VL (free)NVIDIA | 40 |
| 22 | Nemotron Nano 9B V2 (free)NVIDIA | 40 |
| 23 | Qwen3 Coder 480B A35B (free)Alibaba | 40 |
性价比评分前20%且综合评分前50%的模型。两全其美的选择。
| 提供商 | 模型 | 平均评分 | 平均性价比 |
|---|---|---|---|
| IBM | 2 | 40 | 581.6 |
| Microsoft | 2 | 56 | 416.2 |
| Meta | 8 | 54 | 333.4 |
| rekaai | 2 | 40 | 333.3 |
| NVIDIA | 4 | 45 | 277.6 |
| Tencent | 2 | 55 | 268.0 |
| 19 | 72 | 223.3 | |
| arcee-ai | 6 | 48 | 178.2 |
| Amazon | 5 | 44 | 158.5 |
| Mistral AI | 18 | 46 | 144.5 |
The value score is computed as composite quality score divided by average cost per million tokens. A model scoring 80 at $0.50/1M tokens has a value score of 160, while a model scoring 90 at $15/1M tokens has a value score of 6. Higher values mean more quality per dollar spent. Free models are excluded from value rankings since division by zero is undefined.
Sweet Spot models sit in the top 20% for value score AND the top 50% for composite quality score. They represent the best of both worlds: genuinely high-quality models that also deliver excellent value for money. These are the models most likely to satisfy both quality requirements and budget constraints.
Budget tier models (under $1/1M tokens) typically offer the highest value scores because even moderate quality at very low cost produces a strong ratio. However, the sweet spot analysis shows that some mid-tier models deliver the best combination of absolute quality and value. The ideal choice depends on whether you prioritize raw quality or cost efficiency.