Qwen3-8B is a dense 8.2B parameter causal language model from the Qwen3 series, designed for both reasoning-heavy tasks and efficient dialogue. It supports seamless switching between "thinking" mode for math, coding, and logical inference, and "non-thinking" mode for general conversation. The model is fine-tuned for instruction-following, agent integration, creative writing, and multilingual use across 100+ languages and dialects. It natively supports a 32K token context window and can extend to 131K tokens with YaRN scaling.
| 信号 | 强度 | 权重 | 影响 |
|---|---|---|---|
| Benchmarksjust now | 60 | 30% | +17.9 |
| Capabilitiesjust now | 67 | 20% | +13.3 |
| Recencyjust now | 73 | 15% | +10.9 |
| Context Windowjust now | 73 | 10% | +7.3 |
| Output Capacityjust now | 65 | 10% | +6.5 |
| Pricingjust now | 0 | 15% | +0.1 |
社区和从业者反馈在基准测试和价格之上增加了真实世界的信号。
Share your experience with Qwen3 8B and help the community make better decisions.
成本估算器
每月比类别平均节省$40.99
来自已验证的来源。