Five major Chinese AI labs - Alibaba (Qwen), DeepSeek, Zhipu AI (GLM), MiniMax, and ByteDance (Seed) - ship 88+ models that compete head-to-head with US frontier models. Most are open-weight. This page ranks the flagship models from each lab, then lists top multilingual frontier models that support Chinese.
Top models from five major Chinese AI labs. Ranked by published benchmarks, architectural innovation, and availability. 10 flagships listed.
Qwen3.6 Plus is Alibaba's latest frontier model, with a 1M-token context window. It tops OmniDocBench v1.5 (91.2) and Terminal-Bench 2.0 (61.6%, beating Claude Opus 4.5), and combines vision, reasoning, and agentic capabilities. API-only access.
Qwen3.5 397B A17B is the largest open-weight Qwen model, at 397B total parameters with 17B activated per token (MoE). Apache 2.0 licensed. Hybrid linear attention + sparse MoE architecture with 262K context. Strong multimodal and reasoning capabilities.
DeepSeek V3.2 achieves Arena Elo 1424 (highest among all tested models at release). 671B total / 37B active MoE with Multi-head Latent Attention (MLA). AIME 96% (beats GPT-5-High), IMO gold medal level. Fully open-weight.
DeepSeek R1 uses reinforcement learning for explicit chain-of-thought reasoning. Scored 96% on the Chinese National Medical Licensing Exam (vs GPT-o1 Pro at 75%). AIME 2025 87.5%, LiveCodeBench 84.4%. Fully open-weight with distilled variants at 7B, 14B, 32B, and 70B.
GLM 5.1 is a 744B MoE model (40B active) that tops SWE-Bench Pro at 58.4% (beating GPT-5.4 and Claude Opus 4.6). Trained entirely on Huawei Ascend 910B chips, demonstrating Chinese hardware independence. MMLU-Pro 89%. API-only.
GLM 4.5 is an open-weight 355B MoE model (32B active) with 131K context. MATH 500 at 98.2% (ties Claude Opus 4.6), AIME24 91%. Industry-best tool-use scores on tau-Bench and BFCL v3. 2.5-8x faster inference than comparable models. Apache 2.0 licensed.
MiniMax M2.7 introduces "self-evolution" - automated ML research that handles 30-50% of the RL workflow. SWE-Pro 56.2%, Terminal-Bench 2.0 57%. 204K context. Achieved 66.6% medal rate across 22 ML competitions. API-only.
MiniMax M2.5 is an open-weight model with Arena Elo 1403 and SWE-bench Verified 75.8%. 196K context. 37% faster end-to-end runtime than M2-her. Available on Hugging Face with a free tier variant.
Seed 2.0 Lite is ByteDance's multimodal model with 262K context, adaptive deep thinking, and sparse MoE architecture. Strong on document understanding (OmniDocBench, DUDE, MMLongBench) and video comprehension (TVBench, TempCompass). Surpasses human-level on EgoTempo. API-only.
Qwen3 Coder 480B A35B is the largest Qwen Coder model, at 480B total / 35B active parameters (MoE). Purpose-built for software engineering with 262K context. Part of Alibaba's code-specific model line alongside Coder Flash (1M context) and Coder Plus.
Top frontier models whose multilingual training mix supports Chinese. Ranked by composite score. Capped at 2 models per provider family.
Over 90% of top Chinese models use Mixture-of-Experts, activating only 5-10% of total parameters per token. DeepSeek pioneered Multi-head Latent Attention (MLA) for KV cache compression, now adopted across the industry.
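The "5-10% activated" figure can be checked directly from the parameter counts quoted on this page; a quick sketch (the total/active figures are the ones cited in this article):

```python
# Activation ratios for the MoE models cited on this page:
# (total params in billions, active params in billions)
models = {
    "DeepSeek V3.2": (671, 37),
    "Qwen3.5 397B A17B": (397, 17),
    "GLM 5.1": (744, 40),
}

fractions = {name: active / total for name, (total, active) in models.items()}
for name, frac in fractions.items():
    print(f"{name}: {frac:.1%} of parameters active per token")
```

Qwen3.5's ratio actually lands slightly under 5%, which is part of why its per-token pricing undercuts similarly sized dense models.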
Chinese labs release more open-weight frontier models than any other region. DeepSeek V3.2, Qwen3 series (Apache 2.0), GLM-4.5/5, and MiniMax M2.5 are all freely downloadable. This enables local deployment without API dependency.
MoE architecture plus aggressive pricing makes Chinese models 3-10x cheaper than US equivalents at comparable quality. DeepSeek V3.2 input costs $0.26/M vs Claude Opus at $15/M. Qwen models start at $0.03/M input.
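To make the gap concrete, here is the input-side cost of a single large prompt at the two rates quoted above (a hypothetical 100K-token request, used purely for illustration):

```python
def token_cost(tokens, usd_per_million_tokens):
    """Cost in USD for a given token count at a per-million-token rate."""
    return tokens / 1e6 * usd_per_million_tokens

# A 100K-token prompt at the input rates quoted above
deepseek_in = token_cost(100_000, 0.26)  # DeepSeek V3.2 -> $0.026
opus_in = token_cost(100_000, 15.00)     # Claude Opus   -> $1.50
print(f"~{opus_in / deepseek_in:.0f}x cheaper on input")  # ~58x
```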
All Chinese-origin models with published pricing, sorted by input cost (lowest first). Updated hourly.
| Model | Provider | In $/M | Out $/M |
|---|---|---|---|
| Qwen2.5 Coder 7B Instruct | Alibaba | $0.030 | $0.090 |
| Qwen-Turbo | Alibaba | $0.033 | $0.130 |
| Qwen2.5 7B Instruct | Alibaba | $0.040 | $0.100 |
| Qwen3.5-9B | Alibaba | $0.050 | $0.150 |
| Qwen3 8B | Alibaba | $0.050 | $0.400 |
| GLM 4.7 Flash | Zhipu AI | $0.060 | $0.400 |
| Qwen3 14B | Alibaba | $0.060 | $0.240 |
| Qwen3.5-Flash | Alibaba | $0.065 | $0.260 |
| Qwen3 Coder 30B A3B Instruct | Alibaba | $0.070 | $0.270 |
| Qwen3 235B A22B Instruct 2507 | Alibaba | $0.071 | $0.100 |
| Seed 1.6 Flash | ByteDance | $0.075 | $0.300 |
| Qwen3 VL 8B Instruct | Alibaba | $0.080 | $0.500 |
| Qwen3 30B A3B Thinking 2507 | Alibaba | $0.080 | $0.400 |
| Qwen3 30B A3B | Alibaba | $0.080 | $0.280 |
| Qwen3 32B | Alibaba | $0.080 | $0.240 |
| Tongyi DeepResearch 30B A3B | Alibaba | $0.090 | $0.450 |
| Qwen3 Next 80B A3B Instruct | Alibaba | $0.090 | $1.10 |
| Qwen3 30B A3B Instruct 2507 | Alibaba | $0.090 | $0.300 |
| Qwen3 Next 80B A3B Thinking | Alibaba | $0.098 | $0.780 |
| Seed-2.0-Mini | ByteDance | $0.100 | $0.400 |
| GLM 4 32B | Zhipu AI | $0.100 | $0.100 |
| UI-TARS 7B | ByteDance | $0.100 | $0.200 |
| Qwen3 VL 32B Instruct | Alibaba | $0.104 | $0.416 |
| Qwen3 VL 8B Thinking | Alibaba | $0.117 | $1.36 |
| MiniMax M2.5 | MiniMax | $0.118 | $0.990 |
| Qwen3 Coder Next | Alibaba | $0.120 | $0.750 |
| Qwen2.5 72B Instruct | Alibaba | $0.120 | $0.390 |
| Qwen3 VL 30B A3B Thinking | Alibaba | $0.130 | $1.56 |
| Qwen3 VL 30B A3B Instruct | Alibaba | $0.130 | $0.520 |
| GLM 4.5 Air | Zhipu AI | $0.130 | $0.850 |
| Qwen VL Plus | Alibaba | $0.137 | $0.410 |
| Qwen3 235B A22B Thinking 2507 | Alibaba | $0.150 | $1.50 |
| DeepSeek V3.1 | DeepSeek | $0.150 | $0.750 |
| QwQ 32B | Alibaba | $0.150 | $0.580 |
| Qwen3.5-35B-A3B | Alibaba | $0.163 | $1.30 |
| Qwen3.5-27B | Alibaba | $0.195 | $1.56 |
| Qwen3 Coder Flash | Alibaba | $0.195 | $0.975 |
| Qwen3 VL 235B A22B Instruct | Alibaba | $0.200 | $0.880 |
| Qwen2.5 VL 32B Instruct | Alibaba | $0.200 | $0.600 |
| DeepSeek V3 0324 | DeepSeek | $0.200 | $0.770 |
| MiniMax-01 | MiniMax | $0.200 | $1.10 |
| DeepSeek V3.1 Terminus | DeepSeek | $0.210 | $0.790 |
| Qwen3 Coder 480B A35B | Alibaba | $0.220 | $1.00 |
| Seed-2.0-Lite | ByteDance | $0.250 | $2.00 |
| Seed 1.6 | ByteDance | $0.250 | $2.00 |
| MiniMax M2 | MiniMax | $0.255 | $1.00 |
| Qwen3.5-122B-A10B | Alibaba | $0.260 | $2.08 |
| Qwen3.5 Plus 2026-02-15 | Alibaba | $0.260 | $1.56 |
| DeepSeek V3.2 | DeepSeek | $0.260 | $0.380 |
| Qwen3 VL 235B A22B Thinking | Alibaba | $0.260 | $2.60 |
| Qwen Plus 0728 (thinking) | Alibaba | $0.260 | $0.780 |
| Qwen Plus 0728 | Alibaba | $0.260 | $0.780 |
| Qwen-Plus | Alibaba | $0.260 | $0.780 |
| DeepSeek V3.2 Exp | DeepSeek | $0.270 | $0.410 |
| MiniMax M2.1 | MiniMax | $0.290 | $0.950 |
| R1 Distill Qwen 32B | DeepSeek | $0.290 | $0.290 |
| MiniMax M2.7 | MiniMax | $0.300 | $1.20 |
| MiniMax M2-her | MiniMax | $0.300 | $1.20 |
| GLM 4.6V | Zhipu AI | $0.300 | $0.900 |
| DeepSeek V3 | DeepSeek | $0.320 | $0.890 |
| Qwen3.6 Plus | Alibaba | $0.325 | $1.95 |
| Qwen3.5 397B A17B | Alibaba | $0.390 | $2.34 |
| GLM 4.7 | Zhipu AI | $0.390 | $1.75 |
| GLM 4.6 | Zhipu AI | $0.390 | $1.90 |
| DeepSeek V3.2 Speciale | DeepSeek | $0.400 | $1.20 |
| MiniMax M1 | MiniMax | $0.400 | $2.20 |
| R1 0528 | DeepSeek | $0.450 | $2.15 |
| Qwen3 235B A22B | Alibaba | $0.455 | $1.82 |
| Qwen VL Max | Alibaba | $0.520 | $2.08 |
| GLM 4.5V | Zhipu AI | $0.600 | $1.80 |
| GLM 4.5 | Zhipu AI | $0.600 | $2.20 |
| Qwen3 Coder Plus | Alibaba | $0.650 | $3.25 |
| Qwen2.5 Coder 32B Instruct | Alibaba | $0.660 | $1.00 |
| R1 Distill Llama 70B | DeepSeek | $0.700 | $0.800 |
| R1 | DeepSeek | $0.700 | $2.50 |
| GLM 5 | Zhipu AI | $0.720 | $2.30 |
| Qwen3 Max Thinking | Alibaba | $0.780 | $3.90 |
| Qwen3 Max | Alibaba | $0.780 | $3.90 |
| Qwen2.5 VL 72B Instruct | Alibaba | $0.800 | $0.800 |
| GLM 5.1 | Zhipu AI | $0.950 | $3.15 |
| Qwen-Max | Alibaba | $1.04 | $4.16 |
| GLM 5V Turbo | Zhipu AI | $1.20 | $4.00 |
| GLM 5 Turbo | Zhipu AI | $1.20 | $4.00 |
It depends on the use case. For raw benchmark performance, DeepSeek V3.2 holds the highest Arena Elo (1424) among all tested models and is fully open-weight. For 1M-context workloads, Qwen3.6 Plus leads (MMLU-Pro 88.5%). For reasoning tasks, DeepSeek R1 scored 96% on the Chinese National Medical Licensing Exam. For software engineering, GLM 5.1 tops SWE-Bench Pro at 58.4%. For agentic self-improvement, MiniMax M2.7 automates 30-50% of ML research workflows. All five labs have competitive flagships.
Most are. DeepSeek releases all models (V3, V3.1, V3.2, R1) as open-weight under permissive licenses. Alibaba's Qwen3 series is Apache 2.0 licensed, including the 397B flagship. Zhipu AI open-sources GLM-4.5, GLM-4.6, GLM-4.7, and GLM-5 on Hugging Face. MiniMax offers M2.5 (free variant available) and MiniMax-01 as open-weight. The main closed-source exceptions are Qwen3.6 Plus, GLM-5.1, MiniMax M2.7, and ByteDance Seed.
Chinese-origin models consistently outperform multilingual frontier models on Chinese-specific benchmarks like C-Eval, CMMLU, and SuperCLUE because they were trained with significantly more Chinese-language data. DeepSeek R1 scored 96% on the Chinese medical licensing exam versus 75% for GPT-o1 Pro. However, on English reasoning benchmarks like MMLU and GPQA, the gap has narrowed - DeepSeek V3.2 and Qwen3.6 Plus now match or exceed GPT-5 on several tasks.
Mixture-of-Experts (MoE) activates only a fraction of total parameters per token, which dramatically reduces inference cost. DeepSeek V3.2 has 671B total parameters but only activates 37B per token. Qwen3.5-397B activates 17B. GLM-5.1 has 744B total with 40B active. This architectural choice lets Chinese labs build models that match or exceed the performance of dense US models while running at a fraction of the compute cost. DeepSeek pioneered Multi-head Latent Attention (MLA) which further compresses the KV cache.
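A back-of-the-envelope comparison shows why this matters for serving cost. Per-token inference FLOPs scale roughly with active (not total) parameters; the ~2-FLOPs-per-active-parameter rule of thumb below is a common approximation, not a figure from this page:

```python
def flops_per_token(active_params_billion):
    # Rough rule of thumb: ~2 FLOPs per active parameter per generated token
    return 2 * active_params_billion * 1e9

dense_equivalent = flops_per_token(671)  # hypothetical dense model of V3.2's total size
deepseek_v32 = flops_per_token(37)       # DeepSeek V3.2 activates 37B per token
print(f"compute ratio: {dense_equivalent / deepseek_v32:.1f}x")  # ~18x
```

In other words, at equal total parameter count, the MoE model does roughly one-eighteenth the arithmetic per token, which is where most of the pricing headroom comes from.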
Qwen3.6 Plus and Qwen3 Coder Flash both support 1M tokens (roughly 750,000 words). MiniMax-01 supports 1M+ tokens. Most DeepSeek models support 163K tokens. Zhipu AI GLM models range from 131K to 202K tokens. ByteDance Seed models support 262K tokens. For comparison, Claude supports up to 200K and GPT-5 supports 128K-1M depending on variant.
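The token-to-word conversion above uses the common ~0.75 words-per-token heuristic for English text. A small helper (the ratio is a rough assumption and varies by tokenizer and language; Chinese text tokenizes quite differently):

```python
def words_to_tokens(words, words_per_token=0.75):
    """Rough token-count estimate for English text."""
    return round(words / words_per_token)

print(words_to_tokens(750_000))  # 750K words -> 1,000,000 tokens
```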
Yes. Most Chinese labs publish open-weight models on Hugging Face and GitHub that you can run with standard tools (vLLM, llama.cpp, Ollama, transformers). DeepSeek R1 Distill Qwen 32B and Qwen3-32B are popular choices for local deployment. Smaller variants like Qwen3-8B, GLM-4.5 Air (12B active), and DeepSeek R1 Distill 7B run on consumer GPUs. Note that MoE reduces compute per token, not weight memory: a model like Qwen3.5-397B still needs all 397B parameters resident in VRAM (or offloaded to CPU RAM), even though only 17B are used for any given token.
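As a rough sizing guide for local deployment, weight memory scales with total parameter count and quantization level. A sketch (weights only; real usage also needs headroom for the KV cache and activations):

```python
def weight_memory_gb(params_billion, bits_per_param):
    """Approximate GB needed just to hold the model weights."""
    return params_billion * 1e9 * bits_per_param / 8 / 1e9

print(f"Qwen3-32B @ BF16:   {weight_memory_gb(32, 16):.0f} GB")  # 64 GB
print(f"Qwen3-32B @ 4-bit:  {weight_memory_gb(32, 4):.0f} GB")   # 16 GB
print(f"7B distill @ 4-bit: {weight_memory_gb(7, 4):.1f} GB")    # 3.5 GB
```

This is why the 4-bit quantized 7B-32B variants are the practical choices for consumer GPUs, while the 100B+ flagships remain multi-GPU or CPU-offload territory.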