Best AI Models for Warp

CLI Agent

Warp is an AI-powered terminal with built-in command suggestions and natural language to shell translation. Fast, cheap models with strong instruction following work best.

Last updated: 57m ago

MiMo-V2.5-Pro

Xiaomi

Tool Score

Output $/M

$0.870

Arena Elo

1466

Gemma 4 31B

Google

Tool Score

Output $/M

$0.350

Arena Elo

1451

GLM 5.1

Zhipu AI

Tool Score

Output $/M

$3.08

Arena Elo

1475

What Matters for Warp

StreamingFunction CallingCost EfficientStrong Coding

Best Models for Warp

Top 15 by tool-optimized score

LMMarketCap.com

All Models Ranked for Warp (316 models)

Scored by: benchmark performance (90%) from MMLU, GPQA, HumanEval, SWE-bench, and 15+ standardized evaluations, with capabilities and context as tiebreakers (10%).

#	Model	Provider	Score	Coding	Caps	Output $/M	Context
1	MiMo-V2.5-Pro Arena Elo: 1466	Xiaomi	89	94	100%	$0.870	1.0M
2	Gemma 4 31B Arena Elo: 1451	Google	88	92	100%	$0.350	262K
3	GLM 5.1 Arena Elo: 1475	Zhipu AI	87	96	100%	$3.08	203K
4	Gemma 4 26B A4B Arena Elo: 1438	Google	87	90	100%	$0.330	262K
5	DeepSeek V4 Flash Arena Elo: 1433	DeepSeek	86	89	100%	$0.180	1.0M
6	MiMo-V2.5 Arena Elo: 1433	Xiaomi	86	89	100%	$0.280	1.0M
7	Kimi K2.6 Arena Elo: 1460	Moonshot AI	86	93	100%	$3.41	262K
8	DeepSeek V3.2 Exp Arena Elo: 1423	DeepSeek	86	87	100%	$0.410	164K
9	Llama 3.3 70B Instruct HumanEval: 88.4%	Meta	86	88	100%	$0.320	131K
10	GPT-4o-mini HumanEval: 87.2%	OpenAI	86	87	100%	$0.600	128K
11	MiniMax M3 Arena Elo: 1448	MiniMax	85	91	100%	$1.20	1.0M
12	Grok 4.3 Arena Elo: 1446	xAI	85	91	100%	$2.50	1.0M
13	Hy3 preview Arena Elo: 1413	Tencent	85	86	100%	$0.210	262K
14	Gemma 4 31B (free)	Google	85	80	100%	Free	262K
15	MiniMax M2.7 Arena Elo: 1417	MiniMax	85	86	100%	$0.960	205K
16	GLM 5 Arena Elo: 1457	Zhipu AI	85	93	100%	$1.92	203K
17	Qwen3 VL 235B A22B Instruct Arena Elo: 1415	Alibaba	85	86	100%	$0.880	262K
18	DeepSeek V3.1 Terminus Arena Elo: 1416	DeepSeek	85	86	100%	$0.950	164K
19	DeepSeek V3.1 Arena Elo: 1417	DeepSeek	85	86	100%	$0.790	164K
20	Gemini 3.5 Flash Arena Elo: 1477	Google	84	96	100%	$9.00	1.0M
21	Qwen3.6 Plus Arena Elo: 1444	Alibaba	84	91	100%	$1.95	1.0M
22	Qwen3.5 397B A17B Arena Elo: 1444	Alibaba	84	91	100%	$2.45	256K
23	GLM 4.7 Arena Elo: 1443	Zhipu AI	84	91	100%	$1.75	203K
24	GPT-5.2 Chat Arena Elo: 1475	OpenAI	84	96	100%	$14.00	128K
25	DeepSeek V3 0324 HumanEval: 84.5%	DeepSeek	84	85	100%	$0.770	164K
26	Qwen3.6 Max Preview Arena Elo: 1461	Alibaba	83	94	100%	$6.24	262K
27	Grok 4.20	xAI	83	88	100%	$2.50	2.0M
28	Gemini 3.1 Flash Lite Preview Arena Elo: 1432	Google	83	89	100%	$1.50	1.0M
29	Qwen3.5-Flash Arena Elo: 1397	Alibaba	83	83	100%	$0.260	1.0M
30	Step 3.5 Flash Arena Elo: 1395	StepFun	83	83	100%	$0.300	262K
31	GPT-5.1-Codex-Mini	OpenAI	83	87	100%	$2.00	400K
32	GLM 4.6 Arena Elo: 1425	Zhipu AI	83	88	100%	$1.74	203K
33	DeepSeek V4 Pro SWE-bench: 80.6%	DeepSeek	82	81	100%	$0.870	1.0M
34	Qwen3.5-122B-A10B Arena Elo: 1417	Alibaba	82	86	100%	$2.08	262K
35	Gemini 3.1 Pro Preview Custom Tools	Google	82	92	100%	$12.00	1.0M
36	GLM 4.6V Arena Elo: 1377	Zhipu AI	82	80	100%	$0.900	131K
37	GLM 4.5 Arena Elo: 1411	Zhipu AI	82	85	100%	$2.20	131K
38	Llama 3.1 70B Instruct HumanEval: 80.5%	Meta	82	81	100%	$0.400	131K
39	Mistral Large HumanEval: 92%	Mistral AI	82	92	100%	$6.00	128K
40	Claude Fable 5 SWE-bench: 95%	Anthropic	81	95	100%	$50.00	1.0M
41	Gemma 4 26B A4B (free)	Google	81	73	100%	Free	262K
42	Trinity Large Thinking Arena Elo: 1369	arcee-ai	81	78	100%	$0.800	262K
43	Qwen3.5-27B Arena Elo: 1409	Alibaba	81	85	100%	$1.56	262K
44	GLM 4.7 Flash Arena Elo: 1368	Zhipu AI	81	78	100%	$0.400	203K
45	GPT-5.2-Codex	OpenAI	81	90	100%	$14.00	400K
46	Gemini 2.5 Flash Lite Preview 09-2025	Google	81	79	100%	$0.400	1.0M
47	Qwen3 Next 80B A3B Thinking Arena Elo: 1370	Alibaba	81	78	100%	$0.780	262K
48	Qwen3 Next 80B A3B Instruct Arena Elo: 1402	Alibaba	81	84	100%	$1.10	262K
49	GLM 4.5 Air Arena Elo: 1373	Zhipu AI	81	79	100%	$0.850	131K
50	Gemini 2.5 Flash Lite	Google	81	79	100%	$0.400	1.0M

More Tool Rankings

Cursor Claude Code Windsurf GitHub Copilot Aider Cline Roo Code Open WebUI Continue Zed Lovable

Best for Coding Best for Reasoning Compare Models

Frequently Asked Questions

Based on our analysis of coding benchmarks, capability matching, and pricing, MiMo-V2.5-Pro currently ranks #1 for Warp. Rankings are rebuilt as benchmark, pricing, and provider data refresh.

We score models using benchmark performance (90%) from LMArena, HumanEval, SWE-bench, MMLU, and 15+ standardized evaluations. Capabilities and context serve as tiebreakers (10%). Only models with the capabilities Warp needs are included in the tool-specific rankings.

We currently track 316 AI models compatible with Warp. This includes models from OpenAI, Anthropic, Google, DeepSeek, and other providers accessible via API.

Many open-source models are compatible with Warp through API providers like OpenRouter, Together AI, and Groq. Check our rankings to see which open-source models perform best.

Rankings refresh whenever the underlying benchmark, pricing, and catalog sources refresh. That means some signals update faster than others, and the page reflects the latest verified source data available.