Best AI Models for Agents

Building AI agents? These 258 models support function calling and are ranked by their agentic capability coverage - tool use, reasoning, JSON output, streaming, vision, and web search.

258 Agent-Ready
177 + Reasoning
223 + JSON Mode
+ Web Search
Free

Full-Stack Agent Models

5+ agentic capabilities

#	Model	Provider	Score	Caps	$/1M Out
1	Claude Fable 5	Anthropic	97	6	$50.00
2	Claude Opus 4.7 (Fast)	Anthropic	95	6	$150.00
3	Claude Opus 4.7	Anthropic	95	6	$25.00
4	Claude Opus 4.8 (Fast)	Anthropic	94	6	$50.00
5	Claude Opus 4.8	Anthropic	94	6	$25.00
6	GPT-5.5	OpenAI	92	6	$30.00
7	Gemini 3.1 Pro Preview Custom Tools	Google	92	6	$12.00
8	Gemini 3.1 Pro Preview	Google	92	6	$12.00
9	GPT-5.4 Pro	OpenAI	92	6	$180.00
10	GPT-5.4	OpenAI	92	6	$15.00
11	GPT-5.5 Pro	OpenAI	90	6	$180.00
12	GPT-5.2-Codex	OpenAI	90	6	$14.00
13	GPT-5.2 Pro	OpenAI	90	6	$168.00
14	GPT-5.2	OpenAI	90	6	$14.00
15	Claude Opus 4.6 (Fast)	Anthropic	90	6	$150.00
16	Claude Opus 4.6	Anthropic	90	6	$25.00
17	Grok 4.20	xAI	88	6	$2.50
18	GPT-5.3-Codex	OpenAI	88	6	$14.00
19	GPT-5 Pro	OpenAI	88	6	$120.00
20	GPT-5 Codex	OpenAI	88	6	$10.00
21	GPT-5	OpenAI	88	6	$10.00
22	Gemini 3 Flash Preview	Google	88	6	$3.00
23	GPT-5.1-Codex-Max	OpenAI	87	6	$10.00
24	GPT-5.1	OpenAI	87	6	$10.00
25	GPT-5.1-Codex	OpenAI	87	6	$10.00
26	GPT-5.1-Codex-Mini	OpenAI	87	6	$2.00
27	o3 Deep Research	OpenAI	86	6	$40.00
28	o3 Pro	OpenAI	86	6	$80.00
29	o3	OpenAI	86	6	$8.00
30	Claude Sonnet 4.6	Anthropic	85	6	$15.00
31	Claude Opus 4.5	Anthropic	85	6	$25.00
32	Gemini 2.5 Pro	Google	83	6	$10.00
33	Gemini 2.5 Pro Preview 06-05	Google	83	6	$10.00
34	Gemini 2.5 Pro Preview 05-06	Google	83	6	$10.00
35	Claude Sonnet 4.5	Anthropic	82	6	$15.00
36	o4 Mini Deep Research	OpenAI	81	6	$8.00
37	o4 Mini	OpenAI	81	6	$4.40
38	Grok 4.3	xAI	81	6	$2.50
39	Gemini 3.1 Flash Lite Preview	Google	79	6	$1.50
40	GPT-5.4 Nano	OpenAI	79	6	$1.25
41	GPT-5.4 Mini	OpenAI	79	6	$4.50
42	Gemini 2.5 Flash Lite Preview 09-2025	Google	79	6	$0.40
43	Gemini 2.5 Flash Lite	Google	79	6	$0.40
44	Gemini 2.5 Flash	Google	79	6	$2.50
45	Gemini 3.5 Flash	Google	79	6	$9.00
46	Claude Opus 4.1	Anthropic	75	6	$75.00
47	o1	OpenAI	74	6	$60.00
48	o4 Mini High	OpenAI	72	6	$4.40
49	Claude Haiku 4.5	Anthropic	69	6	$5.00
50	GPT-5 Mini	OpenAI	64	6	$2.00
51	GPT-5 Nano	OpenAI	46	6	$0.40
52	Fugu Ultra	sakana	40	6	$30.00
53	Claude Fable Latest	~anthropic	40	6	$50.00
54	Grok Build 0.1	xAI	40	6	$2.00
55	Anthropic Claude Haiku Latest	~anthropic	40	6	$5.00
56	OpenAI GPT Mini Latest	~openai	40	6	$4.50
57	Google Gemini Pro Latest	~google	40	6	$12.00
58	Google Gemini Flash Latest	~google	40	6	$9.00
59	Anthropic Claude Sonnet Latest	~anthropic	40	6	$15.00
60	OpenAI GPT Latest	~openai	40	6	$30.00
61	Claude Opus Latest	~anthropic	40	6	$25.00
62	Nano Banana Pro (Gemini 3 Pro Image)	Google	-	6	$12.00
63	Gemini 3.1 Flash Lite	Google	-	6	$1.50
64	GPT-5.3 Chat	OpenAI	87	5	$14.00
65	GPT-5.1 Chat	OpenAI	86	5	$10.00
66	Claude Opus 4	Anthropic	82	5	$75.00
67	Gemma 4 31B (free)	Google	80	5	Free
68	Gemma 4 31B	Google	80	5	$0.35
69	Qwen3.5 397B A17B	Alibaba	79	5	$2.45
70	Qwen3.5-122B-A10B	Alibaba	77	5	$2.08
71	GPT-5.2 Chat	OpenAI	77	5	$14.00
72	Qwen3.5-27B	Alibaba	77	5	$1.56
73	Qwen3.7 Plus	Alibaba	76	5	$1.28
74	Qwen3.5-35B-A3B	Alibaba	76	5	$1.00
75	Kimi K2.6	Moonshot AI	75	5	$3.41
76	o3 Mini	OpenAI	75	5	$4.40
77	MiniMax M3	MiniMax	74	5	$1.20
78	Claude Sonnet 4	Anthropic	74	5	$15.00
79	Qwen3.6 Plus	Alibaba	74	5	$1.95
80	Gemma 4 26B A4B (free)	Google	73	5	Free
81	Gemma 4 26B A4B	Google	73	5	$0.33
82	MiMo-V2.5	Xiaomi	73	5	$0.28
83	Mistral Medium 3.5	Mistral AI	71	5	$7.50
84	Qwen3.5-Flash	Alibaba	68	5	$0.26
85	Qwen3 VL 235B A22B Thinking	Alibaba	67	5	$2.60
86	GPT-4.1	OpenAI	67	5	$8.00
87	Qwen3.5-9B	Alibaba	66	5	$0.15
88	GLM 4.6V	Zhipu AI	65	5	$0.90
89	o3 Mini High	OpenAI	64	5	$4.40
90	GLM 4.5V	Zhipu AI	62	5	$1.80
91	Kimi K2.5	Moonshot AI	59	5	$2.02
92	Kimi K2.7 Code	Moonshot AI	54	5	$3.50
93	GPT-4.1 Mini	OpenAI	53	5	$1.60
94	GPT-4.1 Nano	OpenAI	42	5	$0.40
95	Step 3.7 Flash	StepFun	40	5	$1.15
96	GPT Chat Latest	OpenAI	40	5	$30.00
97	MoonshotAI Kimi Latest	~moonshotai	40	5	$3.41
98	Qwen3.5 Plus 2026-04-20	Alibaba	40	5	$1.80
99	Qwen3.6 Flash	Alibaba	40	5	$1.13
100	Qwen3.6 35B A3B	Alibaba	40	5	$1.00
101	Qwen3.6 27B	Alibaba	40	5	$2.65
102	GLM 5V Turbo	Zhipu AI	40	5	$4.00
103	Mistral Small 4	Mistral AI	40	5	$0.60
104	Seed-2.0-Lite	ByteDance	40	5	$2.00
105	Seed-2.0-Mini	ByteDance	40	5	$0.40
106	Qwen3.5 Plus 2026-02-15	Alibaba	40	5	$1.56
107	Seed 1.6 Flash	ByteDance	40	5	$0.30
108	Seed 1.6	ByteDance	40	5	$2.00
109	Qwen3 VL 8B Thinking	Alibaba	40	5	$1.36
110	Qwen3 VL 30B A3B Thinking	Alibaba	40	5	$1.56
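
The $/1M Out column is a price per million output tokens, so a rough output-side estimate is tokens / 1,000,000 × price. The sketch below works that arithmetic for two prices taken from the table; the function name and the 200,000-token workload are illustrative assumptions, and input-token pricing (not listed on this page) is ignored.

```python
from decimal import Decimal

def output_cost_usd(output_tokens: int, price_per_million: str) -> Decimal:
    """Estimate output-token cost in USD from a $/1M Out price."""
    return (Decimal(output_tokens) / Decimal(1_000_000)) * Decimal(price_per_million)

# Hypothetical workload: an agent run that emits 200,000 output tokens.
opus = output_cost_usd(200_000, "25.00")  # price from the Claude Opus 4.7 row
grok = output_cost_usd(200_000, "2.50")   # price from the Grok 4.20 row
print(opus, grok)
```

Decimal is used instead of float so that small per-token prices do not pick up binary rounding noise.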

Strong Agent Models

4 agentic capabilities

#	Model	Provider	Score	Context	$/1M Out
1	DeepSeek V4 Pro	DeepSeek	86	1.0M	$0.87
2	DeepSeek V3.2	DeepSeek	81	131K	$0.34
3	R1 0528	DeepSeek	79	164K	$2.15
4	GLM 5.2	Zhipu AI	78	1.0M	$3.00
5	MiniMax M2.5	MiniMax	78	205K	$0.48
6	GLM 5	Zhipu AI	78	203K	$1.92
7	DeepSeek V4 Flash	DeepSeek	77	1.0M	$0.18
8	GLM 5.1	Zhipu AI	76	203K	$3.08
9	MiMo-V2.5-Pro	Xiaomi	76	1.0M	$0.87
10	GLM 4.5	Zhipu AI	75	131K	$2.20
11	Qwen3.6 Max Preview	Alibaba	74	262K	$6.24
12	R1	DeepSeek	74	164K	$2.50
13	GLM 4.7	Zhipu AI	72	203K	$1.75
14	MiniMax M2	MiniMax	72	205K	$1.00
15	GPT-4o (2024-08-06)	OpenAI	71	128K	$10.00
16	GPT-4o (2024-05-13)	OpenAI	71	128K	$15.00
17	GPT-4o	OpenAI	71	128K	$10.00
18	GLM 5 Turbo	Zhipu AI	71	262K	$4.00
19	GLM 4.6	Zhipu AI	70	203K	$1.74
20	DeepSeek V3.2 Exp	DeepSeek	70	164K	$0.41
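
The two tables split models by how many of the six agentic capabilities they cover: five or more for Full-Stack, exactly four for Strong. The sketch below shows that bucketing under stated assumptions; the capability key names and the `tier` function are hypothetical, and it does not reproduce how this page computes the Score column, which is not described here.

```python
from typing import Iterable

# The six capabilities named in the page intro (key names are illustrative).
CAPABILITIES = ("tool_use", "reasoning", "json_output", "streaming", "vision", "web_search")

def tier(supported: Iterable[str]) -> str:
    """Bucket a model by how many of the six capabilities it supports."""
    have = set(supported)
    if "tool_use" not in have:
        return "Not agent-ready"  # function calling is listed as required
    count = sum(1 for cap in CAPABILITIES if cap in have)
    if count >= 5:
        return "Full-Stack"
    if count == 4:
        return "Strong"
    return "Other"

print(tier(["tool_use", "reasoning", "json_output", "streaming"]))
```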

What Makes a Model Good for Agents?

Function Calling (Required)

The foundation of agentic AI. Models must be able to decide when and how to call external tools - APIs, databases, file systems, web browsers. Without this, agents can only produce text, not take action.
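
In practice, the model emits a tool name plus JSON-encoded arguments, and the host application executes the call. The sketch below shows that dispatch step in a vendor-neutral way; the `tool_call` shape, the `get_weather` tool, and the `dispatch` helper are all hypothetical and do not mirror any specific provider's API.

```python
import json
from typing import Any, Callable, Dict

def get_weather(city: str) -> str:
    """Stand-in tool; a real agent would call an external API here."""
    return f"Weather lookup requested for {city}"

# Registry of tools the agent is allowed to call (hypothetical).
TOOLS: Dict[str, Callable[..., Any]] = {"get_weather": get_weather}

def dispatch(tool_call: Dict[str, str]) -> Dict[str, Any]:
    """Run one model-requested tool call and return a result to feed back."""
    name = tool_call.get("name", "")
    if name not in TOOLS:
        return {"ok": False, "error": f"unknown tool: {name}"}
    try:
        args = json.loads(tool_call.get("arguments", "{}"))
        return {"ok": True, "result": TOOLS[name](**args)}
    except (json.JSONDecodeError, TypeError) as exc:
        # Malformed JSON or mismatched arguments: report instead of crashing.
        return {"ok": False, "error": str(exc)}

call = {"name": "get_weather", "arguments": '{"city": "Oslo"}'}
result = dispatch(call)
print(result)
```

Returning an error payload rather than raising lets the agent loop hand the failure back to the model so it can retry with corrected arguments.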

JSON Mode (Highly Recommended)

Agents need to produce structured, parseable output reliably. JSON mode constrains the model to return syntactically valid JSON, which prevents most of the parsing errors that break automated workflows.
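
Even with JSON mode, it is prudent to parse defensively and check for required keys, since a response may still be truncated or lack the fields an agent expects. The sketch below uses only the standard library; `parse_agent_output` and the `action`/`input` keys are hypothetical.

```python
import json
from typing import Any, Dict, Optional

# Keys this hypothetical agent expects in every model response.
REQUIRED_KEYS = ("action", "input")

def parse_agent_output(raw: str) -> Optional[Dict[str, Any]]:
    """Return the parsed object, or None if it is unusable."""
    try:
        data = json.loads(raw)
    except json.JSONDecodeError:
        return None  # not valid JSON (for example, truncated output)
    if not isinstance(data, dict):
        return None  # valid JSON, but not an object
    if any(key not in data for key in REQUIRED_KEYS):
        return None  # object is missing a required field
    return data

parsed = parse_agent_output('{"action": "search", "input": "agent models"}')
print(parsed)
```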

Reasoning (For Complex Tasks)

Chain-of-thought reasoning helps agents plan multi-step sequences, recover from errors, and make better decisions about which tools to use. Critical for complex autonomous workflows.

Streaming (For Real-Time UX)

Streaming lets you observe agent actions in real-time - see which tools it's calling, watch it reason through problems, and intervene early if it goes off track. Essential for interactive agent interfaces.
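
One way to intervene early is to inspect each event as it arrives and stop consuming the stream when the agent requests a disallowed tool. The sketch below uses a simulated stream; the event dictionaries, `fake_stream`, and `watch` are hypothetical and do not represent any provider's streaming format.

```python
from typing import Dict, Iterable, Iterator, List, Set

def fake_stream() -> Iterator[Dict[str, str]]:
    """Simulated agent events; a real stream would come from a model API."""
    yield {"type": "text", "data": "Checking the repo..."}
    yield {"type": "tool_call", "data": "read_file"}
    yield {"type": "tool_call", "data": "delete_branch"}
    yield {"type": "text", "data": "Done."}

def watch(events: Iterable[Dict[str, str]], blocked: Set[str]) -> List[Dict[str, str]]:
    """Collect events until a blocked tool call appears, then stop."""
    seen: List[Dict[str, str]] = []
    for event in events:
        if event["type"] == "tool_call" and event["data"] in blocked:
            break  # intervene before the blocked call is acted on
        seen.append(event)
    return seen

seen = watch(fake_stream(), blocked={"delete_branch"})
print([event["data"] for event in seen])
```

Because the generator is consumed lazily, breaking out of the loop means the remaining events are never pulled from the stream.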

Vision (For Multimodal Agents)

Vision-capable agents can process screenshots, analyze charts, read documents, and interact with visual interfaces. Required for browser automation, document processing, and GUI agents.

Web Search (For Research Agents)

Built-in web search lets agents find current information without custom search tool integrations. Ideal for research agents, fact-checkers, and competitive analysis workflows.


Frequently Asked Questions

Agent-capable models need strong function calling (tool use), reasoning ability, long context windows, and reliable instruction following. They must decide when to use tools, handle multi-step plans, and recover from errors.

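In the rankings above, the top-scoring models are Claude Fable 5 (97) and Claude Opus 4.7 (95) from Anthropic, followed by GPT-5.5 from OpenAI and Gemini 3.1 Pro Preview from Google (both 92). Top choices for AI agents offer reliable function calling, strong reasoning, and the ability to maintain context across long multi-step workflows.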

Chatbots respond to individual messages. AI agents autonomously execute multi-step tasks - browsing the web, writing code, calling APIs, and making decisions. Agents use tool calling and planning capabilities that go beyond simple conversation.
