Best AI models for agent workflows

AI models suited for tool-using autonomous agents — strong function calling, long-context planning, and reliability.

Agentic models reliably invoke tools, chain steps, and recover from errors. Ranked for tool-calling support and reasoning.

1
DeepSeek V3
84.0Frontier
DeepSeek
Context:
128K
Input:
$0.27/1M
Output:
$1.1/1M
MathAgenticFrontierOpen sourceCode
2
o3
83.5Frontier
OpenAI
Context:
200K
Input:
$2/1M
Output:
$8/1M
VisionMathAgenticLong contextFrontierReasoning
3
Gemini 2 Pro
82.8Frontier
Google
Context:
2M
Input:
$1.25/1M
Output:
$5/1M
VisionMathAgenticLong contextFrontierCode
4
Grok 3
80.1Frontier
xAI
Context:
131K
Input:
$3/1M
Output:
$15/1M
VisionMathAgenticFrontierReasoning
5
GPT-5
77.9Strong
OpenAI
Context:
272K
Input:
$1.25/1M
Output:
$10/1M
VisionMathAgenticLong contextReasoningCode
6
GPT-4o mini
77.1Strong
OpenAI
Context:
128K
Input:
$0.15/1M
Output:
$0.6/1M
VisionMathAgenticBudget
7
o1
76.3Strong
OpenAI
Context:
200K
Input:
$15/1M
Output:
$60/1M
VisionMathAgenticLong contextReasoning
8
Claude Opus 4
75.7Strong
Anthropic
Context:
200K
Input:
$5/1M
Output:
$25/1M
VisionAgenticLong contextReasoningCode
9
o3-mini
74.5Strong
OpenAI
Context:
200K
Input:
$1.1/1M
Output:
$4.4/1M
MathAgenticLong contextCode
10
Claude Sonnet 4
66.2Competent
Anthropic
Context:
200K
Input:
$3/1M
Output:
$15/1M
VisionAgenticLong context
11
GPT-5.4
59.3Competent
OpenAI
Context:
1.1M
VisionAgenticLong context
12
Claude Opus 4.7
57.2Competent
Anthropic
Context:
1M
VisionAgenticLong context
13
Claude Sonnet 4.6
44.8Basic
Anthropic
Context:
1M
VisionAgenticLong contextCode
14
GPT-5.4 nano
43.3Basic
OpenAI
Context:
272K
VisionAgenticLong context
15
GPT-5.4 mini
34.6Limited
OpenAI
Context:
272K
VisionAgenticLong context
16
Claude Haiku 4.5
31.1Limited
Anthropic
Context:
200K
Input:
$1/1M
Output:
$5/1M
VisionAgenticLong context