Best AI models for agent workflows
AI models suited for tool-using autonomous agents — strong function calling, long-context planning, and reliability.
Agentic models reliably invoke tools, chain steps, and recover from errors. Ranked for tool-calling support and reasoning.
- 1
DeepSeek V3
84.0FrontierDeepSeek- Context:
- 128K
- Input:
- $0.27/1M
- Output:
- $1.1/1M
MathAgenticFrontierOpen sourceCode - 2
o3
83.5FrontierOpenAI- Context:
- 200K
- Input:
- $2/1M
- Output:
- $8/1M
VisionMathAgenticLong contextFrontierReasoning - 3
Gemini 2 Pro
82.8FrontierGoogle- Context:
- 2M
- Input:
- $1.25/1M
- Output:
- $5/1M
VisionMathAgenticLong contextFrontierCode - 4
Grok 3
80.1FrontierxAI- Context:
- 131K
- Input:
- $3/1M
- Output:
- $15/1M
VisionMathAgenticFrontierReasoning - 5
GPT-5
77.9StrongOpenAI- Context:
- 272K
- Input:
- $1.25/1M
- Output:
- $10/1M
VisionMathAgenticLong contextReasoningCode - 6
GPT-4o mini
77.1StrongOpenAI- Context:
- 128K
- Input:
- $0.15/1M
- Output:
- $0.6/1M
VisionMathAgenticBudget - 7
o1
76.3StrongOpenAI- Context:
- 200K
- Input:
- $15/1M
- Output:
- $60/1M
VisionMathAgenticLong contextReasoning - 8
Claude Opus 4
75.7StrongAnthropic- Context:
- 200K
- Input:
- $5/1M
- Output:
- $25/1M
VisionAgenticLong contextReasoningCode - 9
o3-mini
74.5StrongOpenAI- Context:
- 200K
- Input:
- $1.1/1M
- Output:
- $4.4/1M
MathAgenticLong contextCode - 10
Claude Sonnet 4
66.2CompetentAnthropic- Context:
- 200K
- Input:
- $3/1M
- Output:
- $15/1M
VisionAgenticLong context - 11
GPT-5.4
59.3CompetentOpenAI- Context:
- 1.1M
VisionAgenticLong context - 12
Claude Opus 4.7
57.2CompetentAnthropic- Context:
- 1M
VisionAgenticLong context - 13
Claude Sonnet 4.6
44.8BasicAnthropic- Context:
- 1M
VisionAgenticLong contextCode - 14
GPT-5.4 nano
43.3BasicOpenAI- Context:
- 272K
VisionAgenticLong context - 15
GPT-5.4 mini
34.6LimitedOpenAI- Context:
- 272K
VisionAgenticLong context - 16
Claude Haiku 4.5
31.1LimitedAnthropic- Context:
- 200K
- Input:
- $1/1M
- Output:
- $5/1M
VisionAgenticLong context