Claude Opus 4

Status: Active

Anthropic's most capable model for complex reasoning and long-context work.

Overview

Claude Opus 4 is Anthropic's flagship model, optimized for deep reasoning, coding, and extended agentic workflows. It supports a 200k-token context window, tool use, and structured outputs.

Benchmarks

Benchmark                     Score           Source
AIME 2024 (Math)              48% accuracy    Third-party (Artificial Analysis)
GPQA Diamond (Reasoning)      74% accuracy    Self-reported (Anthropic model card)
GSM8K (Math)                  95.4% accuracy  Self-reported (Anthropic model card)
HumanEval (Coding)            93% pass@1      Self-reported (Anthropic model card)
MATH (Math)                   78.2% accuracy  Self-reported (Anthropic model card)
MMLU (General knowledge)      87.5% accuracy  Self-reported (Anthropic model card)
MMLU-Pro (General knowledge)  77.5% accuracy  Self-reported (Anthropic model card)
SWE-bench Verified (Coding)   52% resolved    Self-reported (Anthropic Claude 4 announcement)

Integrations & tooling support

Tool calling: Supported
Structured outputs: Supported

Price vs quality

Overpriced

Mid-tier performance at frontier pricing.

Quality percentile: 44.1% (vs 8 benchmarks)
Effective price: $20 per 1M tokens (input + 3× output)
Pricing breakdown: $5/1M in, $25/1M out
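
The effective price can be reproduced from the pricing breakdown. The sketch below assumes the "input + 3× output" label means a weighted average in which output tokens count three times as much as input tokens; that reading is an inference, chosen because it yields the listed $20 figure. The function names and the example token counts are hypothetical and used only for illustration.

```python
def effective_price(input_price: float, output_price: float,
                    output_weight: float = 3.0) -> float:
    """Blend per-1M-token input and output prices into one figure.

    Assumption: the blend is a weighted average with weight 1 for input
    and `output_weight` for output, inferred from the listed numbers.
    """
    return (input_price + output_weight * output_price) / (1.0 + output_weight)


def request_cost(input_tokens: int, output_tokens: int,
                 input_price: float, output_price: float) -> float:
    """Dollar cost of one request, given prices per 1M tokens."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


INPUT_PRICE = 5.0    # $ per 1M input tokens, from the pricing breakdown
OUTPUT_PRICE = 25.0  # $ per 1M output tokens, from the pricing breakdown

blended = effective_price(INPUT_PRICE, OUTPUT_PRICE)
print(f"Effective price: ${blended:.2f} per 1M tokens")  # (5 + 3 * 25) / 4 = 20.00

# Hypothetical request: 10,000 input tokens and 2,000 output tokens.
cost = request_cost(10_000, 2_000, INPUT_PRICE, OUTPUT_PRICE)
print(f"Example request cost: ${cost:.2f}")
```

A blended figure like this is convenient for comparing models at a glance, but actual spend depends on each workload's real ratio of input to output tokens.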
