Grok 3

Active

xAI's frontier reasoning model with real-time web access.

Overview

Grok 3 is xAI's most capable model with strong reasoning capabilities and real-time access to information through X's data firehose. Available through the xAI API and to X Premium subscribers.

Benchmarks

Benchmark	Score	Source
AIME 2024Math	83.9% accuracy	Self-reported xAI model card
Aider PolyglotCoding	53.3% pass@2	Third-party Papers With Code
GPQA DiamondReasoning	84.6% accuracy	Self-reported xAI model card
HumanEvalCoding	88.5pass@1 %	Self-reported xAI model card
MATHMath	93.3% accuracy	Self-reported xAI model card
MMLUGeneral knowledge	87% accuracy	Self-reported xAI model card
MMLU-ProGeneral knowledge	79.3% accuracy	Self-reported xAI model card
SWE-bench VerifiedCoding	50% resolved	Third-party Papers With Code

Integrations & tooling support

Tool calling: Supported
Structured outputs: Supported

Price vs quality

Overpriced

Mid-tier performance at frontier pricing.

Quality percentile: 58.9%
Effective price: $12/1M
Pricing breakdown: $3/1M in
$15/1M out

Community ratings

No ratings yet. Be the first to rate Grok 3.