Grok 3
ActivexAI's frontier reasoning model with real-time web access.
Overview
Grok 3 is xAI's most capable model with strong reasoning capabilities and real-time access to information through X's data firehose. Available through the xAI API and to X Premium subscribers.
Benchmarks
| Benchmark | Score | Source |
|---|---|---|
| AIME 2024Math | 83.9% accuracy | Self-reported xAI model card |
| Aider PolyglotCoding | 53.3% pass@2 | Third-party Papers With Code |
| GPQA DiamondReasoning | 84.6% accuracy | Self-reported xAI model card |
| HumanEvalCoding | 88.5pass@1 % | Self-reported xAI model card |
| MATHMath | 93.3% accuracy | Self-reported xAI model card |
| MMLUGeneral knowledge | 87% accuracy | Self-reported xAI model card |
| MMLU-ProGeneral knowledge | 79.3% accuracy | Self-reported xAI model card |
| SWE-bench VerifiedCoding | 50% resolved | Third-party Papers With Code |
Integrations & tooling support
- Tool calling
- Supported
- Structured outputs
- Supported
Price vs quality
Overpriced
Mid-tier performance at frontier pricing.
- Quality percentile
- 58.9%
- Effective price
- $12/1M
- Pricing breakdown
- $3/1M in
$15/1M out
vs 8 benchmarks
/ 1M tokens (input + 3× output)
Community ratings
No ratings yet. Be the first to rate Grok 3.
Rate Grok 3
Sign in to rate and review.
Comments
Sign in to leave a comment.