Models
Providers
Benchmarks
Changelog
Compare
New to AI?

Models

88 models tracked across 12 providers.

Qwen 3 14B
Alibaba
Active
14B compact Qwen 3 model for efficient local deployment.
Context: 128KModalities: text
Qwen 3 235B
Alibaba
Active
Alibaba's frontier open-weight MoE model with hybrid thinking.
Context: 128KModalities: text
Qwen 3 32B
Alibaba
Active
32B Qwen 3 model offering strong reasoning at mid-size cost.
Context: 128KModalities: text
Qwen 3 72B
Alibaba
Active
72B dense open-weight model with hybrid thinking from Alibaba.
Context: 128KModalities: text
Amazon Nova 2 Lite
Amazon (AWS)
Active
Cost-efficient Nova 2 model with 1M context.
Context: 1MModalities: text, image
Amazon Nova 2 Pro
Amazon (AWS)
Active
Next-generation Nova flagship with 1M context from Amazon Bedrock.
Context: 1MModalities: text, image, video
Amazon Nova Lite
Amazon (AWS)
Active
Low-cost multimodal model from Amazon for high-throughput workloads.
Context: 300KModalities: text, image, video
Amazon Nova Micro
Amazon (AWS)
Active
Ultra-low-cost text-only model from Amazon.
Context: 128KModalities: text
Amazon Nova Pro
Amazon (AWS)
Active
Amazon's most capable multimodal model, available through Amazon Bedrock.
Context: 300KModalities: text, image, video
Claude 3 Opus
Anthropic
Deprecated
Previous Anthropic flagship, now superseded by Claude Opus 4.
Context: 200KModalities: text, image
Claude 3.5 Haiku
Anthropic
Active
Fast, low-cost model with stronger capabilities than Claude 3 Haiku.
Context: 200KModalities: text, image
Claude 3.5 Sonnet
Anthropic
Active
Mid-2024 release setting a new standard for coding and reasoning at mid-tier price.
Context: 200KModalities: text, image
Claude 3.7 Sonnet
Anthropic
Active
Anthropic Claude 3.7 Sonnet.
Context: 200KModalities: text, image
Claude Haiku 4.5
Anthropic
Active
Fast, low-cost Claude model for latency-sensitive workloads.
Context: 200KModalities: text, image
Claude Opus 4
Anthropic
Active
Anthropic's most capable model for complex reasoning and long-context work.
Context: 200KModalities: text, image
Claude Opus 4.1
Anthropic
Active
Most capable Claude Opus model.
Modalities: text, image
Claude Opus 4.5
Anthropic
Active
Anthropic Claude Opus 4.5.
Context: 200KModalities: text, image
Claude Opus 4.6
Anthropic
Active
Anthropic Claude Opus 4.6.
Modalities: text, image
Claude Opus 4.7
Anthropic
Active
Anthropic Claude Opus 4.7.
Context: 1MModalities: text, image
Claude Sonnet 4
Anthropic
Active
Balanced mid-tier Claude model with strong general capability and price.
Context: 200KModalities: text, image
Claude Sonnet 4.5
Anthropic
Active
Balanced performance and speed.
Context: 200KModalities: text, image
Claude Sonnet 4.6
Anthropic
Active
Anthropic Claude Sonnet 4.6.
Context: 1MModalities: text, image
Command A
Cohere
Active
Cohere's most capable model with 256K context, optimized for enterprise agentic tasks.
Context: 256KModalities: text
Command R
Cohere
Active
Efficient mid-size model from Cohere for RAG and agentic tasks.
Context: 128KModalities: text
Command R+
Cohere
Active
Cohere's flagship model optimized for enterprise RAG and complex tasks.
Context: 128KModalities: text
DeepSeek R1
DeepSeek
Active
Open-weight reasoning model matching o1 performance, fully open-source.
Context: 128KModalities: text
DeepSeek V3
DeepSeek
Active
Open-weight frontier model competitive with GPT-4o and Claude Sonnet at fraction of training cost.
Context: 128KModalities: text
DeepSeek V3 (2506)
DeepSeek
Active
Latest version of DeepSeek V3.
Context: 131KModalities: text
Gemini 1.5 Flash
Google
Deprecated
Fast multimodal model from the Gemini 1.5 generation.
Context: 1MModalities: text, image, audio, video
Gemini 1.5 Pro
Google
Deprecated
Previous Google flagship with 1M context window, superseded by Gemini 2.
Context: 1MModalities: text, image, audio, video
Gemini 2 Flash
Google
Active
Low-latency, low-cost multimodal model with 1M context.
Context: 1.0MModalities: text, image, audio, video
Gemini 2 Pro
Google
Active
Google's flagship multimodal model with very long context.
Context: 2MModalities: text, image, audio, video
Gemini 2.5 Flash
Google
Active
Fast and efficient multimodal model.
Context: 1.0MModalities: text, image, audio, video
Gemini 2.5 Flash Lite
Google
Active
Ultra-fast lightweight variant.
Context: 1.0MModalities: text, image
Gemini 2.5 Pro
Google
Active
Google Gemini 2.5 Pro — state-of-the-art thinking model.
Context: 1.0MModalities: text, image, audio, video
Gemma 2 27B
Google
Active
Open-weights 27B model from Google with state-of-the-art performance at its size.
Context: 8KModalities: text
Gemma 2 9B
Google
Active
Open-weights 9B model from Google, competitive with much larger models.
Context: 8KModalities: text
Llama 3 70B
Meta
Active
Open-weights 70B model for high-quality general use.
Context: 128KModalities: text
Llama 3 8B
Meta
Active
Smaller open-weights Llama for on-device and cost-sensitive use.
Context: 128KModalities: text
Llama 3.1 405B
Meta
Active
Meta's largest open-weights model, competitive with frontier closed models.
Context: 128KModalities: text
Llama 3.1 70B
Meta
Active
Updated 70B open-weights model with 128k context and improved tool calling.
Context: 128KModalities: text
Llama 3.2 11B
Meta
Active
Multimodal 11B model from Meta supporting text and image inputs.
Context: 128KModalities: text, image
Llama 3.2 3B
Meta
Active
Small on-device model for edge and mobile deployments.
Context: 128KModalities: text
Llama 3.3 70B
Meta
Active
Meta Llama 3.3 70B — improved instruction-following.
Modalities: text
Llama 4 Maverick
Meta
Active
High-performance multimodal model.
Modalities: text, image
Llama 4 Scout
Meta
Active
Efficient multimodal model with 17B active parameters.
Modalities: text, image
Phi-3.5 Mini
Microsoft
Active
3.8B instruction-following model targeting mobile and edge deployment.
Context: 128KModalities: text
Phi-4
Microsoft
Active
14B small language model from Microsoft Research with state-of-the-art STEM reasoning.
Context: 16KModalities: text
Phi-4 Mini
Microsoft
Active
Compact yet capable small language model.
Modalities: text
Phi-4 Reasoning
Microsoft
Active
14B reasoning-specialized Phi model with extended thinking.
Context: 32KModalities: text
Phi-4 Reasoning Vision
Microsoft
Active
15B multimodal reasoning model with image understanding.
Context: 32KModalities: text, image
Codestral
Mistral AI
Active
Mistral's code-specialized model with long context.
Context: 32KModalities: text
Codestral 2508
Mistral AI
Active
Specialized code generation model.
Context: 256KModalities: text
Devstral
Mistral AI
Active
Agentic coding model for software development.
Context: 256KModalities: text
Mistral 7B
Mistral AI
Active
Compact open-weights model that outperforms Llama 2 13B on many benchmarks.
Context: 32KModalities: text
Mistral Large
Mistral AI
Active
Mistral's flagship commercial model with tool calling and structured outputs.
Context: 262KModalities: text, image
Mistral Large 3
Mistral AI
Active
Top-tier reasoning and coding model.
Context: 262KModalities: text, image
Mistral Medium 3
Mistral AI
Active
Balanced performance and cost.
Modalities: text
Mistral NeMo
Mistral AI
Active
12B open-weights model built with NVIDIA, with 128k context.
Context: 128KModalities: text
Mistral Small
Mistral AI
Active
Efficient open-weights mid-sized model from Mistral.
Context: 32KModalities: text
Mistral Small 3.2
Mistral AI
Active
Fast and affordable.
Modalities: text
Mixtral 8x7B
Mistral AI
Active
Open-weights mixture-of-experts model with GPT-3.5 class performance.
Context: 32KModalities: text
Llama 3.1 Nemotron 70B
NVIDIA
Active
NVIDIA-tuned Llama 3.1 70B with state-of-the-art alignment and helpfulness.
Context: 128KModalities: text
Llama 3.3 Nemotron Super 49B
NVIDIA
Active
49B parameter efficient model with frontier reasoning capability from NVIDIA.
Context: 128KModalities: text
GPT-4 Turbo
OpenAI
Deprecated
Previous-gen GPT-4 flagship with 128k context, now superseded by GPT-4o.
Context: 128KModalities: text, image
GPT-4.1
OpenAI
Active
OpenAI GPT-4.1
Context: 1.0MModalities: text, image
GPT-4.1 mini
OpenAI
Active
Smaller, faster and cheaper version of GPT-4.1.
Context: 1.0MModalities: text, image
GPT-4.1 nano
OpenAI
Active
Ultra-fast nano variant of GPT-4.1.
Context: 1.0MModalities: text, image
GPT-4o
OpenAI
Active
Fast, multimodal model for general use with 128k context.
Context: 128KModalities: text, image, audio
GPT-4o mini
OpenAI
Active
Low-cost, fast multimodal model for high-volume tasks.
Context: 128KModalities: text, image
GPT-5
OpenAI
Active
OpenAI's frontier flagship model with long context and advanced reasoning.
Context: 272KModalities: text, image
GPT-5.1
OpenAI
Active
OpenAI GPT-5.1.
Context: 272KModalities: text, image
GPT-5.2
OpenAI
Active
OpenAI GPT-5.2.
Context: 272KModalities: text, image
GPT-5.3
OpenAI
Active
OpenAI GPT-5.3.
Modalities: text
GPT-5.4
OpenAI
Active
OpenAI GPT-5.4.
Context: 1.1MModalities: text, image
GPT-5.4 mini
OpenAI
Active
Cost-efficient variant of GPT-5.4.
Context: 272KModalities: text, image
GPT-5.4 nano
OpenAI
Active
Ultra-fast nano variant of GPT-5.4.
Context: 272KModalities: text, image
o1
OpenAI
Active
Reasoning-focused model that thinks before answering.
Context: 200KModalities: text, image
o1-mini
OpenAI
Deprecated
Smaller o1-series reasoning model, now superseded by o3-mini.
Context: 128KModalities: text
o3
OpenAI
Active
OpenAI's most powerful reasoning model, successor to o1.
Context: 200KModalities: text, image
o3-mini
OpenAI
Active
Cost-efficient reasoning model with strong STEM performance.
Context: 200KModalities: text
o3-pro
OpenAI
Active
Highest capability reasoning model.
Context: 200KModalities: text, image
o4-mini
OpenAI
Active
Fast reasoning model.
Context: 200KModalities: text, image
Grok 2
xAI
Deprecated
Previous generation Grok model, superseded by Grok 3.
Context: 131KModalities: text, image
Grok 3
xAI
Active
xAI's frontier reasoning model with real-time web access.
Context: 131KModalities: text, image
Grok 3 Mini
xAI
Active
Fast and cost-efficient reasoning model.
Modalities: text
Grok 4
xAI
Active
Most capable Grok model.
Context: 256KModalities: text
Grok 4 Fast
xAI
Active
High-speed variant of Grok 4.
Modalities: text