Models
88 models tracked across 12 providers.
- Active
Qwen 3 14B
Alibaba
14B compact Qwen 3 model for efficient local deployment.
Context: 128K
Modalities: text
- Active
Qwen 3 235B
Alibaba
Alibaba's frontier open-weight MoE model with hybrid thinking.
Context: 128K
Modalities: text
- Active
Qwen 3 32B
Alibaba
32B Qwen 3 model offering strong reasoning at mid-size cost.
Context: 128K
Modalities: text
- Active
Qwen 3 72B
Alibaba
72B dense open-weight model with hybrid thinking from Alibaba.
Context: 128K
Modalities: text
- Active
Amazon Nova 2 Lite
Amazon (AWS)
Cost-efficient Nova 2 model with 1M context.
Context: 1M
Modalities: text, image
- Active
Amazon Nova 2 Pro
Amazon (AWS)
Next-generation Nova flagship with 1M context from Amazon Bedrock.
Context: 1M
Modalities: text, image, video
- Active
Amazon Nova Lite
Amazon (AWS)
Low-cost multimodal model from Amazon for high-throughput workloads.
Context: 300K
Modalities: text, image, video
- Active
Amazon Nova Micro
Amazon (AWS)
Ultra-low-cost text-only model from Amazon.
Context: 128K
Modalities: text
- Active
Amazon Nova Pro
Amazon (AWS)
Amazon's most capable multimodal model, available through Amazon Bedrock.
Context: 300K
Modalities: text, image, video
- Deprecated
Claude 3 Opus
Anthropic
Previous Anthropic flagship, now superseded by Claude Opus 4.
Context: 200K
Modalities: text, image
- Active
Claude 3.5 Haiku
Anthropic
Fast, low-cost model with stronger capabilities than Claude 3 Haiku.
Context: 200K
Modalities: text, image
- Active
Claude 3.5 Sonnet
Anthropic
Mid-2024 release that set a new standard for coding and reasoning at a mid-tier price.
Context: 200K
Modalities: text, image
- Active
Claude 3.7 Sonnet
Anthropic
Anthropic Claude 3.7 Sonnet.
Context: 200K
Modalities: text, image
- Active
Claude Haiku 4.5
Anthropic
Fast, low-cost Claude model for latency-sensitive workloads.
Context: 200K
Modalities: text, image
- Active
Claude Opus 4
Anthropic
Anthropic's most capable model for complex reasoning and long-context work.
Context: 200K
Modalities: text, image
- Active
Claude Opus 4.1
Anthropic
Most capable Claude Opus model.
Modalities: text, image
- Active
Claude Opus 4.5
Anthropic
Anthropic Claude Opus 4.5.
Context: 200K
Modalities: text, image
- Active
Claude Opus 4.6
Anthropic
Anthropic Claude Opus 4.6.
Modalities: text, image
- Active
Claude Opus 4.7
Anthropic
Anthropic Claude Opus 4.7.
Context: 1M
Modalities: text, image
- Active
Claude Sonnet 4
Anthropic
Balanced mid-tier Claude model with strong general capability for its price.
Context: 200K
Modalities: text, image
- Active
Claude Sonnet 4.5
Anthropic
Balanced performance and speed.
Context: 200K
Modalities: text, image
- Active
Claude Sonnet 4.6
Anthropic
Anthropic Claude Sonnet 4.6.
Context: 1M
Modalities: text, image
- Active
Command A
Cohere
Cohere's most capable model with 256K context, optimized for enterprise agentic tasks.
Context: 256K
Modalities: text
- Active
Command R
Cohere
Efficient mid-size model from Cohere for RAG and agentic tasks.
Context: 128K
Modalities: text
- Active
Command R+
Cohere
Cohere's flagship model optimized for enterprise RAG and complex tasks.
Context: 128K
Modalities: text
- Active
DeepSeek R1
DeepSeek
Open-weight reasoning model matching o1 performance, fully open-source.
Context: 128K
Modalities: text
- Active
DeepSeek V3
DeepSeek
Open-weight frontier model competitive with GPT-4o and Claude Sonnet at a fraction of the training cost.
Context: 128K
Modalities: text
- Active
DeepSeek V3 (2506)
DeepSeek
Latest version of DeepSeek V3.
Context: 131K
Modalities: text
- Deprecated
Gemini 1.5 Flash
Google
Fast multimodal model from the Gemini 1.5 generation.
Context: 1M
Modalities: text, image, audio, video
- Deprecated
Gemini 1.5 Pro
Google
Previous Google flagship with 1M context window, superseded by Gemini 2.
Context: 1M
Modalities: text, image, audio, video
- Active
Gemini 2 Flash
Google
Low-latency, low-cost multimodal model with 1M context.
Context: 1M
Modalities: text, image, audio, video
- Active
Gemini 2 Pro
Google
Google's flagship multimodal model with very long context.
Context: 2M
Modalities: text, image, audio, video
- Active
Gemini 2.5 Flash
Google
Fast and efficient multimodal model.
Context: 1M
Modalities: text, image, audio, video
- Active
Gemini 2.5 Flash Lite
Google
Ultra-fast lightweight variant.
Context: 1M
Modalities: text, image
- Active
Gemini 2.5 Pro
Google
Google Gemini 2.5 Pro, a state-of-the-art thinking model.
Context: 1M
Modalities: text, image, audio, video
- Active
Gemma 2 27B
Google
Open-weights 27B model from Google with state-of-the-art performance at its size.
Context: 8K
Modalities: text
- Active
Gemma 2 9B
Google
Open-weights 9B model from Google, competitive with much larger models.
Context: 8K
Modalities: text
- Active
Llama 3 70B
Meta
Open-weights 70B model for high-quality general use.
Context: 128K
Modalities: text
- Active
Llama 3 8B
Meta
Smaller open-weights Llama for on-device and cost-sensitive use.
Context: 128K
Modalities: text
- Active
Llama 3.1 405B
Meta
Meta's largest open-weights model, competitive with frontier closed models.
Context: 128K
Modalities: text
- Active
Llama 3.1 70B
Meta
Updated 70B open-weights model with 128K context and improved tool calling.
Context: 128K
Modalities: text
- Active
Llama 3.2 11B
Meta
Multimodal 11B model from Meta supporting text and image inputs.
Context: 128K
Modalities: text, image
- Active
Llama 3.2 3B
Meta
Small on-device model for edge and mobile deployments.
Context: 128K
Modalities: text
- Active
Llama 3.3 70B
Meta
Meta Llama 3.3 70B, with improved instruction-following.
Modalities: text
- Active
Llama 4 Maverick
Meta
High-performance multimodal model.
Modalities: text, image
- Active
Llama 4 Scout
Meta
Efficient multimodal model with 17B active parameters.
Modalities: text, image
- Active
Phi-3.5 Mini
Microsoft
3.8B instruction-following model targeting mobile and edge deployment.
Context: 128K
Modalities: text
- Active
Phi-4
Microsoft
14B small language model from Microsoft Research with state-of-the-art STEM reasoning.
Context: 16K
Modalities: text
- Active
Phi-4 Mini
Microsoft
Compact yet capable small language model.
Modalities: text
- Active
Phi-4 Reasoning
Microsoft
14B reasoning-specialized Phi model with extended thinking.
Context: 32K
Modalities: text
- Active
Phi-4 Reasoning Vision
Microsoft
15B multimodal reasoning model with image understanding.
Context: 32K
Modalities: text, image
- Active
Codestral
Mistral AI
Mistral's code-specialized model with long context.
Context: 32K
Modalities: text
- Active
Codestral 2508
Mistral AI
Specialized code generation model.
Context: 256K
Modalities: text
- Active
Devstral
Mistral AI
Agentic coding model for software development.
Context: 256K
Modalities: text
- Active
Mistral 7B
Mistral AI
Compact open-weights model that outperforms Llama 2 13B on many benchmarks.
Context: 32K
Modalities: text
- Active
Mistral Large
Mistral AI
Mistral's flagship commercial model with tool calling and structured outputs.
Context: 262K
Modalities: text, image
- Active
Mistral Large 3
Mistral AI
Top-tier reasoning and coding model.
Context: 262K
Modalities: text, image
- Active
Mistral Medium 3
Mistral AI
Balanced performance and cost.
Modalities: text
- Active
Mistral NeMo
Mistral AI
12B open-weights model built with NVIDIA, with 128K context.
Context: 128K
Modalities: text
- Active
Mistral Small
Mistral AI
Efficient open-weights mid-sized model from Mistral.
Context: 32K
Modalities: text
- Active
Mistral Small 3.2
Mistral AI
Fast and affordable.
Modalities: text
- Active
Mixtral 8x7B
Mistral AI
Open-weights mixture-of-experts model with GPT-3.5-class performance.
Context: 32K
Modalities: text
- Active
Llama 3.1 Nemotron 70B
NVIDIA
NVIDIA-tuned Llama 3.1 70B with state-of-the-art alignment and helpfulness.
Context: 128K
Modalities: text
- Active
Llama 3.3 Nemotron Super 49B
NVIDIA
Efficient 49B-parameter model with frontier reasoning capability from NVIDIA.
Context: 128K
Modalities: text
- Deprecated
GPT-4 Turbo
OpenAI
Previous-gen GPT-4 flagship with 128K context, now superseded by GPT-4o.
Context: 128K
Modalities: text, image
- Active
GPT-4.1
OpenAI
OpenAI GPT-4.1.
Context: 1M
Modalities: text, image
- Active
GPT-4.1 mini
OpenAI
Smaller, faster, and cheaper version of GPT-4.1.
Context: 1M
Modalities: text, image
- Active
GPT-4.1 nano
OpenAI
Ultra-fast nano variant of GPT-4.1.
Context: 1M
Modalities: text, image
- Active
GPT-4o
OpenAI
Fast, multimodal model for general use with 128K context.
Context: 128K
Modalities: text, image, audio
- Active
GPT-4o mini
OpenAI
Low-cost, fast multimodal model for high-volume tasks.
Context: 128K
Modalities: text, image
- Active
GPT-5
OpenAI
OpenAI's frontier flagship model with long context and advanced reasoning.
Context: 272K
Modalities: text, image
- Active
GPT-5.1
OpenAI
OpenAI GPT-5.1.
Context: 272K
Modalities: text, image
- Active
GPT-5.2
OpenAI
OpenAI GPT-5.2.
Context: 272K
Modalities: text, image
- Active
GPT-5.3
OpenAI
OpenAI GPT-5.3.
Modalities: text
- Active
GPT-5.4
OpenAI
OpenAI GPT-5.4.
Context: 1.1M
Modalities: text, image
- Active
GPT-5.4 mini
OpenAI
Cost-efficient variant of GPT-5.4.
Context: 272K
Modalities: text, image
- Active
GPT-5.4 nano
OpenAI
Ultra-fast nano variant of GPT-5.4.
Context: 272K
Modalities: text, image
- Active
o1
OpenAI
Reasoning-focused model that thinks before answering.
Context: 200K
Modalities: text, image
- Deprecated
o1-mini
OpenAI
Smaller o1-series reasoning model, now superseded by o3-mini.
Context: 128K
Modalities: text
- Active
o3
OpenAI
OpenAI's most powerful reasoning model, successor to o1.
Context: 200K
Modalities: text, image
- Active
o3-mini
OpenAI
Cost-efficient reasoning model with strong STEM performance.
Context: 200K
Modalities: text
- Active
o3-pro
OpenAI
Highest-capability reasoning model.
Context: 200K
Modalities: text, image
- Active
o4-mini
OpenAI
Fast reasoning model.
Context: 200K
Modalities: text, image
- Deprecated
Grok 2
xAI
Previous-generation Grok model, superseded by Grok 3.
Context: 131K
Modalities: text, image
- Active
Grok 3
xAI
xAI's frontier reasoning model with real-time web access.
Context: 131K
Modalities: text, image
- Active
Grok 3 Mini
xAI
Fast and cost-efficient reasoning model.
Modalities: text
- Active
Grok 4
xAI
Most capable Grok model.
Context: 256K
Modalities: text
- Active
Grok 4 Fast
xAI
High-speed variant of Grok 4.
Modalities: text