MODELYST
Capability × cost × speed

Frontier

Capability = Modelyst cross-benchmark percentile · price & speed = Artificial Analysis medians · verified Jun 15, 2026 · methodology
The efficient frontier — capability vs price
252 models
[Scatter plot. x-axis: blended price, USD per 1M tokens (log scale, $0.1 to $10). y-axis: capability score (20 to 100). Labeled points: Gemini 3.1 Pro Preview, DeepSeek V4 Flash, MiMo-V2-Flash, Qwen3.5 9B, HyperNova 60B 2605, Qwen3.5 4B, Sarvam 30B, Gemma 3n E4B Instruct, Qwen3.5 0.8B.]
Legend: on the frontier (nothing is both cheaper and better) · everything else
Up and to the left wins: the gold staircase is the set of models nothing beats on both price and capability at once.
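The frontier rule above ("nothing is both cheaper and better") can be sketched as a single sweep over models sorted by price. This is a minimal illustration, not Modelyst's actual implementation: the `Model` type, the `efficient_frontier` function, and the tie handling (a model that only matches an existing capability at a higher price is left off) are assumptions. The sample values are rounded figures taken from the chart data.

```python
from typing import NamedTuple


class Model(NamedTuple):
    name: str
    price: float       # blended price, USD per 1M tokens
    capability: float  # capability score, 0-100


def efficient_frontier(models):
    """Return the models that no other model beats on both price and capability.

    Assumed tie rule: a model must be strictly more capable than every
    cheaper-or-equal-priced model to count as being on the frontier.
    """
    # Cheapest first; at equal price, the more capable model comes first.
    ordered = sorted(models, key=lambda m: (m.price, -m.capability))
    frontier = []
    best_so_far = float("-inf")
    for m in ordered:
        if m.capability > best_so_far:
            frontier.append(m)
            best_so_far = m.capability
    return frontier


# A small subset of the chart data (rounded capability scores).
SAMPLE = [
    Model("GPT-5.5", 11.25, 97),
    Model("Qwen3.5 0.8B", 0.02, 19),
    Model("Gemma 3 4B Instruct", 0.05, 30),
    Model("Gemma 3n E4B Instruct", 0.025, 32),
    Model("Llama 3.2 Instruct 1B", 0.05, 16),
    Model("Qwen3.5 4B", 0.06, 65),
    Model("GPT-5 mini", 0.688, 87),
    Model("gpt-oss-20b", 0.088, 72),
    Model("DeepSeek V4 Flash", 0.175, 91),
    Model("Gemini 3.1 Pro Preview", 4.5, 98),
]

frontier_names = [m.name for m in efficient_frontier(SAMPLE)]
print(frontier_names)
```

Within this subset, GPT-5.5 drops out because Gemini 3.1 Pro Preview is both cheaper and scores higher; the same sweep over the full model set would be needed to reproduce the whole staircase.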
The race — capability over time
gold = set a new record on release
[Scatter plot. x-axis: release date (2024 to 2026.06). y-axis: capability score (20 to 100). Labeled points: Gemini 3.1 Pro Preview, GPT-5.1, Grok 3 mini Reasoning, Claude 3.5 Sonnet, Claude 3.5 Sonnet (June '24), Claude 3 Opus, Mistral Large (Feb '24), Solar Mini.]
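The "set a new record on release" rule can be read as a running maximum over models ordered by release date. The sketch below is one possible interpretation, not the site's code: the `record_setters` function is hypothetical, release dates are modeled as `(year, month)` tuples, and a model must strictly exceed the previous best to count. How the page orders releases within the same month, and how it treats ties, is not stated.

```python
def record_setters(releases):
    """Return names of models that beat every earlier release's score.

    `releases` is an iterable of (name, (year, month), score) tuples.
    Assumption: a tie with the current best does not count as a record.
    """
    # sorted() is stable, so same-month releases keep their input order.
    ordered = sorted(releases, key=lambda r: r[1])
    records = []
    best_so_far = float("-inf")
    for name, _released, score in ordered:
        if score > best_so_far:
            records.append(name)
            best_so_far = score
    return records


# A small subset of the chart data, deliberately out of order.
SAMPLE_RELEASES = [
    ("o1", (2024, 11), 76),
    ("Solar Mini", (2024, 1), 16),
    ("GPT-4o mini", (2024, 7), 45),
    ("Claude 3 Opus", (2024, 2), 48),
    ("Llama 3 Instruct 70B", (2024, 4), 35),
    ("Claude 3.5 Sonnet (June '24)", (2024, 6), 58),
    ("Claude 3.5 Sonnet", (2024, 10), 65),
]

record_names = record_setters(SAMPLE_RELEASES)
print(record_names)
```

In this subset, Llama 3 Instruct 70B and GPT-4o mini are not records because an earlier release already scored higher.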
Throughput value — output speed vs price
203 models
[Scatter plot. x-axis: blended price, USD per 1M tokens (log scale, $0.1 to $10). y-axis: output tokens / second (0 to 1400). Labeled points: Mercury 2, Granite 3.3 8B, HyperNova 60B 2605, Sarvam 30B, Gemma 3n E4B Instruct.]

On the frontier · 16

cheapest → most capable
Model                           Provider              Cap   $/1M    tok/s  Weights
Qwen3.5 0.8B                    Alibaba               19.3  $0.02   20     open
Gemma 3n E4B Instruct           Google                32.2  $0.025  40     open
Qwen3.5 2B                      Alibaba               36.6  $0.04   21     open
Sarvam 30B                      Sarvam                38.9  $0.047  240
Qwen3.5 4B                      Alibaba               65.3  $0.06   23     open
HyperNova 60B 2605              Multiverse Computing  69.8  $0.065  347
gpt-oss-20b                     openai                72.5  $0.088  252    open
NVIDIA Nemotron 3 Nano 30B A3B  NVIDIA                73.1  $0.096  85     open
Qwen3.5 9B                      Alibaba               73.6  $0.113  65     open
MiMo-V2-Flash                   Xiaomi                87.7  $0.15   154
DeepSeek V4 Flash               DeepSeek              91.4  $0.175  114    open
MiniMax-M3                      MiniMax               94.6  $0.525  59     open
MiMo-V2.5-Pro                   Xiaomi                95.5  $0.544  43
Kimi K2.6                       Kimi                  96.0  $1.71   46     open
Qwen3.7 Max                     Alibaba               96.1  $3.75   199    open
Gemini 3.1 Pro Preview          Google                97.9  $4.5    142

Every point links to the model's page — scores with sources, latency, and the research behind it. Compare any of them head-to-head on Compare.

Workload cost calculator

Tokens per task × tasks per day → what each model actually costs to run, and how long a task takes.

Model                       Provider              Cap  $ / task  $ / day  time / task
Qwen3.5 0.8B                Alibaba               19   $0        $0.175   15s
Gemma 3n E4B Instruct       Google                32   $0.0001   $0.26    8.0s
Qwen3.5 2B                  Alibaba               37   $0.0001   $0.35    15s
Sarvam 30B                  Sarvam                39   $0.0001   $0.425   2.4s
LFM2 24B A2B                Liquid AI             29   $0.0001   $0.48    2.5s
Gemma 3 4B Instruct         Google                30   $0.0001   $0.52
Qwen3.5 4B                  Alibaba               65   $0.0001   $0.525   14s
Nova Micro                  Amazon                34   $0.0001   $0.56    1.6s
Llama 3.2 Instruct 1B       Meta                  16   $0.0001   $0.575   4.0s
HyperNova 60B 2605          Multiverse Computing  70   $0.0001   $0.61    1.4s
NVIDIA-Nemotron-Nano-9B-v2  nvidia                49   $0.0001   $0.64    19s
Granite 4.1 8B              IBM                   35   $0.0001   $0.65    2.9s

Price arithmetic from live per-token prices (Artificial Analysis medians) — not a measured task benchmark. Time per task = latency + output tokens ÷ speed; ignores caching, rate limits and retries. For reasoning models, latency includes thinking time.
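The arithmetic described above can be sketched in a few lines. This is an illustration under stated assumptions, not the calculator's actual code: the `Workload` type and `estimate` function are hypothetical, and a single blended price is applied to input and output tokens alike, whereas the page may price them separately. The workload sizes and the 1.0 s latency in the example are invented for illustration; only the Sarvam 30B price ($0.047 per 1M tokens) and speed (240 tokens/s) come from the tables above.

```python
from dataclasses import dataclass


@dataclass
class Workload:
    input_tokens: int    # prompt tokens per task (hypothetical)
    output_tokens: int   # generated tokens per task (hypothetical)
    tasks_per_day: int


def estimate(workload, price_per_million, tokens_per_second, latency_s):
    """Return (cost per task, cost per day, seconds per task).

    Simplification: one blended USD price per 1M tokens covers both
    input and output. Caching, rate limits and retries are ignored.
    """
    tokens = workload.input_tokens + workload.output_tokens
    cost_per_task = tokens * price_per_million / 1_000_000
    cost_per_day = cost_per_task * workload.tasks_per_day
    # Time per task = latency + output tokens / speed.
    time_per_task = latency_s + workload.output_tokens / tokens_per_second
    return cost_per_task, cost_per_day, time_per_task


# Invented workload: 1,000 input + 300 output tokens, 5,000 tasks a day.
example = Workload(input_tokens=1000, output_tokens=300, tasks_per_day=5000)
cost_task, cost_day, seconds = estimate(
    example,
    price_per_million=0.047,  # Sarvam 30B blended price from the table
    tokens_per_second=240,    # Sarvam 30B output speed from the table
    latency_s=1.0,            # hypothetical latency
)
print(f"${cost_task:.6f} per task, ${cost_day:.4f} per day, {seconds:.2f}s per task")
```

Because the workload and latency here are made up, the numbers will not match the calculator rows above; the point is only the shape of the computation.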