Compare large language models side by side. Filter GPT-4o, Claude, Gemini, Llama, Mistral, DeepSeek, and Qwen models by intelligence benchmarks like GPQA Diamond, HLE, and LiveCodeBench. Sort by API pricing, tokens per second, and overall score. Find the best LLM for your use case.
All Models
| Model | Intelligence | Cost | Speed |
|---|---|---|---|
| Gemini 3.1 Pro | ~81 | $4.50 | 130tok/s |
| GPT-5.5 | ~80 | $11.25 | 78tok/s |
| GPT-5.4 | ~78 | $5.63 | 93tok/s |
| Claude Opus 4.7 | ~77 | $10.00 | 86tok/s |
| Gemini 3 Pro | 76 | $4.50 | 142tok/s |
| DeepSeek V4 Pro | 76 | $2.17 | 56tok/s |
| Gemini 3 Flash | 75 | $1.13 | 184tok/s |
| GPT-5.2 | 74 | $4.81 | 88tok/s |
| Claude Opus 4.6 | 72 | $10.00 | 52tok/s |
| DeepSeek V4 Flash | ~71 | $0.18 | 79tok/s |
| GPT-5.1 | 69 | $3.44 | 136tok/s |
| Claude Opus 4.5 | 69 | $10.00 | 88tok/s |
| Grok 4.20 | ~69 | $3.00 | 98tok/s |
| Kimi K2.5 | 69 | $1.07 | 345tok/s |
| GPT-5.4 mini | ~68 | $1.69 | 171tok/s |
| Qwen3.5 397B | 68 | $1.35 | 74tok/s |
| Qwen3.6 Plus | ~68 | $1.13 | — |
| GLM-4.7 | 68 | $1.00 | 136tok/s |
| GPT-5 | 67 | $3.44 | 98tok/s |
| Grok 4 | 66 | $6.00 | 60tok/s |
| Claude Sonnet 4.6 | 65 | $6.00 | 46tok/s |
| DeepSeek V3.2 | 65 | $0.32 | 219tok/s |
| GPT-5.4 nano | ~64 | $0.46 | 162tok/s |
| Qwen3.6-27B | 64 | $1.35 | 65tok/s |
| GPT-5 mini | 63 | $0.69 | 75tok/s |
| o3 | 62 | $3.50 | 141tok/s |
| Gemini 2.5 Pro | 62 | $3.44 | 139tok/s |
| Grok 4.1 Fast | 62 | $0.28 | 107tok/s |
| Grok 4 Fast | 62 | $0.28 | 145tok/s |
| Qwen3.6-35B-A3B | 62 | $0.56 | 200tok/s |
| GPT-OSS 120B | 61 | $0.20 | 2951tok/s |
| o4-mini | 60 | $1.93 | 134tok/s |
| Gemini 3.1 Flash-Lite | 59 | $0.56 | 307tok/s |
| Claude Sonnet 4.5 | 57 | $6.00 | 103tok/s |
| DeepSeek R1 | 57 | $2.36 | 306tok/s |
| GLM-5.1 | ~57 | $2.15 | 163tok/s |
| MiniMax M2.5 | 57 | $0.52 | 273tok/s |
| DeepSeek V3.1 | 55 | $0.75 | 347tok/s |
| GLM-5 | 55 | $1.50 | 266tok/s |
| Gemini 2.5 Flash | 52 | $0.85 | 282tok/s |
| Magistral Medium | 51 | $2.75 | 29tok/s |
| GPT-OSS 20B | 50 | $0.09 | 962tok/s |
| GPT-5 nano | 49 | $0.14 | 131tok/s |
| o1 | 48 | $26.25 | 174tok/s |
| Grok Code Fast 1 | 47 | $0.53 | 144tok/s |
| Kimi K2 | 45 | $1.07 | 36tok/s |
| Claude Haiku 4.5 | 37 | $2.00 | 109tok/s |
| Mistral Large 3 | 37 | $0.75 | 147tok/s |
| GPT-4.1 | 36 | $3.50 | 104tok/s |
| GPT-4.1 mini | 36 | $0.70 | 78tok/s |
| Llama 4 Maverick | 34 | $0.30 | 434tok/s |
| DeepSeek V3 | 28 | $0.52 | 120tok/s |
| Llama 4 Scout | 27 | $0.17 | 448tok/s |
| GPT-4.1 nano | 24 | $0.18 | 142tok/s |
| Command A | 24 | $4.38 | 49tok/s |
| GPT-4o | 23 | $4.38 | 170tok/s |
| GPT-4o mini | 17 | $0.26 | 54tok/s |
| Command R+ | 9 | $4.38 | 59tok/s |
| Mistral Small 4 | — | $0.26 | 157tok/s |
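
The filter-and-sort workflow described above can be sketched in a few lines of Python. This is a minimal illustration, not how the site itself works: the `Model` class and the `shortlist` and `best_value` helpers are hypothetical names, the rows are a small sample copied from the table with the `~` markers dropped, and the Cost column is used exactly as listed because the table does not state its unit.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Model:
    name: str
    intelligence: Optional[float]  # None when the table lists no score
    cost_usd: float                # Cost column as listed; unit not stated in the table
    speed_tok_s: float             # Speed column, tokens per second


# A sample of rows copied from the table above (hypothetical container name).
MODELS = [
    Model("Gemini 3.1 Pro", 81, 4.50, 130),
    Model("GPT-5.5", 80, 11.25, 78),
    Model("DeepSeek V4 Pro", 76, 2.17, 56),
    Model("Gemini 3 Flash", 75, 1.13, 184),
    Model("DeepSeek V4 Flash", 71, 0.18, 79),
    Model("Kimi K2.5", 69, 1.07, 345),
    Model("GPT-OSS 120B", 61, 0.20, 2951),
    Model("Mistral Small 4", None, 0.26, 157),
]


def shortlist(models: List[Model], min_intelligence: float,
              max_cost: float, min_speed: float = 0.0) -> List[Model]:
    """Keep scored models that meet every threshold.

    Results are ordered by highest intelligence first, then lowest cost.
    """
    kept = [
        m for m in models
        if m.intelligence is not None
        and m.intelligence >= min_intelligence
        and m.cost_usd <= max_cost
        and m.speed_tok_s >= min_speed
    ]
    return sorted(kept, key=lambda m: (-m.intelligence, m.cost_usd))


def best_value(models: List[Model]) -> Model:
    """Return the scored model with the most intelligence points per dollar."""
    scored = [m for m in models if m.intelligence is not None]
    return max(scored, key=lambda m: m.intelligence / m.cost_usd)


if __name__ == "__main__":
    for m in shortlist(MODELS, min_intelligence=70, max_cost=2.50):
        print(f"{m.name}: {m.intelligence} / ${m.cost_usd:.2f} / {m.speed_tok_s} tok/s")
    print("Best value:", best_value(MODELS).name)
```

Models without a listed score are excluded from both helpers instead of being treated as zero, so a missing benchmark result does not push a model to the bottom of a ranking it was never part of.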