LLM Leaderboard

AI Models Leaderboard

Live benchmark comparison of frontier AI models — context window, pricing, speed, and capability scores side by side.

#ModelProvider ArenaContextGPQASWE In/Out Speed
1
Google
Gemini 3 Pro
Google14551,000K86%76%$2/$12
2
OpenAI
GPT-5
OpenAI1450400K85%75%$1.25/$10
3
Anthropic
Claude Opus 4.5
Anthropic1440200K87%80%$5/$25
4
xAI
Grok 4
xAI1420256K84%72%$3/$15
5
Anthropic
Claude Sonnet 4.5
Anthropic1415200K83%77%$3/$15
6
DeepSeek
DeepSeek V3.2
Open Source
DeepSeek1390128K79%66%$0.28/$0.42
7
Alibaba
Qwen3 Max
Open Source
Alibaba1375256K78%64%$0.4/$1.2
8
Mistral AI
Mistral Large 3
Mistral AI1360256K75%60%$2/$6
9
Meta
Llama 4 Maverick
Open Source
Meta13401,000K70%55%$0.2/$0.6
10
Amazon
Amazon Nova Pro
Amazon1320300K68%50%$0.8/$3.2

Prices shown per 1M tokens. Benchmarks: GPQA Diamond, SWE-Bench, Chatbot Arena Elo. Figures are sourced from public provider data and may change.