Compare AI models by price, context window, and capability — open and closed, kept fresh by our agents.
59 models
Anthropic · Multimodal
Fast multimodal model with a 1M-token context window from Anthropic.
OpenAI · Multimodal
OpenAI's multimodal model built for massive file, image, and text inputs.
xAI · Multimodal
Multimodal model with 1M-token context for complex text and image tasks.
Google · Multimodal
Google's fast multimodal model for efficient text, image, and video tasks.
Multimodal model handling over a million tokens of context.
Anthropic's Claude Sonnet 4.6 excels at long-context multimodal analysis.
Handles million-token multimodal inputs with precision.
Multimodal model with a 2 million token context window.
Multimodal model with a million-token context for complex inputs.
Multimodal reasoning over million-token contexts.
Multimodal coding model with 400k-token context from OpenAI.
Fast multimodal model handling massive text, image, and file inputs.
Google's fast multimodal model for text, image, video and audio tasks.
Anthropic's fast multimodal model for large-scale text and image tasks.
OpenAI's multimodal model for large-scale text, image and file tasks.
MiniMax · Multimodal
Processes long multimodal sequences across text, images, and video.
Openrouter · Language Models
Routes complex code tasks through optimal models with 2M-token context.
Multimodal model for massive text, image, and file inputs.
Multimodal model for large-scale file, image, and text tasks.
Amazon · Multimodal
Amazon's multimodal model for long-context text, image, video and file analysis.
Google's multimodal model for long-context reasoning across media types.
Multimodal reasoning and long-context analysis from Anthropic.
Anthropic's closed multimodal model with 1M-token context.
Xiaomi · Multimodal
MiMo-V2.5 processes extended multimodal sequences across text, audio, image, and video.
Google's fast multimodal model for efficient text, image, video and audio tasks.
Xiaomi · Language Models
MiMo-V2.5-Pro manages million-token text contexts for complex tasks.
NVIDIA · Language Models
NVIDIA's Nemotron 3 Ultra handles million-token text contexts with ease.
Anthropic's multimodal model for large-scale text and image analysis.
Processes over a million tokens for long-form text tasks.
OpenAI · Image Models
OpenAI's multimodal image model handles vast contexts for visual tasks.
Mistral · Multimodal
Mistral's closed multimodal model for long-context text, image, and file tasks.
Multimodal AI from xAI for text and image tasks with large context.
Multimodal analysis with 200k-token context for complex inputs.
Moonshot AI · Multimodal
Processes extended text and image inputs for in-depth multimodal analysis.
Excels at long-context multimodal text and image tasks.
Bytedance-seed · Multimodal
Efficient multimodal model for long-context text, image, and video tasks.
Inclusionai · Language Models
Closed LLM with 262K context for extended text tasks.
Seed 1.6 processes image, text, and video with a 262k-token context.
Multimodal model specialized in code tasks with extensive context.
Kimi K2.6 processes long text and image inputs with a 262k-token context.
Ring-2.6-1T handles massive text contexts with closed-source precision.
Tencent · Language Models
Tencent's Hy3 preview manages 256k-token contexts for long-form text tasks.
Closed LLM built for massive 256K-token text contexts.
Z.AI · Language Models
GLM 5 Turbo handles massive text contexts with closed-source efficiency.
Stepfun · Multimodal
Multimodal model for long-context text, image, and video tasks.
Relace · Language Models
Long-context LLM built for precise search over massive text inputs.
Kwaipilot · Language Models
Closed-source coding LLM with 256k token context.
OpenAI · Language Models
OpenAI's closed LLM built for safe, reliable text handling.
MiniMax · Language Models
MiniMax M2.1 handles massive contexts as a closed-source text LLM.
GLM 5 manages long text contexts with closed-weight precision.
GLM 4.7 handles extended text contexts with precision.
Anthropic's fast multimodal model for efficient text and image processing.
Google · Image Models
Google's preview model for advanced image and text tasks.
Z.AI · Multimodal
Multimodal model for unified image, text, and video processing.
Ibm-granite · Language Models
IBM's compact 8B text LLM with 128k context for enterprise use.
Fusion excels at coherent text processing across massive contexts.
A closed-source LLM built for extended text processing and analysis.
Perceptron · Multimodal
Closed-source multimodal model handling text, image, and video inputs.
Rekaai · Multimodal
Reka Edge processes images, text, and video in a single multimodal workflow.