Grok 4.20
VerifiedMultimodal model with a 2 million token context window.
About Grok 4.20
Grok 4.20 was developed as a closed-source multimodal system by xAI. It accepts text, images, and files as input modalities. The 2 million token context window enables processing of extensive combined inputs.
Its design emphasizes integration of multiple data types in a single session. This supports workflows that combine visual and textual information at scale. Access remains restricted due to its non-open-weight status.
Typical usage includes analysis of large documents paired with images or files. Developers and researchers apply it to tasks requiring sustained context across modalities. Updates and capabilities are managed directly by xAI.
Capabilities
How Grok 4.20 compares
Grok 4.20 (striped bar) vs other multimodal on intelligence, speed and price.
Price
USD per 1M output tokens · Lower is better · Grok 4.20 ranks #32 of 67
Sources: Artificial Analysis (intelligence, speed) · OpenRouter (price).
Best for
Enterprise Document Analysis
Grok 4.20 processes complex documents spanning millions of tokens, integrating text, images, and data tables for comprehensive summarization and insight extraction in a single pass.
Multimodal Research Workflows
It combines long-context reasoning with vision understanding to interpret charts, diagrams, and visual data alongside extensive textual sources for academic or technical research.
Large File Processing Pipelines
The model performs file analysis on massive multimodal inputs, enabling tasks like cross-referencing visual elements within lengthy reports or datasets without context truncation.
Strengths & limitations
Strengths
- +Handles extremely large contexts up to 2M tokens
- +Native support for text, image, and file inputs
- +Multimodal integration in a single model
Limitations
- –No audio or video modality support
- –Very large context can increase latency
- –Performance depends on input quality and structure
Pricing by provider
Live per-provider pricing & uptime, routed via OpenRouter. Prices are USD per 1M tokens.
| Provider | Input /1M | Output /1M | Context | Uptime |
|---|---|---|---|---|
| xAI | $1.25 | $2.50 | 2000K | 100.0% |
Cost calculator
Estimate what Grok 4.20 would cost for your usage.
Based on Grok 4.20's $1.25/1M input · $2.50/1M output. Estimate only — actual cost varies by provider and caching.
Quick start
OpenRouter's API is OpenAI-compatible — most SDKs work by just swapping the base URL. Only the model slug changes between models.
import OpenAI from "openai";
const client = new OpenAI({
baseURL: "https://openrouter.ai/api/v1",
apiKey: process.env.OPENROUTER_API_KEY,
});
const completion = await client.chat.completions.create({
model: "x-ai/grok-4.20",
messages: [{ role: "user", content: "Hello!" }],
});
console.log(completion.choices[0].message.content);Model slug: x-ai/grok-4.20
Editor's verdict
Grok 4.20 is xAI's proprietary multimodal with a 2000K-token context window.
At $2.50 per 1M output tokens, it is mid-priced for its class, served by 1 provider.
It is available through xAI's API and aggregators like OpenRouter.
Best suited to handles extremely large contexts up to 2m tokens and native support for text, image, and file inputs.
Frequently asked questions
Grok 4.20 handles up to 2,000,000 tokens of context for extended reasoning sessions.
User reviews
Real, verified reviews from the community shape this model's rating.
Loading reviews…
Other Grok models
Sibling versions in the Grok family from xAI.