How do I access Grok 4.20?

Access is provided through xAI platforms supporting multimodal inputs and outputs.

What is the pricing model for Grok 4.20?

Pricing details are listed on the xAI website and depend on usage volume and subscription tier.

Can Grok 4.20 analyze images within documents?

Yes, it supports vision understanding and multimodal text-image reasoning for files containing visuals.

What file types work best with its analysis features?

It excels with complex documents, PDFs, and multimodal files requiring integrated text and image processing.

Grok 4.20

Verified

Multimodal model with a 2 million token context window.

xAIMultimodalClosed

Function callingJSON modeStructured outputsReasoningVision

Model page

Updated 2026-06-14

About Grok 4.20

Grok 4.20 was developed as a closed-source multimodal system by xAI. It accepts text, images, and files as input modalities. The 2 million token context window enables processing of extensive combined inputs.

Its design emphasizes integration of multiple data types in a single session. This supports workflows that combine visual and textual information at scale. Access remains restricted due to its non-open-weight status.

Typical usage includes analysis of large documents paired with images or files. Developers and researchers apply it to tasks requiring sustained context across modalities. Updates and capabilities are managed directly by xAI.

Capabilities

Long-context reasoning

Vision understanding

File analysis and processing

Multimodal text-image reasoning

Complex document handling

How Grok 4.20 compares

Grok 4.20 (striped bar) vs other multimodal on intelligence, speed and price.

Price

USD per 1M output tokens · Lower is better · Grok 4.20 ranks #32 of 67

$2.0

Devstral 2 2512

$2.0

Grok Build 0.1

$2.0

Seed 1.6

$2.0

Qwen3.6 27B

$2.0

Kimi K2.5

$2.5

Grok 4.3

$2.5

Grok 4.20

$2.5

Nova 2 Lite

$2.6

Qwen3 VL 235B A22B Thinking

$3.4

Kimi K2.6

$3.4

MoonshotAI Kimi Latest

$3.5

Kimi K2.7 Code

$4.5

GPT-5.4 Mini

Sources: Artificial Analysis (intelligence, speed) · OpenRouter (price).

Best for

Enterprise Document Analysis

Grok 4.20 processes complex documents spanning millions of tokens, integrating text, images, and data tables for comprehensive summarization and insight extraction in a single pass.

Multimodal Research Workflows

It combines long-context reasoning with vision understanding to interpret charts, diagrams, and visual data alongside extensive textual sources for academic or technical research.

Large File Processing Pipelines

The model performs file analysis on massive multimodal inputs, enabling tasks like cross-referencing visual elements within lengthy reports or datasets without context truncation.

Strengths & limitations

Strengths

+Handles extremely large contexts up to 2M tokens
+Native support for text, image, and file inputs
+Multimodal integration in a single model

Limitations

–No audio or video modality support
–Very large context can increase latency
–Performance depends on input quality and structure

Pricing by provider

Live per-provider pricing & uptime, routed via OpenRouter. Prices are USD per 1M tokens.

Provider	Input /1M	Output /1M	Context	Uptime
xAI	$1.25	$2.50	2000K	100.0%

Cost calculator

Estimate what Grok 4.20 would cost for your usage.

Input tokens / requestOutput tokens / requestRequests / month

$0.00250

per request

$25

estimated / month

Based on Grok 4.20's $1.25/1M input · $2.50/1M output. Estimate only — actual cost varies by provider and caching.

Quick start

OpenRouter's API is OpenAI-compatible — most SDKs work by just swapping the base URL. Only the model slug changes between models.

JavaScript · openai

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: process.env.OPENROUTER_API_KEY,
});

const completion = await client.chat.completions.create({
  model: "x-ai/grok-4.20",
  messages: [{ role: "user", content: "Hello!" }],
});

console.log(completion.choices[0].message.content);

Model slug: x-ai/grok-4.20

Editor's verdict

Our take on Grok 4.20

Grok 4.20 is xAI's proprietary multimodal with a 2000K-token context window.

At $2.50 per 1M output tokens, it is mid-priced for its class, served by 1 provider.

It is available through xAI's API and aggregators like OpenRouter.

Best suited to handles extremely large contexts up to 2m tokens and native support for text, image, and file inputs.

Did you find this helpful?

Frequently asked questions

Grok 4.20 handles up to 2,000,000 tokens of context for extended reasoning sessions.

User reviews

Real, verified reviews from the community shape this model's rating.

Loading reviews…

Other Grok models

Sibling versions in the Grok family from xAI.

Grok 4.3

xAI · Multimodal

Verified

Multimodal model with 1M-token context for complex text and image tasks.

ClosedII 43.91000K ctx$2.50/1M out

Grok 4.20 Multi-Agent

xAI · Multimodal

Verified

Multi-agent multimodal model for massive context tasks

Closed2000K ctx$6.00/1M out

Grok Build 0.1

xAI · Multimodal

Verified

Multimodal AI from xAI for text and image tasks with large context.

Closed256K ctx$2.00/1M out

Promote Grok 4.20

Add this badge to your website, or share the tool.

DFeatured on DhanasviGrok 4.20 2

Grok 4.20

About Grok 4.20

Capabilities