GPT-5.4
VerifiedMultimodal model excelling at large-scale text, image and file tasks.
About GPT-5.4
GPT-5.4 was designed by OpenAI as a closed-weight system that integrates text, image and file modalities. Its architecture supports an exceptionally large context window, allowing it to ingest and reason over extensive combined inputs without requiring parameter details to be disclosed.
Strengths center on maintaining coherence across very long multimodal sequences. This enables reliable handling of documents that mix lengthy text passages with embedded images or attached files.
Typical usage includes research analysis, content summarization and workflow automation where users must process substantial multimodal datasets in a single session.
Capabilities
How GPT-5.4 compares
GPT-5.4 (striped bar) vs other multimodal on intelligence, speed and price.
Price
USD per 1M output tokens · Lower is better · GPT-5.4 ranks #77 of 97
Sources: Artificial Analysis (intelligence, speed) · OpenRouter (price).
Best for
Long-form document analysis
Processes and reasons over entire books, legal contracts, or research corpora within its 1,050,000-token context window while maintaining coherence across sections.
Multimodal research review
Combines vision understanding with text generation to interpret charts, diagrams, and images alongside accompanying documents for integrated insights.
Large-scale file processing
Handles multimodal file inputs to extract, summarize, and cross-reference data from mixed text, image, and structured formats in a single pass.
Strengths & limitations
Strengths
- +Handles extremely large contexts
- +Seamless text-image-file integration
- +Strong at document-level tasks
- +Flexible multimodal workflows
Limitations
- –No native audio or video support
- –Large context can increase latency
- –May need careful prompting for complex tasks
Pricing by provider
Live per-provider pricing & uptime, routed via OpenRouter. Prices are USD per 1M tokens.
| Provider | Input /1M | Output /1M | Context | Uptime |
|---|---|---|---|---|
| OpenAI | $2.50 | $15.00 | 1050K | 99.1% |
| Azure | $2.50 | $15.00 | 1050K | 100.0% |
Cost calculator
Estimate what GPT-5.4 would cost for your usage.
Based on GPT-5.4's $2.50/1M input · $15.00/1M output. Estimate only — actual cost varies by provider and caching.
Quick start
OpenRouter's API is OpenAI-compatible — most SDKs work by just swapping the base URL. Only the model slug changes between models.
import OpenAI from "openai";
const client = new OpenAI({
baseURL: "https://openrouter.ai/api/v1",
apiKey: process.env.OPENROUTER_API_KEY,
});
const completion = await client.chat.completions.create({
model: "openai/gpt-5.4",
messages: [{ role: "user", content: "Hello!" }],
});
console.log(completion.choices[0].message.content);Model slug: openai/gpt-5.4
Editor's verdict
GPT-5.4 is OpenAI's proprietary multimodal with a 1050K-token context window.
At $15.00 per 1M output tokens, it is premium-priced for its class, served by 2 providers.
It is available through OpenAI's API and aggregators like OpenRouter.
Best suited to handles extremely large contexts and seamless text-image-file integration.
Frequently asked questions
GPT-5.4 provides a context window of 1,050,000 tokens.
User reviews
Real, verified reviews from the community shape this model's rating.
Loading reviews…
Other GPT models
Sibling versions in the GPT family from OpenAI.