GPT-5.4 Mini
Verified · Multimodal model for large-scale file, image, and text processing.
About GPT-5.4 Mini
GPT-5.4 Mini is a closed-source model: its weights are not openly available, and its parameter count has not been published. Its design focuses on efficient multimodal fusion across files, images, and text. The architecture supports extended context lengths so the model can stay coherent over lengthy combined inputs.
Strengths include seamless processing of diverse data types without requiring separate specialized tools. It enables unified analysis sessions that span visual and textual elements alongside document files. Typical usage covers document review with embedded images, long-form content generation conditioned on visual references, and multimodal data exploration tasks.
How GPT-5.4 Mini compares
Chart: GPT-5.4 Mini compared with other multimodal models on intelligence, speed, and price.
Price (USD per 1M output tokens, lower is better): GPT-5.4 Mini ranks #54 of 97.
Sources: Artificial Analysis (intelligence, speed) · OpenRouter (price).
Best for
Long-document multimodal analysis
The model processes up to 400,000 tokens of combined text and image input, making it suitable for reviewing extensive reports that contain charts, diagrams, and supporting visuals.
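As a rough illustration of planning around that limit, the sketch below checks whether a document is likely to fit in the 400,000-token window. The 4-characters-per-token ratio, the caller-supplied image token count, the output reservation, and the helper names `estimateTextTokens` and `fitsInContext` are all assumptions made for this example; they are not documented behavior of the model, and a real tokenizer will give different counts.

```typescript
// Context window as stated on this page.
const CONTEXT_WINDOW_TOKENS = 400000;

// Assumption: roughly 4 characters per token for English text.
// This is only a heuristic; real tokenizers vary by language and content.
const ASSUMED_CHARS_PER_TOKEN = 4;

// Hypothetical helper: crude token estimate from character count.
function estimateTextTokens(text: string): number {
  return Math.ceil(text.length / ASSUMED_CHARS_PER_TOKEN);
}

// Hypothetical helper: image token usage is supplied by the caller because
// this page does not say how images are counted. Reserving room for the
// reply is also an assumption, made to stay on the safe side.
function fitsInContext(
  text: string,
  imageTokens: number,
  reservedForOutput: number,
): boolean {
  const total = estimateTextTokens(text) + imageTokens + reservedForOutput;
  return total <= CONTEXT_WINDOW_TOKENS;
}

// A 1.2M-character report is estimated at about 300,000 tokens.
const report = "x".repeat(1200000);
console.log(fitsInContext(report, 50000, 20000)); // prints true
```

For anything close to the limit, the token usage reported back by the API is a more reliable guide than this estimate.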
Extended video transcript reasoning
With its large context window, GPT-5.4 Mini can maintain coherence across hours of transcribed video content while interpreting accompanying visual frames.
Complex cross-modal research queries
It excels at answering detailed questions that require simultaneous reference to lengthy textual sources and multiple embedded images or figures.
Strengths & limitations
Strengths
- Very large context window
- Native support for files, images, and text
- Flexible multimodal workflows
- Suitable for document-heavy tasks
Limitations
- Mini size may reduce depth on complex reasoning
- Performance depends on input quality across modalities
- Long contexts can increase latency
Pricing by provider
Live per-provider pricing & uptime, routed via OpenRouter. Prices are USD per 1M tokens.
| Provider | Input /1M | Output /1M | Context | Uptime |
|---|---|---|---|---|
| OpenAI | $0.75 | $4.50 | 400K | 99.3% |
| Azure | $0.75 | $4.50 | 400K | 100.0% |
Cost calculator
Estimate what GPT-5.4 Mini would cost for your usage.
Based on GPT-5.4 Mini's $0.75/1M input · $4.50/1M output. Estimate only — actual cost varies by provider and caching.
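The same estimate can be reproduced by hand. The sketch below uses the listed prices of $0.75 per 1M input tokens and $4.50 per 1M output tokens; the function name `estimateCostUsd` is hypothetical, and the calculation ignores caching discounts and provider differences, so treat the result as an approximation.

```typescript
// Prices as listed on this page (USD per 1M tokens).
const INPUT_USD_PER_MILLION = 0.75;
const OUTPUT_USD_PER_MILLION = 4.5;

// Hypothetical helper: estimated cost in USD for a given token usage.
// Does not account for caching or per-provider price differences.
function estimateCostUsd(inputTokens: number, outputTokens: number): number {
  if (inputTokens < 0 || outputTokens < 0) {
    throw new RangeError("token counts must be non-negative");
  }
  const inputCost = (inputTokens / 1e6) * INPUT_USD_PER_MILLION;
  const outputCost = (outputTokens / 1e6) * OUTPUT_USD_PER_MILLION;
  return inputCost + outputCost;
}

// Example: 2M input tokens and 1M output tokens.
console.log(estimateCostUsd(2000000, 1000000).toFixed(2)); // prints "6.00"
```

Output tokens cost six times as much as input tokens at these prices, so workloads that read long documents but return short answers are cheaper than the raw token totals suggest.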
Quick start
OpenRouter's API is OpenAI-compatible — most SDKs work by just swapping the base URL. Only the model slug changes between models.
import OpenAI from "openai";

// Point the OpenAI SDK at OpenRouter's OpenAI-compatible endpoint.
const client = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: process.env.OPENROUTER_API_KEY,
});

// Send a single user message to GPT-5.4 Mini.
const completion = await client.chat.completions.create({
  model: "openai/gpt-5.4-mini",
  messages: [{ role: "user", content: "Hello!" }],
});

console.log(completion.choices[0].message.content);

Model slug: openai/gpt-5.4-mini
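Because the model accepts images alongside text, a request will often carry both. The sketch below builds a user message in the OpenAI-style chat format, where `content` is an array of text and `image_url` parts. The helper name `buildImageQuestion` and the example URL are placeholders invented for illustration, and how this particular model and provider handle image inputs should be confirmed against their documentation.

```typescript
// Content-part shapes used by OpenAI-compatible chat completion requests.
type TextPart = { type: "text"; text: string };
type ImagePart = { type: "image_url"; image_url: { url: string } };
type UserMessage = { role: "user"; content: Array<TextPart | ImagePart> };

// Hypothetical helper: combines one question with any number of image URLs
// into a single user message, text first and images after.
function buildImageQuestion(question: string, imageUrls: string[]): UserMessage {
  const imageParts: ImagePart[] = imageUrls.map((url) => ({
    type: "image_url",
    image_url: { url },
  }));
  return {
    role: "user",
    content: [{ type: "text", text: question }, ...imageParts],
  };
}

const message = buildImageQuestion("What does this chart show?", [
  "https://example.com/chart.png", // placeholder URL, not a real asset
]);
console.log(JSON.stringify(message, null, 2));
```

The resulting object would be passed as an element of `messages` in a `client.chat.completions.create` call, with `model` set to `openai/gpt-5.4-mini`.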
Editor's verdict
GPT-5.4 Mini is OpenAI's proprietary multimodal model with a 400K-token context window.
At $4.50 per 1M output tokens, it is mid-priced for its class and is served by two providers.
It is available through OpenAI's API and aggregators like OpenRouter.
It is best suited to workloads that need a very large context window and native support for files, images, and text.
Frequently asked questions
The model supports a context window of 400,000 tokens.