How can I access GPT-5.4 Mini?

Access is provided through OpenAI's standard API and platform interfaces.

Is GPT-5.4 Mini a multimodal model?

Yes, it is classified as a multimodal model capable of handling text and visual inputs.

What are typical pricing details for GPT-5.4 Mini?

Via OpenRouter, GPT-5.4 Mini is listed at $0.75 per 1M input tokens and $4.50 per 1M output tokens. OpenAI's developer pricing page lists the current first-party rates.

What use cases suit GPT-5.4 Mini best?

It is designed for tasks that combine large volumes of text with images or other visual data within a single context.

GPT-5.4 Mini

Verified

Multimodal model for large-scale file, image, and text processing.

OpenAI · Multimodal · Closed

Function calling · JSON mode · Structured outputs · Reasoning · Vision

Updated 2026-06-14

About GPT-5.4 Mini

As a closed-source model, GPT-5.4 Mini is not available as open weights, and its parameter count has not been published. Its design focuses on efficient multimodal fusion across files, images, and text. The architecture supports extended context lengths to maintain coherence over lengthy combined inputs.

Strengths include seamless processing of diverse data types without requiring separate specialized tools. It enables unified analysis sessions that span visual and textual elements alongside document files. Typical usage covers document review with embedded images, long-form content generation conditioned on visual references, and multimodal data exploration tasks.

Capabilities

Long-context reasoning

Multimodal understanding

Image interpretation

File content analysis

Text generation

Cross-modal integration

How GPT-5.4 Mini compares

GPT-5.4 Mini vs other multimodal models on intelligence, speed, and price.

Price

USD per 1M output tokens · Lower is better · GPT-5.4 Mini ranks #54 of 97

Model	Output /1M
Qwen3 VL 235B A22B Thinking	$2.6
Gemini 3 Flash Preview	$3.0
Kimi K2.6	$3.4
MoonshotAI Kimi Latest	$3.4
Kimi K2.7 Code	$3.5
GPT-5.4 Mini	$4.5
OpenAI GPT Mini Latest	$4.5
Claude Haiku 4.5	$5.0
Anthropic Claude Haiku Latest	$5.0
Grok 4.20 Multi-Agent	$6.0
Mistral Medium 3.5	$7.5
Gemini 3.5 Flash	$9.0

Sources: Artificial Analysis (intelligence, speed) · OpenRouter (price).

Best for

Long-document multimodal analysis

The model processes up to 400,000 tokens of combined text and image input, making it suitable for reviewing extensive reports that contain charts, diagrams, and supporting visuals.

Extended video transcript reasoning

With its large context window, GPT-5.4 Mini can maintain coherence across hours of transcribed video content while interpreting accompanying visual frames.

Complex cross-modal research queries

It excels at answering detailed questions that require simultaneous reference to lengthy textual sources and multiple embedded images or figures.
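The mixed text-and-image inputs described above can be sketched as a single user message with several content parts. This is a minimal illustration, assuming the OpenAI Chat Completions content-part shape (`type: "text"` and `type: "image_url"`) is accepted for this model by the provider you use; `buildVisionMessages` and the example URLs are hypothetical names introduced here, not part of the listing.

```javascript
// Build one user message that combines a text question with image references.
// Assumption: the provider accepts OpenAI-style content parts for this model.
function buildVisionMessages(question, imageUrls) {
  const parts = [{ type: "text", text: question }];
  for (const url of imageUrls) {
    parts.push({ type: "image_url", image_url: { url } });
  }
  return [{ role: "user", content: parts }];
}

// Placeholder URLs for illustration only.
const messages = buildVisionMessages(
  "Summarize the trend shown in these charts.",
  ["https://example.com/chart-1.png", "https://example.com/chart-2.png"]
);

console.log(JSON.stringify(messages, null, 2));
```

The resulting array would be passed as the `messages` field of a chat completion request.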

Strengths & limitations

Strengths

+Very large context window
+Native support for files, images, and text
+Flexible multimodal workflows
+Suitable for document-heavy tasks

Limitations

–Mini size may reduce depth on complex reasoning
–Performance depends on input quality across modalities
–Long contexts can increase latency
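Because long contexts can increase latency, it may help to estimate input size before sending a request. The sketch below checks a set of texts against the 400,000-token window listed on this page. The four-characters-per-token ratio is a rough heuristic for English text, not the model's tokenizer, and it ignores the token cost of images; `estimateTokens`, `fitsInContext`, and the reserved output budget are hypothetical names and values.

```javascript
const CONTEXT_WINDOW_TOKENS = 400000; // context window stated in this listing
const CHARS_PER_TOKEN = 4; // assumption: rough average for English text

// Approximate the token count of a string from its character length.
function estimateTokens(text) {
  return Math.ceil(text.length / CHARS_PER_TOKEN);
}

// Report whether the combined texts, plus room for the reply, fit the window.
function fitsInContext(texts, reservedOutputTokens = 4000) {
  const inputTokens = texts.reduce((sum, t) => sum + estimateTokens(t), 0);
  const fits = inputTokens + reservedOutputTokens <= CONTEXT_WINDOW_TOKENS;
  return { inputTokens, fits };
}

const check = fitsInContext(["a".repeat(400), "b".repeat(800)]);
console.log(check);
```

For a real budget, a tokenizer-based count would be more reliable than this character heuristic.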

Pricing by provider

Live per-provider pricing & uptime, routed via OpenRouter. Prices are USD per 1M tokens.

Provider	Input /1M	Output /1M	Context	Uptime
OpenAI	$0.75	$4.50	400K	99.3%
Azure	$0.75	$4.50	400K	100.0%

Cost calculator

Estimate what GPT-5.4 Mini would cost for your usage.

Input tokens / request · Output tokens / request · Requests / month

$0.00300 per request · $30 estimated / month

Based on GPT-5.4 Mini's $0.75/1M input · $4.50/1M output. Estimate only — actual cost varies by provider and caching.
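The arithmetic behind such an estimate can be sketched as follows, using the listed rates of $0.75 per 1M input tokens and $4.50 per 1M output tokens. The usage figures (1,000 input tokens, 500 output tokens, 10,000 requests per month) are assumed values chosen for illustration; the page does not state which inputs produced its displayed numbers. `estimateCost` is a hypothetical helper, and caching or provider differences are not modeled.

```javascript
const INPUT_USD_PER_M = 0.75; // listed input price per 1M tokens
const OUTPUT_USD_PER_M = 4.5; // listed output price per 1M tokens

// Cost of one request and of a month of identical requests, in USD.
function estimateCost(inputTokens, outputTokens, requestsPerMonth) {
  const perRequest =
    (inputTokens * INPUT_USD_PER_M + outputTokens * OUTPUT_USD_PER_M) / 1e6;
  return { perRequest, perMonth: perRequest * requestsPerMonth };
}

// Assumed usage: 1,000 input tokens, 500 output tokens, 10,000 requests/month.
const estimate = estimateCost(1000, 500, 10000);
console.log(estimate.perRequest.toFixed(5)); // "0.00300"
console.log(estimate.perMonth.toFixed(2)); // "30.00"
```

Output tokens dominate here: at these rates one output token costs six times as much as one input token.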

Quick start

OpenRouter's API is OpenAI-compatible — most SDKs work by just swapping the base URL. Only the model slug changes between models.

JavaScript · openai

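import OpenAI from "openai";

// Point the OpenAI SDK at OpenRouter's OpenAI-compatible endpoint.
const client = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: process.env.OPENROUTER_API_KEY, // read the key from the environment
});

// Send a single-turn chat request using the GPT-5.4 Mini model slug.
// Top-level await requires an ES module context.
const completion = await client.chat.completions.create({
  model: "openai/gpt-5.4-mini",
  messages: [{ role: "user", content: "Hello!" }],
});

// Print the assistant's reply.
console.log(completion.choices[0].message.content);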

Model slug: openai/gpt-5.4-mini
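Since the listing includes JSON mode among the model's features, a request for JSON output might be assembled as sketched below. It uses the `response_format: { type: "json_object" }` parameter from the OpenAI Chat Completions API; whether each provider honors it for this model is an assumption, and `buildJsonModeRequest` is a hypothetical helper. The system prompt mentions JSON explicitly because OpenAI's JSON mode expects the word to appear in the conversation.

```javascript
// Assemble chat completion parameters that ask for a JSON object reply.
// Assumption: the provider supports response_format for this model.
function buildJsonModeRequest(userPrompt) {
  return {
    model: "openai/gpt-5.4-mini",
    response_format: { type: "json_object" },
    messages: [
      { role: "system", content: "Reply with a single JSON object." },
      { role: "user", content: userPrompt },
    ],
  };
}

const params = buildJsonModeRequest(
  "List three uses for a 400K-token context window."
);

// With an OpenAI SDK client configured for OpenRouter, the call would be:
//   const completion = await client.chat.completions.create(params);
console.log(params.response_format.type);
```

Keeping the parameters in a plain object makes them easy to inspect or log before any network call is made.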

Editor's verdict

Our take on GPT-5.4 Mini

GPT-5.4 Mini is OpenAI's proprietary multimodal model with a 400K-token context window.

At $4.50 per 1M output tokens, it is mid-priced for its class, served by 2 providers.

It is available through OpenAI's API and aggregators like OpenRouter.

It is best suited to workloads that need a very large context window and native support for files, images, and text.

Frequently asked questions

What is the context window of GPT-5.4 Mini?

The model supports a context window of 400,000 tokens.

Other GPT models

Sibling versions in the GPT family from OpenAI.

GPT-5.5

OpenAI · Multimodal

Verified

OpenAI's multimodal model built for massive file, image, and text inputs.

Closed · II 50.8 · 1050K ctx · $30.00/1M out

GPT-5.5 Pro

OpenAI · Multimodal

Verified

Multimodal model handling over a million tokens of context.

Closed · 1050K ctx · $180.00/1M out

GPT-5.4

OpenAI · Multimodal

Verified

Multimodal model excelling at large-scale text, image and file tasks.

Closed · 1050K ctx · $15.00/1M out

GPT-5 Mini

OpenAI · Multimodal

Verified

Multimodal model handling massive text, image, and file contexts.

Closed · 400K ctx · $2.00/1M out

GPT-5.4 Pro

OpenAI · Multimodal

Verified

Multimodal model excelling at large-scale text, image, and file tasks.

Closed · 1050K ctx · $180.00/1M out

GPT Chat Latest

OpenAI · Multimodal

Verified

OpenAI's multimodal model for large-scale text, image and file tasks.

Closed · 400K ctx · $30.00/1M out
