GPT-5

Verified

Multimodal model with 400k-token context for complex inputs.

OpenAI · Multimodal · Closed
Vision
Model page
Updated 2026-06-14

About GPT-5

GPT-5 is a closed-weight multimodal model developed by OpenAI. It accepts text, image, and file inputs within a single 400,000-token context window. The design emphasizes unified handling of diverse data types; parameter details are not public.

Its primary strengths lie in maintaining coherence across lengthy multimodal sequences. The system processes extended documents alongside visual and file-based content in one session. This enables consistent analysis without external chunking or summarization steps.
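As a rough illustration of what "fits in one session" means, the sketch below estimates whether a set of text documents stays under the 400,000-token window. The 4-characters-per-token ratio, the `reservedForOutput` default, and the function names are assumptions made for illustration; real token counts depend on the tokenizer, and image or file inputs are not counted here.

```javascript
// Context window size as stated on this page.
const CONTEXT_WINDOW_TOKENS = 400000;

// Assumption: roughly 4 characters per token for English text.
// Real tokenizers vary, so treat the result as an estimate only.
const CHARS_PER_TOKEN = 4;

// Estimate the token count of a string from its length.
function estimateTokens(text) {
  return Math.ceil(text.length / CHARS_PER_TOKEN);
}

// Check whether all texts fit, leaving room for the reply.
// Assumption: the window is shared between the prompt and the reply,
// and 8,000 tokens is a hypothetical reservation for the output.
function fitsInContext(texts, reservedForOutput = 8000) {
  const inputTokens = texts.reduce((sum, t) => sum + estimateTokens(t), 0);
  const budget = CONTEXT_WINDOW_TOKENS - reservedForOutput;
  return { inputTokens, budget, fits: inputTokens <= budget };
}

// Example: a 1.2M-character report plus a short instruction.
const report = "x".repeat(1200000);
console.log(fitsInContext([report, "Summarize the key findings."]));
```

If the estimate exceeds the budget, the input would still need to be split or shortened before sending.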

Typical usage includes document review that combines textual and visual elements. Users apply it to tasks such as file interpretation, image-augmented reasoning, and sustained multi-turn interactions. Because the weights are closed, the model cannot be modified directly; it is accessed through an API for production workflows.

Capabilities

Long-context reasoning
Multimodal understanding
Image and file analysis
Advanced text generation
Complex problem solving

How GPT-5 compares

GPT-5 compared with other multimodal models on intelligence, speed, and price.

Price

USD per 1M output tokens · Lower is better · GPT-5 ranks #86 of 124

Model                                  USD per 1M output tokens
Gemini 3.5 Flash                       $9.00
Google Gemini Flash Latest             $9.00
Gemini 2.5 Pro                         $10.00
GPT-5.1                                $10.00
GPT-5 Codex                            $10.00
GPT-5.1-Codex                          $10.00
GPT-5                                  $10.00
GPT-5.1-Codex-Max                      $10.00
GPT-5 Chat                             $10.00
GPT-5.1 Chat                           $10.00
Gemini 3.1 Pro Preview                 $12.00
Gemini 3.1 Pro Preview Custom Tools    $12.00
Google Gemini Pro Latest               $12.00

Sources: Artificial Analysis (intelligence, speed) · OpenRouter (price).

Best for

Extended Multimodal Document Review

GPT-5 processes lengthy reports or books alongside embedded images and diagrams within its 400,000-token context for unified analysis.

Large-Scale Research Integration

The model handles extensive collections of text, charts, and visuals in one session to synthesize findings across multiple sources.

Complex Visual-Text Workflows

It supports iterative tasks that combine detailed image interpretation with long-running textual reasoning and refinement.
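To make the text-plus-image workflows above concrete, here is a minimal sketch of a request payload that pairs a question with an image. It assumes the OpenAI-compatible chat completions format with `text` and `image_url` content parts; the helper name `buildImageQuestion` and the example URL are hypothetical, and the model slug is the one listed on this page.

```javascript
// Build a chat completions payload that combines a text question with one image.
// Assumption: the endpoint accepts OpenAI-style content parts.
function buildImageQuestion(question, imageUrl) {
  return {
    model: "openai/gpt-5", // model slug listed on this page
    messages: [
      {
        role: "user",
        content: [
          { type: "text", text: question },
          { type: "image_url", image_url: { url: imageUrl } },
        ],
      },
    ],
  };
}

// Hypothetical example input; replace the URL with a real, reachable image.
const request = buildImageQuestion(
  "What does this diagram show?",
  "https://example.com/diagram.png"
);
console.log(JSON.stringify(request, null, 2));
```

The resulting object could be passed to `client.chat.completions.create(request)` with an OpenAI-compatible client. Follow-up turns would append further messages to the same `messages` array.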

Strengths & limitations

Strengths

  • Very large context window
  • Native multimodal input support
  • Seamless text-image-file integration

Limitations

  • Hypothetical model with unverified performance
  • High resource demands for maximum context
  • Potential latency on large multimodal tasks

Cost calculator

Estimate what GPT-5 would cost for your usage.

$0.00625 per request
$62.50 estimated / month

Based on GPT-5's $1.25/1M input · $10.00/1M output. Estimate only — actual cost varies by provider and caching.
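The arithmetic behind these figures can be sketched as follows. The prices come from this page; the workload (1,000 input tokens, 500 output tokens, 10,000 requests per month) is an assumption chosen because it reproduces the numbers shown, not a documented calculator default. Function names are illustrative.

```javascript
// Prices from this page, in USD per 1M tokens.
const INPUT_PRICE_PER_M = 1.25;
const OUTPUT_PRICE_PER_M = 10.0;

// Cost in USD of one request with the given token counts.
function costPerRequest(inputTokens, outputTokens) {
  return (inputTokens * INPUT_PRICE_PER_M + outputTokens * OUTPUT_PRICE_PER_M) / 1e6;
}

// Monthly cost in USD, assuming every request has the same token counts.
function monthlyCost(inputTokens, outputTokens, requestsPerMonth) {
  return costPerRequest(inputTokens, outputTokens) * requestsPerMonth;
}

// Assumed workload (not stated on the page).
console.log(costPerRequest(1000, 500).toFixed(5)); // "0.00625"
console.log(monthlyCost(1000, 500, 10000).toFixed(2)); // "62.50"
```

Output tokens dominate here: at these prices, 500 output tokens cost four times as much as 1,000 input tokens. Caching and provider differences are not modeled.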

Quick start

OpenRouter's API is OpenAI-compatible — most SDKs work by just swapping the base URL. Only the model slug changes between models.

JavaScript · openai
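import OpenAI from "openai";

// Point the OpenAI SDK at OpenRouter's OpenAI-compatible endpoint.
const client = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: process.env.OPENROUTER_API_KEY, // read the key from the environment
});

// Send a single-turn chat request to GPT-5 via its OpenRouter slug.
// Top-level await requires an ES module (for example, a .mjs file).
const completion = await client.chat.completions.create({
  model: "openai/gpt-5",
  messages: [{ role: "user", content: "Hello!" }],
});

// Print the text of the first returned choice.
console.log(completion.choices[0].message.content);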

Model slug: openai/gpt-5

Editor's verdict

Our take on GPT-5

GPT-5 is OpenAI's proprietary multimodal model with a 400K-token context window.

At $10.00 per 1M output tokens, it ranks #86 of 124 on price, placing it toward the expensive end of its class.

It is available through OpenAI's API and aggregators like OpenRouter.

Best suited to workloads that need a very large context window and native multimodal input.

Frequently asked questions

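This page lists GPT-5 at $1.25 per 1M input tokens and $10.00 per 1M output tokens (price source: OpenRouter). Actual cost varies by provider and caching.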
