Is GPT-4o (2024-11-20) a multimodal model?

Yes, it handles multimodal reasoning across text and images.

Where can pricing details for GPT-4o be found?

Current pricing information is available on OpenAI's official pricing page.

How do users access GPT-4o (2024-11-20)?

It is accessible via OpenAI's ChatGPT interface and API endpoints.

What file-related tasks does GPT-4o support?

The model can interpret, summarize, and analyze uploaded files including documents and code.

GPT-4o (2024-11-20)

Verified

Multimodal AI for seamless text, image, and file processing.

OpenAIMultimodalClosedII 14.5

Vision

Model page

Updated 2026-06-15

About GPT-4o (2024-11-20)

The model combines multiple input modalities into a unified architecture. It supports extensive context lengths for processing lengthy documents or conversations. As a proprietary system, it emphasizes integrated performance across data types rather than open distribution.

Strengths include coherent handling of mixed visual and textual content in single queries. It maintains context across large inputs while delivering consistent outputs. The design prioritizes practical utility in dynamic environments over specialized single-mode tasks.

Common usage involves content creation, visual analysis, and file-based reasoning. Developers deploy it in chat interfaces, automation workflows, and enterprise tools. Individual users leverage it for research assistance, creative projects, and data interpretation.

Capabilities

Multimodal reasoning across text and images

Long-context document analysis

Code generation and debugging

File interpretation and summarization

Visual question answering

Complex instruction following

Benchmarks & performance

Independent evaluation scores and measured speed.

14.5

Intelligence Index

24.2

Coding Index

102

Tokens / sec

0.89s

Time to first token

Source: Artificial Analysis

How GPT-4o (2024-11-20) compares

GPT-4o (2024-11-20) (striped bar) vs other multimodal on intelligence, speed and price.

Intelligence

Artificial Analysis Intelligence Index · Higher is better · GPT-4o (2024-11-20) ranks #74 of 88

Gemini 2.5 Flash Lite

Qwen3 VL 32B Instruct

Qwen3 VL 30B A3B Instruct

Sonar

Sonar Pro

GLM 4.5V

GPT-4o

Qwen3 VL 8B Instruct

GPT-4 Turbo

Llama 4 Scout

Speed

Output tokens per second · Higher is better · GPT-4o (2024-11-20) ranks #37 of 76

130

GPT-4.1

122

Qwen3 VL 8B Instruct

116

GPT-5.1

112

Qwen3 VL 30B A3B Instruct

105

102

Llama 4 Scout

102

GPT-4o

102

GPT-4o

102

GPT-4o

102

GPT-4o

101

GPT-5.3-Codex

GPT-4.1 Mini

GPT-5 Mini

Price

USD per 1M output tokens · Lower is better · GPT-4o (2024-11-20) ranks #110 of 155

$10.0

GPT-5.1-Codex

$10.0

GPT-5 Codex

$10.0

Gemini 2.5 Pro Preview 06-05

$10.0

GPT-5.1

$10.0

GPT-5

$10.0

GPT-5.1-Codex-Max

$10.0

GPT-4o

$10.0

GPT-4o

$10.0

GPT-4o

$10.0

GPT-5 Chat

$10.0

GPT-5.1 Chat

$12.0

Gemini 3.1 Pro Preview

$12.0

Gemini 3.1 Pro Preview Custom Tools

Sources: Artificial Analysis (intelligence, speed) · OpenRouter (price).

Best for

Multimodal Visual Analysis

The model performs visual question answering and multimodal reasoning by interpreting images together with text inputs for tasks such as describing charts or identifying objects in photos.

Long-Context Document Review

With a 128,000-token context window it supports detailed analysis and summarization of lengthy reports, research papers, or multi-file code repositories.

Code Generation and Debugging

It generates, debugs, and refactors code while following complex instructions, making it effective for software development workflows and file interpretation.

Strengths & limitations

Strengths

+Strong integration of visual and textual inputs
+Reliable performance on diverse reasoning tasks
+Fast and coherent multi-turn dialogue
+Effective handling of mixed file and image queries

Limitations

–Can produce factual hallucinations
–No native audio or video processing
–Performance varies on highly specialized domains

Cost calculator

Estimate what GPT-4o (2024-11-20) would cost for your usage.

Input tokens / requestOutput tokens / requestRequests / month

$0.00750

per request

$75

estimated / month

Based on GPT-4o (2024-11-20)'s $2.50/1M input · $10.00/1M output. Estimate only — actual cost varies by provider and caching.

Quick start

OpenRouter's API is OpenAI-compatible — most SDKs work by just swapping the base URL. Only the model slug changes between models.

JavaScript · openai

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: process.env.OPENROUTER_API_KEY,
});

const completion = await client.chat.completions.create({
  model: "openai/gpt-4o-2024-11-20",
  messages: [{ role: "user", content: "Hello!" }],
});

console.log(completion.choices[0].message.content);

Model slug: openai/gpt-4o-2024-11-20

Editor's verdict

Our take on GPT-4o (2024-11-20)

GPT-4o (2024-11-20) is OpenAI's proprietary multimodal with a 128K-token context window.

On independent testing it scores 14.5 on the Artificial Analysis Intelligence Index, running at roughly 102 tokens per second with about 0.89s to first token.

At $10.00 per 1M output tokens, it is premium-priced for its class.

It is available through OpenAI's API and aggregators like OpenRouter.

Best suited to strong integration of visual and textual inputs and reliable performance on diverse reasoning tasks.

Did you find this helpful?

Frequently asked questions

The model supports a context length of 128,000 tokens.

User reviews

Real, verified reviews from the community shape this model's rating.

Loading reviews…

Other GPT models

Sibling versions in the GPT family from OpenAI.

GPT-5.4

OpenAI · Multimodal

Verified

Multimodal model excelling at large-scale text, image and file tasks.

ClosedII 56.81050K ctx$15.00/1M out

GPT-5.3-Codex

OpenAI · Multimodal

Verified

Multimodal coding model with 400k-token context from OpenAI.

ClosedII 53.6400K ctx$14.00/1M out

GPT-5.5

OpenAI · Multimodal

Verified

OpenAI's multimodal model built for massive file, image, and text inputs.

ClosedII 50.81050K ctx$30.00/1M out

GPT-5.2-Codex

OpenAI · Multimodal

Verified

Multimodal model handling text and images at scale.

ClosedII 49400K ctx$14.00/1M out

GPT-5.4 Mini

OpenAI · Multimodal

Verified

Multimodal model for large-scale file, image, and text processing.

ClosedII 48.9400K ctx$4.50/1M out

GPT-5.2

OpenAI · Multimodal

Verified

OpenAI's multimodal model for large-scale file, image, and text tasks.

ClosedII 46.6400K ctx$14.00/1M out

Promote GPT-4o (2024-11-20)

Add this badge to your website, or share the tool.

DFeatured on DhanasviGPT-4o (2024-11-20) 1

GPT-4o (2024-11-20)

About GPT-4o (2024-11-20)

Capabilities

Benchmarks & performance