GPT-4o (2024-11-20)
Multimodal AI for seamless text, image, and file processing.
About GPT-4o (2024-11-20)
The model combines multiple input modalities (text, images, and files) in a unified architecture. Its 128,000-token context window supports processing lengthy documents or conversations. As a proprietary system, it emphasizes integrated performance across data types rather than open distribution.
Strengths include coherent handling of mixed visual and textual content in single queries. It maintains context across large inputs while delivering consistent outputs. The design prioritizes practical utility in dynamic environments over specialized single-mode tasks.
Common usage involves content creation, visual analysis, and file-based reasoning. Developers deploy it in chat interfaces, automation workflows, and enterprise tools. Individual users leverage it for research assistance, creative projects, and data interpretation.
Benchmarks & performance
Independent evaluation scores and measured speed.
Source: Artificial Analysis
How GPT-4o (2024-11-20) compares
GPT-4o (2024-11-20) compared with other multimodal models on intelligence, speed, and price.
Intelligence
Artificial Analysis Intelligence Index · Higher is better · GPT-4o (2024-11-20) ranks #74 of 88
Speed
Output tokens per second · Higher is better · GPT-4o (2024-11-20) ranks #37 of 76
Price
USD per 1M output tokens · Lower is better · GPT-4o (2024-11-20) ranks #110 of 155
Sources: Artificial Analysis (intelligence, speed) · OpenRouter (price).
Best for
Multimodal Visual Analysis
The model performs visual question answering and multimodal reasoning by interpreting images together with text inputs for tasks such as describing charts or identifying objects in photos.
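As a minimal sketch of what such a request could look like, the snippet below builds a single user turn that pairs a question with an image, using the content-part layout accepted by OpenAI-compatible chat completions endpoints. The helper name `buildVisionMessage` and the example URL are illustrative assumptions, not part of any SDK.

```typescript
// Content-part shapes used by OpenAI-compatible chat completions requests.
type TextPart = { type: "text"; text: string };
type ImagePart = { type: "image_url"; image_url: { url: string } };
type UserMessage = { role: "user"; content: Array<TextPart | ImagePart> };

// Hypothetical helper: pair one question with one image in a single user turn.
function buildVisionMessage(question: string, imageUrl: string): UserMessage {
  return {
    role: "user",
    content: [
      { type: "text", text: question },
      { type: "image_url", image_url: { url: imageUrl } },
    ],
  };
}

const message = buildVisionMessage(
  "What trend does this chart show?",
  "https://example.com/chart.png", // placeholder URL, replace with a real image
);
console.log(JSON.stringify(message, null, 2));
```

The resulting object would be passed as `messages: [message]` to `client.chat.completions.create`, together with the model slug from the quick start.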
Long-Context Document Review
With a 128,000-token context window it supports detailed analysis and summarization of lengthy reports, research papers, or multi-file code repositories.
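Before sending a long document, it can help to check roughly whether it fits. The sketch below assumes about four characters per token, which is only a loose heuristic for English text; real counts depend on the tokenizer. The helper names are hypothetical.

```typescript
const CONTEXT_WINDOW_TOKENS = 128_000; // context length stated on this page
const ASSUMED_CHARS_PER_TOKEN = 4; // rough heuristic (assumption), not an exact tokenizer

// Hypothetical helper: rough token estimate from character count.
function estimateTokens(text: string): number {
  return Math.ceil(text.length / ASSUMED_CHARS_PER_TOKEN);
}

// Hypothetical helper: does the prompt likely fit while leaving room for the reply?
function fitsInContext(text: string, reservedOutputTokens: number): boolean {
  return estimateTokens(text) + reservedOutputTokens <= CONTEXT_WINDOW_TOKENS;
}

const report = "x".repeat(400_000); // stand-in for a 400,000-character document
console.log(estimateTokens(report)); // 100000
console.log(fitsInContext(report, 4_000)); // true
console.log(fitsInContext(report, 30_000)); // false
```

For a precise answer, count tokens with the actual tokenizer rather than relying on this estimate.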
Code Generation and Debugging
It generates, debugs, and refactors code while following complex instructions, making it effective for software development workflows and file interpretation.
Strengths & limitations
Strengths
- Strong integration of visual and textual inputs
- Reliable performance on diverse reasoning tasks
- Fast and coherent multi-turn dialogue
- Effective handling of mixed file and image queries
Limitations
- Can produce factual hallucinations
- No native audio or video processing
- Performance varies on highly specialized domains
Cost calculator
Estimate what GPT-4o (2024-11-20) would cost for your usage.
Based on GPT-4o (2024-11-20)'s $2.50/1M input · $10.00/1M output. Estimate only — actual cost varies by provider and caching.
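The arithmetic behind such an estimate is straightforward. The sketch below uses the two prices quoted above; the function name `estimateCostUsd` is an illustrative assumption, and the result ignores caching discounts and provider-specific pricing.

```typescript
// Prices quoted on this page, in USD per 1M tokens.
const INPUT_USD_PER_MILLION = 2.5;
const OUTPUT_USD_PER_MILLION = 10.0;

// Hypothetical helper: estimated cost in USD for a given token volume.
function estimateCostUsd(inputTokens: number, outputTokens: number): number {
  const inputCost = (inputTokens / 1_000_000) * INPUT_USD_PER_MILLION;
  const outputCost = (outputTokens / 1_000_000) * OUTPUT_USD_PER_MILLION;
  return inputCost + outputCost;
}

// Example: 2M input tokens and 500K output tokens.
console.log(estimateCostUsd(2_000_000, 500_000).toFixed(2)); // prints "10.00"
```

In this example the input and output sides each contribute $5.00, which shows how quickly output tokens dominate the bill at four times the input price.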
Quick start
OpenRouter's API is OpenAI-compatible — most SDKs work by just swapping the base URL. Only the model slug changes between models.
import OpenAI from "openai";

// Point the OpenAI SDK at OpenRouter's OpenAI-compatible endpoint.
const client = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: process.env.OPENROUTER_API_KEY,
});

// Send a single user message to the model identified by its slug.
const completion = await client.chat.completions.create({
  model: "openai/gpt-4o-2024-11-20",
  messages: [{ role: "user", content: "Hello!" }],
});

console.log(completion.choices[0].message.content);

Model slug: openai/gpt-4o-2024-11-20
Editor's verdict
GPT-4o (2024-11-20) is OpenAI's proprietary multimodal model with a 128K-token context window.
On independent testing it scores 14.5 on the Artificial Analysis Intelligence Index, running at roughly 102 tokens per second with about 0.89s to first token.
At $10.00 per 1M output tokens, it is premium-priced for its class.
It is available through OpenAI's API and aggregators like OpenRouter.
It is best suited to tasks that combine visual and textual inputs and to workloads that need reliable performance across diverse reasoning tasks.
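The measured speed figures above translate into a rough response-time estimate: time to first token plus output length divided by throughput. The sketch below assumes both figures hold steady for the whole response, which real traffic will not guarantee; the function name is hypothetical.

```typescript
// Figures quoted in the verdict above.
const TIME_TO_FIRST_TOKEN_S = 0.89;
const OUTPUT_TOKENS_PER_SECOND = 102;

// Hypothetical helper: rough wall-clock seconds to receive a full response.
function estimateResponseSeconds(outputTokens: number): number {
  return TIME_TO_FIRST_TOKEN_S + outputTokens / OUTPUT_TOKENS_PER_SECOND;
}

// Example: a 510-token reply takes about 0.89 + 5 seconds.
console.log(estimateResponseSeconds(510).toFixed(2)); // prints "5.89"
```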
Frequently asked questions
The model supports a context length of 128,000 tokens.