o3
OpenAI's o3 excels at multimodal reasoning over images, text, and files.
About o3
o3 was designed by OpenAI as a closed multimodal system without public weights. It natively accepts images, text, and files while supporting up to 200,000 tokens of context. This architecture enables unified handling of diverse input types in a single inference pass.
Strengths include robust integration of visual and textual data at scale. The model maintains coherence across lengthy documents and image collections without requiring external retrieval. Its proprietary training yields consistent performance on tasks that combine multiple modalities.
Typical usage covers research synthesis, document analysis, and creative workflows that mix images with text. Developers integrate o3 into applications needing file-aware reasoning over extended contexts. Access is through OpenAI's API and aggregators such as OpenRouter; the model weights are not publicly available.
How o3 compares
Chart: o3 versus other multimodal models on intelligence, speed, and price.
Price: USD per 1M output tokens (lower is better). o3 ranks #78 of 124.
Sources: Artificial Analysis (intelligence, speed) · OpenRouter (price).
Best for
Debugging Large Codebases
Applies chain-of-thought reasoning and file interpretation to identify issues across extensive code repositories and generate targeted fixes.
Multimodal Research Analysis
Performs combined image and text analysis on visual data paired with long documents to support multi-step problem solving.
Long-Context Document Workflows
Comprehends and processes up to 200,000 tokens of text for tasks requiring sustained reasoning across multiple files or reports.
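Before sending a batch of files, it can help to check roughly whether they fit in the 200,000-token window. The sketch below is only a rough pre-flight estimate: the 4-characters-per-token ratio is an assumed rule of thumb for English text, not o3's actual tokenizer, and the names `estimateTokens` and `fitsInContext` are hypothetical helpers introduced for illustration.

```typescript
// Context window stated on this page for o3.
const CONTEXT_WINDOW_TOKENS = 200000;

// ASSUMPTION: about 4 characters per token for English text.
// This is a rule of thumb, not o3's real tokenizer.
const ASSUMED_CHARS_PER_TOKEN = 4;

interface FitResult {
  estimatedTokens: number;
  fits: boolean;
}

// Rough token estimate for one piece of text.
function estimateTokens(text: string): number {
  return Math.ceil(text.length / ASSUMED_CHARS_PER_TOKEN);
}

// Sum the estimates for all documents and compare against the window,
// optionally leaving headroom for the model's output.
function fitsInContext(documents: string[], reservedForOutput: number = 0): FitResult {
  const estimatedTokens = documents.reduce(
    (sum, doc) => sum + estimateTokens(doc),
    0,
  );
  return {
    estimatedTokens,
    fits: estimatedTokens + reservedForOutput <= CONTEXT_WINDOW_TOKENS,
  };
}
```

For real workloads, a tokenizer-based count would be more reliable than a character ratio; the estimate here is only meant to catch inputs that are obviously too large.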
Strengths & limitations
Strengths
- Strong at complex reasoning tasks
- Effective multimodal integration
- Handles large context windows well
- Versatile across text, image, and file inputs
Limitations
- Higher latency from reasoning steps
- May overthink simple queries
- No native audio or video support
Cost calculator
Estimate what o3 would cost for your usage.
Based on o3's listed price of $2.00 per 1M input tokens and $8.00 per 1M output tokens. This is an estimate only; actual cost varies by provider and caching.
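The arithmetic behind such an estimate can be sketched as follows. It uses only the two per-million prices listed above; the function name `estimateCostUsd` is a hypothetical helper, and the sketch ignores caching discounts and provider-specific pricing.

```typescript
// Prices listed on this page for o3, in USD per 1M tokens.
const INPUT_USD_PER_MILLION = 2.0;
const OUTPUT_USD_PER_MILLION = 8.0;

// Estimate the cost of a workload from its input and output token counts.
// ASSUMPTION: flat per-token pricing with no caching or volume discounts.
function estimateCostUsd(inputTokens: number, outputTokens: number): number {
  const inputCost = (inputTokens / 1000000) * INPUT_USD_PER_MILLION;
  const outputCost = (outputTokens / 1000000) * OUTPUT_USD_PER_MILLION;
  return inputCost + outputCost;
}

// Example: 500,000 input tokens and 250,000 output tokens.
// 0.5 * $2.00 + 0.25 * $8.00 = $3.00
console.log(estimateCostUsd(500000, 250000).toFixed(2)); // prints "3.00"
```

Because output tokens cost four times as much as input tokens at these prices, workloads that produce long responses are likely to be dominated by the output term.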
Quick start
OpenRouter's API is OpenAI-compatible, so most OpenAI SDKs work once you swap the base URL. Only the model slug changes between models.
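import OpenAI from "openai";

// Point the OpenAI SDK at OpenRouter's OpenAI-compatible endpoint.
const client = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: process.env.OPENROUTER_API_KEY, // read the key from the environment
});

// Send a single user message to o3 via its OpenRouter model slug.
const completion = await client.chat.completions.create({
  model: "openai/o3",
  messages: [{ role: "user", content: "Hello!" }],
});

// Print the text of the first returned choice.
console.log(completion.choices[0].message.content);

Model slug: openai/o3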
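Since o3 accepts images as well as text, a request can pair a question with an image. The sketch below only builds the message payload and makes no network call. It follows the content-part shape used by OpenAI's Chat Completions API (`text` and `image_url` parts); that OpenRouter forwards this shape unchanged for `openai/o3` is an assumption, and `buildImageQuestion` and the example URL are hypothetical.

```typescript
// Content parts in the shape used by OpenAI's Chat Completions API.
type ContentPart =
  | { type: "text"; text: string }
  | { type: "image_url"; image_url: { url: string } };

interface UserMessage {
  role: "user";
  content: ContentPart[];
}

// Hypothetical helper: build a single user message that pairs a
// text question with an image referenced by URL.
function buildImageQuestion(question: string, imageUrl: string): UserMessage[] {
  return [
    {
      role: "user",
      content: [
        { type: "text", text: question },
        { type: "image_url", image_url: { url: imageUrl } },
      ],
    },
  ];
}

// Placeholder URL for illustration only.
const messages = buildImageQuestion(
  "What trend does this chart show?",
  "https://example.com/chart.png",
);
console.log(JSON.stringify(messages, null, 2));
```

The resulting array would be passed as the `messages` field of a `client.chat.completions.create` call with `model: "openai/o3"`, in place of the plain-text message in the quick start.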
Editor's verdict
o3 is OpenAI's proprietary multimodal model with a 200K-token context window.
At $8.00 per 1M output tokens, it is premium-priced for its class.
It is available through OpenAI's API and aggregators like OpenRouter.
It is best suited to complex reasoning tasks and workloads that need effective multimodal integration.
Frequently asked questions
The model provides a 200,000-token context window for long-context comprehension.