Is GPT-4o-mini available via API?

Yes, it can be accessed through OpenAI's developer platform and associated tools.

Does GPT-4o-mini accept image inputs?

Yes. As a multimodal model, it accepts both text and image inputs for understanding tasks.

Where is pricing information listed for GPT-4o-mini?

Current pricing details are published on OpenAI's official website and API documentation.

What file types can GPT-4o-mini process?

It can process the contents of text-based files and of files that contain images, as part of its multimodal capabilities.

GPT-4o-mini

Verified

Compact multimodal model for efficient text and image tasks.

OpenAI · Multimodal · Closed · II 12.6

Vision

Updated 2026-06-15

About GPT-4o-mini

GPT-4o-mini belongs to OpenAI's optimized GPT-4o family. It processes text and image inputs through a unified architecture with a 128K-token context window. The design prioritizes lower latency while preserving core multimodal functions.

Strengths include reliable handling of mixed media without requiring extensive resources. It delivers consistent results across varied prompts involving documents and visuals. This balance suits production environments where speed matters.

Typical usage covers chat interfaces, content moderation tools, and automated analysis pipelines. Applications often embed it for image captioning combined with textual reasoning. Teams deploy it in customer support systems and internal knowledge tools.

Capabilities

Multimodal text and image understanding

Long-context reasoning

Code generation and analysis

File content processing

General reasoning and problem-solving

Natural language instruction following

Benchmarks & performance

Independent evaluation scores and measured speed.

Intelligence Index: 12.6

Tokens / sec: roughly 55

Time to first token: 1.23s

Source: Artificial Analysis

How GPT-4o-mini compares

GPT-4o-mini compared with other multimodal models on intelligence, speed, and price.

Intelligence

Artificial Analysis Intelligence Index · Higher is better · GPT-4o-mini ranks #83 of 88

[Bar chart; individual scores not recoverable. Models shown, in chart order: GPT-4o, Qwen3 VL 8B Instruct, GPT-4 Turbo, Llama 4 Scout, GPT-4.1 Nano, GPT-4o-mini, Claude 3 Haiku, Saba, Gemma 3 27B, Gemma 3 12B, Gemma 3 4B]

Speed

Output tokens per second · Higher is better · GPT-4o-mini ranks #58 of 76

[Bar chart; individual speeds not recoverable. Models shown, in chart order: Claude Opus 4.8, GPT-5.5, MiniMax M3, Qwen3.6 27B, GPT-4o-mini, Kimi K2.5, Qwen3.7 Plus, Qwen3 VL 235B A22B Instruct, Qwen3.6 Plus, Gemma 4 26B A4B, Qwen3.5 397B A17B]

Price

USD per 1M output tokens · Lower is better · GPT-4o-mini ranks #33 of 155

$0.50 · Qwen3 VL 8B Instruct
$0.52 · Qwen3 VL 30B A3B Instruct
$0.55 · Mistral Small 3.1 24B
$0.60 · Llama 4 Maverick
$0.60 · Mistral Small 4
$0.60 · GPT-4o-mini
$0.60 · Saba
$0.88 · Qwen3 VL 235B A22B Instruct
$0.90 · Codestral 2508
$0.90 · GLM 4.6V
$0.97 · Qwen3.6 35B A3B
$1.00 · Qwen3.5-35B-A3B

Sources: Artificial Analysis (intelligence, speed) · OpenRouter (price).
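The three rankings use different pool sizes, so the raw rank numbers are hard to compare directly. As a rough illustration, the sketch below converts each rank into the share of listed models that GPT-4o-mini outranks. The `shareOutranked` helper and the metric itself are illustrative assumptions, not figures reported by Artificial Analysis or OpenRouter.

```javascript
// Ranks copied from the comparison captions above.
const ranks = {
  intelligence: { rank: 83, total: 88 },
  speed: { rank: 58, total: 76 },
  price: { rank: 33, total: 155 },
};

// Share of listed models that a model at `rank` outranks.
// (Illustrative metric, not one the page itself reports.)
function shareOutranked({ rank, total }) {
  if (!Number.isInteger(rank) || !Number.isInteger(total) || rank < 1 || rank > total) {
    throw new RangeError("rank must be an integer between 1 and total");
  }
  return (total - rank) / total;
}

for (const [axis, r] of Object.entries(ranks)) {
  const pct = (shareOutranked(r) * 100).toFixed(1);
  console.log(`${axis}: outranks ${pct}% of ${r.total} models`);
}
```

Read this way, the model sits near the bottom of the pool on intelligence, in the lower quarter on speed, and in the cheapest quarter on price.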

Best for

Multimodal document analysis

GPT-4o-mini processes documents containing both text and images, such as extracting information from charts, diagrams, or scanned reports while applying reasoning to the combined content.
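For a chart or scanned page, a request typically pairs a text question with an image reference. The sketch below builds such a request body using the `text` and `image_url` content parts of the OpenAI-compatible chat format. The helper name `buildImageRequest` and the example URL are hypothetical, and the code only constructs the payload; it does not send it.

```javascript
// Build a chat request body that pairs a text question with one image.
// The helper name and example URL are hypothetical.
function buildImageRequest(question, imageUrl, model = "openai/gpt-4o-mini") {
  if (typeof question !== "string" || question.trim() === "") {
    throw new TypeError("question must be a non-empty string");
  }
  // new URL() throws a TypeError on malformed input; data: URLs are accepted too.
  new URL(imageUrl);
  return {
    model,
    messages: [
      {
        role: "user",
        content: [
          { type: "text", text: question },
          { type: "image_url", image_url: { url: imageUrl } },
        ],
      },
    ],
  };
}

const body = buildImageRequest(
  "What trend does this chart show?",
  "https://example.com/chart.png"
);
console.log(JSON.stringify(body, null, 2));
```

The resulting object should be usable as the argument to `client.chat.completions.create(...)` with the client from the Quick start section, assuming the provider accepts image inputs for this model slug.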

Large-scale code review

It handles extensive codebases within its context window for analysis, debugging suggestions, and generating improvements across multiple files.

Long-context instruction tasks

The model follows complex natural language instructions over lengthy inputs like full research papers or conversation histories to produce coherent summaries or solutions.

Strengths & limitations

Strengths

+Fast response times
+Cost-efficient for scale
+Solid vision capabilities
+Effective on everyday tasks

Limitations

–Weaker on complex multi-step reasoning than larger models
–Can miss subtle details in very long contexts
–No native audio or video generation

Cost calculator

Estimate what GPT-4o-mini would cost for your usage.

Input tokens / request · Output tokens / request · Requests / month

$0.00045 per request

$4.50 estimated / month

Based on GPT-4o-mini's $0.15/1M input · $0.60/1M output. Estimate only — actual cost varies by provider and caching.
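The arithmetic behind the estimate can be sketched in a few lines. The prices come from the note above. The workload of 1,000 input tokens, 500 output tokens, and 10,000 requests per month is an assumption, chosen because it reproduces the displayed figures; the calculator's actual input values are not shown on this page.

```javascript
// Prices in USD per 1M tokens, from the calculator note above.
const PRICE_PER_M = { input: 0.15, output: 0.60 };

// Cost of one request plus the monthly total for a given request volume.
function estimateCost(inputTokens, outputTokens, requestsPerMonth) {
  const perRequest =
    (inputTokens * PRICE_PER_M.input + outputTokens * PRICE_PER_M.output) / 1e6;
  return { perRequest, perMonth: perRequest * requestsPerMonth };
}

// Assumed workload: 1,000 input tokens, 500 output tokens, 10,000 requests/month.
const { perRequest, perMonth } = estimateCost(1000, 500, 10000);
console.log(perRequest.toFixed(5)); // 0.00045
console.log(perMonth.toFixed(2));   // 4.50
```

Output tokens cost four times as much as input tokens here, so response length tends to dominate the bill for chat-style workloads.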

Quick start

OpenRouter's API is OpenAI-compatible — most SDKs work by just swapping the base URL. Only the model slug changes between models.

JavaScript · openai

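// The OpenAI SDK works against OpenRouter once the base URL is swapped.
import OpenAI from "openai";

// Read the API key from the environment rather than hard-coding it.
const client = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: process.env.OPENROUTER_API_KEY,
});

// Top-level await requires an ES module (for example a .mjs file).
const completion = await client.chat.completions.create({
  model: "openai/gpt-4o-mini",
  messages: [{ role: "user", content: "Hello!" }],
});

// Print the assistant's reply text.
console.log(completion.choices[0].message.content);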

Model slug: openai/gpt-4o-mini
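For environments without the SDK, the same call can be made with plain `fetch`. This is a minimal sketch, assuming the chat completions path sits directly under the base URL shown above and that the response follows the OpenAI-compatible shape. The `chatOnce` name and the injectable `fetchImpl` parameter are illustrative choices that let the function be exercised without network access.

```javascript
// Send one user message to an OpenAI-compatible chat endpoint using fetch.
// fetchImpl is injectable so the function can be tested with a stub.
async function chatOnce(
  prompt,
  { apiKey, model = "openai/gpt-4o-mini", fetchImpl = fetch } = {}
) {
  const res = await fetchImpl("https://openrouter.ai/api/v1/chat/completions", {
    method: "POST",
    headers: {
      Authorization: `Bearer ${apiKey}`,
      "Content-Type": "application/json",
    },
    body: JSON.stringify({
      model,
      messages: [{ role: "user", content: prompt }],
    }),
  });
  // Surface HTTP errors instead of failing later on a missing field.
  if (!res.ok) {
    throw new Error(`Request failed with HTTP ${res.status}`);
  }
  const data = await res.json();
  return data.choices[0].message.content;
}

// Usage (needs OPENROUTER_API_KEY and network access):
// chatOnce("Hello!", { apiKey: process.env.OPENROUTER_API_KEY }).then(console.log);
```

Switching to another model should only require passing a different `model` slug in the options.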

Editor's verdict

Our take on GPT-4o-mini

GPT-4o-mini is OpenAI's proprietary multimodal model with a 128K-token context window.

In independent testing, it scores 12.6 on the Artificial Analysis Intelligence Index, running at roughly 55 tokens per second with about 1.23s to first token.

At $0.60 per 1M output tokens, it is very cost-efficient for its class.

It is available through OpenAI's API and aggregators like OpenRouter.

It is best suited to workloads that need fast response times and cost efficiency at scale.

Frequently asked questions

What context window does GPT-4o-mini support?

It provides a context length of 128,000 tokens for handling large inputs in a single request.
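A quick pre-flight check can flag inputs that are unlikely to fit before a request is sent. The sketch below is a rough estimate only: the four-characters-per-token ratio and the 4,000-token output reserve are assumptions, and an actual tokenizer would be needed for exact counts.

```javascript
const CONTEXT_WINDOW = 128000; // tokens, from the answer above
const CHARS_PER_TOKEN = 4;     // rough heuristic (assumption), not a real tokenizer

// Approximate the token count of a string from its length.
function estimateTokens(text) {
  return Math.ceil(text.length / CHARS_PER_TOKEN);
}

// Check whether the input leaves room for the reply inside the context window.
function fitsContext(text, reservedOutputTokens = 4000) {
  const inputTokens = estimateTokens(text);
  const budget = CONTEXT_WINDOW - reservedOutputTokens;
  return { inputTokens, budget, fits: inputTokens <= budget };
}

console.log(fitsContext("x".repeat(40000)).fits);  // true  (about 10,000 tokens)
console.log(fitsContext("x".repeat(600000)).fits); // false (about 150,000 tokens)
```

Inputs that fail the check would need to be trimmed, summarized, or split across several requests.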

Other GPT models

Sibling versions in the GPT family from OpenAI.

GPT-5.4

OpenAI · Multimodal

Verified

Multimodal model excelling at large-scale text, image and file tasks.

Closed · II 56.8 · 1050K ctx · $15.00/1M out

GPT-5.3-Codex

OpenAI · Multimodal

Verified

Multimodal coding model with 400k-token context from OpenAI.

Closed · II 53.6 · 400K ctx · $14.00/1M out

GPT-5.5

OpenAI · Multimodal

Verified

OpenAI's multimodal model built for massive file, image, and text inputs.

Closed · II 50.8 · 1050K ctx · $30.00/1M out

GPT-5.2-Codex

OpenAI · Multimodal

Verified

Multimodal model handling text and images at scale.

Closed · II 49 · 400K ctx · $14.00/1M out

GPT-5.4 Mini

OpenAI · Multimodal

Verified

Multimodal model for large-scale file, image, and text processing.

Closed · II 48.9 · 400K ctx · $4.50/1M out

GPT-5.2

OpenAI · Multimodal

Verified

OpenAI's multimodal model for large-scale file, image, and text tasks.

Closed · II 46.6 · 400K ctx · $14.00/1M out
