GPT-4o-mini (2024-07-18)

Fast, affordable multimodal model for text and image tasks.

OpenAI · Multimodal · Closed · Intelligence Index 12.6 · Vision
Updated 2026-06-15

About GPT-4o-mini (2024-07-18)

GPT-4o-mini is OpenAI's smaller-scale multimodal model. It accepts text, image, and file inputs, and it is fully proprietary: the weights are not released. It is designed to need less compute than the larger models in the same family.

Its main strength is the balance of capability, speed, and cost for everyday workloads. Because it is served through a hosted API, mixed text and image inputs can be processed without local hosting. This suits production environments where latency and pricing matter.

Developers commonly use it for chat interfaces, image analysis, and document summarization. It integrates well into applications needing quick multimodal responses. Typical deployments include customer support tools and content review pipelines.

Capabilities

Multimodal understanding (text, image, file)
Long-context reasoning
Code generation and analysis
Image description and visual reasoning
File content extraction and summarization
General instruction following and conversation

Benchmarks & performance

Independent evaluation scores and measured speed.

Intelligence Index: 12.6
Output speed: 55 tokens / sec
Time to first token: 1.23s

Source: Artificial Analysis
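As a rough illustration of how the speed and time-to-first-token figures combine, total response time can be approximated as time to first token plus output length divided by throughput. The sketch below assumes a constant generation rate and ignores network overhead. The helper name `estimateResponseSeconds` and the 500-token reply length are illustrative assumptions, not part of the Artificial Analysis data.

```javascript
// Measured figures quoted above (source: Artificial Analysis).
const TOKENS_PER_SECOND = 55;
const TIME_TO_FIRST_TOKEN_S = 1.23;

// Hypothetical helper: approximate seconds until a full response has arrived,
// assuming constant throughput once the first token is produced.
function estimateResponseSeconds(
  outputTokens,
  tokensPerSecond = TOKENS_PER_SECOND,
  ttftSeconds = TIME_TO_FIRST_TOKEN_S
) {
  if (outputTokens < 0 || tokensPerSecond <= 0) {
    throw new RangeError("outputTokens must be >= 0 and tokensPerSecond must be > 0");
  }
  return ttftSeconds + outputTokens / tokensPerSecond;
}

// Assumed reply length of 500 tokens: 1.23 + 500 / 55 seconds.
const seconds = estimateResponseSeconds(500);
console.log(`${seconds.toFixed(1)} s for a 500-token reply`); // prints "10.3 s for a 500-token reply"
```

Real latency varies with provider load and prompt size, so this is best treated as a planning estimate.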

How GPT-4o-mini (2024-07-18) compares

GPT-4o-mini (2024-07-18) compared with other multimodal models on intelligence, speed, and price.

Intelligence

Artificial Analysis Intelligence Index · Higher is better · GPT-4o-mini (2024-07-18) ranks #82 of 88

GPT-4o: 15
Qwen3 VL 8B Instruct: 14
GPT-4 Turbo: 14
Llama 4 Scout: 14
GPT-4.1 Nano: 13
GPT-4o-mini: 13
Claude 3 Haiku: 12
Saba: 12
Gemma 3 27B: 10
Gemma 3 12B: 9
Gemma 3 4B: 6

Speed

Output tokens per second · Higher is better · GPT-4o-mini (2024-07-18) ranks #57 of 76

Claude Fable 5: 63
Claude Opus 4.8: 62
GPT-5.5: 60
MiniMax M3: 58
Qwen3.6 27B: 58
GPT-4o-mini: 55
Kimi K2.5: 54
Qwen3.7 Plus: 53
Qwen3 VL 235B A22B Instruct: 53
Qwen3.6 Plus: 52
Gemma 4 26B A4B: 52

Price

USD per 1M output tokens · Lower is better · GPT-4o-mini (2024-07-18) ranks #32 of 155

Qwen3 VL 32B Instruct: $0.42
Qwen3 VL 8B Instruct: $0.50
Qwen3 VL 30B A3B Instruct: $0.52
Mistral Small 3.1 24B: $0.55
Llama 4 Maverick: $0.60
Mistral Small 4: $0.60
GPT-4o-mini: $0.60
Saba: $0.60
Qwen3 VL 235B A22B Instruct: $0.88
Codestral 2508: $0.90
GLM 4.6V: $0.90
Qwen3.6 35B A3B: $0.97

Sources: Artificial Analysis (intelligence, speed) · OpenRouter (price).

Best for

Long Document Analysis with Visuals

Handles documents of up to 128,000 tokens that include images or files, so it can summarize and extract insights from reports containing charts or diagrams.

Code Generation and Review

Supports code generation, analysis, and debugging across languages while maintaining context over large codebases or multiple files.

Image Description and Reasoning

Delivers accurate visual reasoning and descriptions for images, supporting tasks like content analysis or accessibility features.

Strengths & limitations

Strengths

  • Fast and cost-efficient responses
  • Good balance of capability and speed
  • Handles mixed text and image inputs effectively
  • Suitable for high-volume or real-time use cases

Limitations

  • Less depth on complex reasoning than larger models
  • No audio or video modality support
  • Can still hallucinate or miss nuances on edge cases

Cost calculator

Estimate what GPT-4o-mini (2024-07-18) would cost for your usage.

$0.00045 per request
$4.50 estimated per month

Based on GPT-4o-mini (2024-07-18) pricing of $0.15 per 1M input tokens and $0.60 per 1M output tokens. This is an estimate only; actual cost varies by provider and caching.
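The arithmetic behind an estimate like this can be sketched as follows. The page does not show the calculator's inputs, so the workload here (1,000 input tokens and 500 output tokens per request, 10,000 requests per month) is an assumption chosen because it reproduces the figures above. The function name `costPerRequest` is also hypothetical.

```javascript
// Prices quoted above, in USD per 1M tokens.
const INPUT_USD_PER_MILLION = 0.15;
const OUTPUT_USD_PER_MILLION = 0.60;

// Hypothetical helper: USD cost of one request given its token counts.
function costPerRequest(inputTokens, outputTokens) {
  const inputCost = inputTokens * INPUT_USD_PER_MILLION;
  const outputCost = outputTokens * OUTPUT_USD_PER_MILLION;
  return (inputCost + outputCost) / 1e6;
}

// Assumed workload (not stated on the page).
const ASSUMED_INPUT_TOKENS = 1000;
const ASSUMED_OUTPUT_TOKENS = 500;
const ASSUMED_REQUESTS_PER_MONTH = 10000;

const perRequest = costPerRequest(ASSUMED_INPUT_TOKENS, ASSUMED_OUTPUT_TOKENS);
const perMonth = perRequest * ASSUMED_REQUESTS_PER_MONTH;

console.log(`$${perRequest.toFixed(5)} per request`); // prints "$0.00045 per request"
console.log(`$${perMonth.toFixed(2)} per month`);     // prints "$4.50 per month"
```

Output tokens cost four times as much as input tokens at these rates, so reply length tends to dominate the bill for short prompts.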

Quick start

OpenRouter's API is OpenAI-compatible, so most OpenAI SDKs work once the base URL is swapped. Only the model slug changes between models.

JavaScript · openai
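import OpenAI from "openai";

// Point the OpenAI SDK at OpenRouter's OpenAI-compatible endpoint.
const client = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: process.env.OPENROUTER_API_KEY, // read the key from the environment
});

// Send a single-turn chat request using this model's slug.
const completion = await client.chat.completions.create({
  model: "openai/gpt-4o-mini-2024-07-18",
  messages: [{ role: "user", content: "Hello!" }],
});

// Print the text of the first returned choice.
console.log(completion.choices[0].message.content);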

Model slug: openai/gpt-4o-mini-2024-07-18
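
Because the model also accepts images, a request can combine text and an image in one message. The sketch below builds such a message in the OpenAI Chat Completions content-part format (`text` and `image_url` parts). The helper `buildImageMessages` and the example URL are hypothetical, and image support through a particular provider should be confirmed in that provider's documentation.

```javascript
// Hypothetical helper: build one user message that pairs a text prompt
// with an image reference, using Chat Completions content parts.
function buildImageMessages(prompt, imageUrl) {
  return [
    {
      role: "user",
      content: [
        { type: "text", text: prompt },
        { type: "image_url", image_url: { url: imageUrl } },
      ],
    },
  ];
}

// Placeholder URL for illustration only.
const messages = buildImageMessages(
  "Describe the chart in this image.",
  "https://example.com/chart.png"
);

// To send it, pass `messages` to the client from the Quick start, for example:
//   await client.chat.completions.create({
//     model: "openai/gpt-4o-mini-2024-07-18",
//     messages,
//   });
console.log(JSON.stringify(messages, null, 2));
```

Keeping message construction in a plain function makes the payload easy to inspect before any network call is made.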

Editor's verdict

Our take on GPT-4o-mini (2024-07-18)

GPT-4o-mini (2024-07-18) is OpenAI's proprietary multimodal model with a 128K-token context window.

In independent testing, it scores 12.6 on the Artificial Analysis Intelligence Index, running at roughly 55 tokens per second with about 1.23s to first token.

At $0.60 per 1M output tokens, it is very cost-efficient for its class.

It is available through OpenAI's API and aggregators like OpenRouter.

It is best suited to workloads that need fast, cost-efficient responses and a good balance of capability and speed.

Frequently asked questions

The model provides a context window of 128,000 tokens for processing extended inputs.
