Skip to content
GPT Audio Mini logo

GPT Audio Mini

Verified

OpenAI's compact model for seamless text and audio processing.

OpenAIAudio & MusicClosed
Model page
Updated 2026-06-14

About GPT Audio Mini

Built by OpenAI, GPT Audio Mini combines text and audio modalities in a single system. Its 128000-token context supports extended audio transcripts or mixed conversations. The model remains closed-source with parameters undisclosed.

It excels at tasks that blend spoken audio with textual instructions. Typical usage includes audio analysis, transcription support, and interactive voice scenarios where large context matters.

Capabilities

Audio input and output processing
Text-audio multimodal understanding
Long-context audio reasoning
Speech transcription and synthesis
Conversational audio interactions
Audio content analysis over extended sequences

Best for

Extended Audio Transcription

The model handles transcription of lengthy recordings like conferences or interviews by leveraging its 128000-token context window to maintain accuracy across long sessions.

Podcast and Meeting Summarization

It processes full audio files from podcasts or business meetings to generate structured summaries while retaining key details from the entire input.

Audio Analysis in Research

Researchers can use it to analyze spoken content in academic or qualitative studies, drawing insights from large volumes of audio data in a single pass.

Strengths & limitations

Strengths

  • +Seamless integration of text and audio modalities
  • +Efficient handling of large audio contexts
  • +Optimized for audio-centric tasks
  • +Built on established OpenAI GPT architecture

Limitations

  • Smaller model scale may reduce depth on complex non-audio tasks
  • No vision or other non-text modalities supported
  • Audio focus could limit general-purpose versatility

Cost calculator

Estimate what GPT Audio Mini would cost for your usage.

$0.00180
per request
$18
estimated / month

Based on GPT Audio Mini's $0.60/1M input · $2.40/1M output. Estimate only — actual cost varies by provider and caching.

Quick start

OpenRouter's API is OpenAI-compatible — most SDKs work by just swapping the base URL. Only the model slug changes between models.

JavaScript · openai
import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: process.env.OPENROUTER_API_KEY,
});

const completion = await client.chat.completions.create({
  model: "openai/gpt-audio-mini",
  messages: [{ role: "user", content: "Hello!" }],
});

console.log(completion.choices[0].message.content);

Model slug: openai/gpt-audio-mini

Editor's verdict

Our take on GPT Audio Mini

GPT Audio Mini is OpenAI's proprietary audio & music with a 128K-token context window.

At $2.40 per 1M output tokens, it is mid-priced for its class.

It is available through OpenAI's API and aggregators like OpenRouter.

Best suited to seamless integration of text and audio modalities and efficient handling of large audio contexts.

Did you find this helpful?

Frequently asked questions

The model provides a context window of 128000 tokens for processing audio inputs.

User reviews

Real, verified reviews from the community shape this model's rating.

Loading reviews…

Sign in to review

Other GPT models

Sibling versions in the GPT family from OpenAI.

Promote GPT Audio Mini

Add this badge to your website, or share the tool.

DFeatured on DhanasviGPT Audio Mini 1