GPT Audio Mini
OpenAI's compact model for seamless text and audio processing.
About GPT Audio Mini
Built by OpenAI, GPT Audio Mini combines text and audio modalities in a single system. Its 128,000-token context window supports extended audio transcripts or mixed conversations. The model is closed-source, and its parameter count is undisclosed.
It excels at tasks that blend spoken audio with textual instructions. Typical usage includes audio analysis, transcription support, and interactive voice scenarios where large context matters.
Capabilities
Best for
Extended Audio Transcription
The model handles transcription of lengthy recordings such as conferences or interviews, using its 128,000-token context window to keep earlier parts of a long session in view.
Podcast and Meeting Summarization
It processes full audio files from podcasts or business meetings to generate structured summaries while retaining key details from the entire input.
Audio Analysis in Research
Researchers can use it to analyze spoken content in academic or qualitative studies, drawing insights from large volumes of audio data in a single pass.
Strengths & limitations
Strengths
- Seamless integration of text and audio modalities
- Efficient handling of large audio contexts
- Optimized for audio-centric tasks
- Built on established OpenAI GPT architecture
Limitations
- Smaller model scale may reduce depth on complex non-audio tasks
- No vision or other modalities beyond text and audio supported
- Audio focus could limit general-purpose versatility
Cost calculator
Estimate what GPT Audio Mini would cost for your usage.
Based on GPT Audio Mini's $0.60/1M input · $2.40/1M output. Estimate only — actual cost varies by provider and caching.
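The arithmetic behind such an estimate can be sketched in a few lines. This is a minimal illustration that uses only the two rates quoted above; the function name `estimateCostUsd` and the sample token counts are hypothetical, and it ignores caching and any provider-specific pricing differences.

```typescript
// Rates quoted in this listing, in USD per 1M tokens.
const INPUT_USD_PER_MILLION = 0.6;
const OUTPUT_USD_PER_MILLION = 2.4;

// Hypothetical helper: estimates spend from raw token counts.
// Ignores caching discounts and provider-specific markups.
function estimateCostUsd(inputTokens: number, outputTokens: number): number {
  if (inputTokens < 0 || outputTokens < 0) {
    throw new RangeError("Token counts must be non-negative");
  }
  const inputCost = (inputTokens / 1_000_000) * INPUT_USD_PER_MILLION;
  const outputCost = (outputTokens / 1_000_000) * OUTPUT_USD_PER_MILLION;
  return inputCost + outputCost;
}

// Example workload (made-up numbers): 50M input tokens, 5M output tokens.
const monthlyUsd = estimateCostUsd(50_000_000, 5_000_000);
console.log(monthlyUsd.toFixed(2)); // prints "42.00"
```

Treat the result as a rough planning figure rather than a bill, since actual cost varies by provider and caching.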
Quick start
OpenRouter's API is OpenAI-compatible, so most SDKs work after swapping the base URL. Only the model slug changes between models.
import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: process.env.OPENROUTER_API_KEY,
});

const completion = await client.chat.completions.create({
  model: "openai/gpt-audio-mini",
  messages: [{ role: "user", content: "Hello!" }],
});

console.log(completion.choices[0].message.content);

Model slug: openai/gpt-audio-mini
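The quick start sends text only. For a request that blends spoken audio with textual instructions, a user message might be assembled as in the sketch below. It follows the `input_audio` content-part shape from OpenAI's Chat Completions API; whether a given provider accepts this shape for `openai/gpt-audio-mini`, and which audio formats it allows, is an assumption to verify against that provider's documentation. The helper name `buildAudioMessage` and the local type names are hypothetical.

```typescript
// Local types mirroring the OpenAI-style multimodal message shape.
type AudioFormat = "wav" | "mp3";

interface TextPart {
  type: "text";
  text: string;
}

interface InputAudioPart {
  type: "input_audio";
  input_audio: { data: string; format: AudioFormat };
}

interface UserMessage {
  role: "user";
  content: Array<TextPart | InputAudioPart>;
}

// Hypothetical helper: pairs a text instruction with base64-encoded audio.
// The caller is responsible for base64-encoding the audio bytes beforehand.
function buildAudioMessage(
  instruction: string,
  base64Audio: string,
  format: AudioFormat,
): UserMessage {
  if (base64Audio.length === 0) {
    throw new Error("base64Audio must not be empty");
  }
  return {
    role: "user",
    content: [
      { type: "text", text: instruction },
      { type: "input_audio", input_audio: { data: base64Audio, format } },
    ],
  };
}

// Placeholder payload; real code would pass the encoded bytes of a recording.
const message = buildAudioMessage("Summarize this recording.", "AAAA", "wav");
console.log(message.content.length); // prints 2
```

The resulting object would be passed as an element of `messages` in place of the plain `"Hello!"` message.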
Editor's verdict
GPT Audio Mini is OpenAI's proprietary audio & music model with a 128K-token context window.
At $2.40 per 1M output tokens, it is mid-priced for its class.
It is available through OpenAI's API and aggregators like OpenRouter.
It is best suited to tasks that combine text and audio and that need to keep large audio contexts in view.
Frequently asked questions
The model provides a context window of 128,000 tokens, shared by text and audio inputs.