What context length does Gemini 2.5 Flash support?

The model provides a context window of 1,048,576 tokens (about 1M).

How can users access Gemini 2.5 Flash?

It is offered by Google through official AI platforms and APIs for developers and enterprises.

What types of inputs can Gemini 2.5 Flash process?

It supports multimodal inputs, including text, images, audio, video, and files, and is built for fast inference.

Gemini 2.5 Flash

Verified

Google's fast multimodal model for unified text, image, audio, and video tasks.

Google · Multimodal · Closed

Vision

Updated 2026-06-15

About Gemini 2.5 Flash

Gemini 2.5 Flash uses a multimodal architecture that processes different input types in a single forward pass. Its design emphasizes low latency while maintaining support for very long contexts. The model is closed: Google does not release the weights, and access is through Google's API or aggregators such as OpenRouter.

A key strength is the ability to reason across mixed media within one conversation or document. The million-token context allows analysis of lengthy transcripts, video timelines, or multi-page documents without chunking. Performance remains consistent across text, visual, and auditory modalities.

Typical usage includes building assistants that summarize videos, answer questions about images and audio clips, or generate reports from combined file uploads. Developers integrate it into workflows requiring real-time multimodal understanding rather than single-modality text generation.

Capabilities

Multimodal understanding across text, image, audio, video and files

Long-context reasoning and analysis

Code generation and interpretation

Audio and video content processing

File analysis and summarization

Fast multimodal inference

How Gemini 2.5 Flash compares

Gemini 2.5 Flash vs other multimodal models on intelligence, speed and price.

Price

USD per 1M output tokens · Lower is better · Gemini 2.5 Flash ranks #69 of 139

Mistral Medium 3 · $2.0
Kimi K2.5 · $2.0
Qwen3.5-122B-A10B · $2.1
Qwen3.5 397B A17B · $2.3
Grok 4.3 · $2.5
Grok 4.20 · $2.5
Gemini 2.5 Flash · $2.5
Nova 2 Lite · $2.5
Qwen3 VL 235B A22B Thinking · $2.6
Gemini 3 Flash Preview · $3.0
Kimi K2.6 · $3.4
MoonshotAI Kimi Latest · $3.4

Sources: Artificial Analysis (intelligence, speed) · OpenRouter (price).

Best for

Long-context Multimodal Document Analysis

Handles reasoning and summarization over large files combining text, images, audio, and video within its 1,048,576-token context window.

Audio and Video Content Processing

Performs fast inference for transcription, analysis, and extraction of insights from extended audio or video inputs alongside other modalities.

Code Generation with Multimodal Inputs

Generates and interprets code while incorporating visual, audio, or file-based context for development and debugging tasks.

Strengths & limitations

Strengths

+ Broad native support for multiple input modalities
+ Efficient handling of very large contexts
+ Strong balance of speed and capability
+ Versatile across text, vision and audio tasks

Limitations

– Lower peak performance than larger Gemini variants on complex tasks
– Speed optimizations may reduce depth on nuanced reasoning
– Practical limits on full 1M-token context utilization
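
One way to respect the context limit in practice is to estimate input size before sending a request. The sketch below is illustrative only: the 4-characters-per-token ratio is a rough assumption for English text, not the model's actual tokenizer, and the helper names and the reserved output budget are hypothetical. It covers text only; images, audio, and video are counted differently and are not modeled here.

```javascript
// Context window size quoted on this page.
const CONTEXT_WINDOW_TOKENS = 1_048_576;

// ASSUMPTION: rough heuristic for English text, not the real tokenizer.
const ASSUMED_CHARS_PER_TOKEN = 4;

// Hypothetical helper: approximate the token count of a text string.
function estimateTokens(text) {
  return Math.ceil(text.length / ASSUMED_CHARS_PER_TOKEN);
}

// Hypothetical helper: check whether a set of texts fits in the window
// while leaving room for the model's reply (reserved size is an assumption).
function fitsInContext(texts, reservedOutputTokens = 8192) {
  const inputTokens = texts.reduce((sum, t) => sum + estimateTokens(t), 0);
  const budget = CONTEXT_WINDOW_TOKENS - reservedOutputTokens;
  return { inputTokens, budget, fits: inputTokens <= budget };
}

const check = fitsInContext(["First transcript ...", "Second transcript ..."]);
console.log(check.fits ? "Fits in one request" : "Needs trimming or chunking");
```

For exact counts, a provider's own token-counting facility is the safer choice; the heuristic here only flags inputs that are clearly too large.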

Cost calculator

Estimate what Gemini 2.5 Flash would cost for your usage.

Input tokens / request · Output tokens / request · Requests / month

$0.00155

per request

$15.5

estimated / month

Based on Gemini 2.5 Flash's $0.30/1M input · $2.50/1M output. Estimate only — actual cost varies by provider and caching.
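
The arithmetic behind such an estimate can be sketched as follows. The prices are the ones quoted above; the function name and the example workload (1,000 input tokens, 500 output tokens, 10,000 requests per month) are assumptions chosen for illustration, and caching or provider-specific pricing is not modeled.

```javascript
// Prices quoted on this page, in USD per 1M tokens. Actual rates vary by provider.
const PRICE_PER_MILLION = { input: 0.30, output: 2.50 };

// Hypothetical helper: cost of one request and of a month of identical requests.
function estimateCost({ inputTokens, outputTokens, requestsPerMonth }) {
  const perRequest =
    (inputTokens * PRICE_PER_MILLION.input +
      outputTokens * PRICE_PER_MILLION.output) / 1_000_000;
  return { perRequest, perMonth: perRequest * requestsPerMonth };
}

// ASSUMED example workload, not taken from the page.
const estimate = estimateCost({
  inputTokens: 1000,
  outputTokens: 500,
  requestsPerMonth: 10000,
});

console.log(`$${estimate.perRequest.toFixed(5)} per request`);
console.log(`$${estimate.perMonth.toFixed(2)} estimated / month`);
```

With this assumed workload the result works out to $0.00155 per request and $15.50 per month, which lines up with the figures displayed above.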

Quick start

OpenRouter's API is OpenAI-compatible, so most OpenAI SDKs work once the base URL points at OpenRouter. Only the model slug changes between models.

JavaScript · openai

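// Requires the "openai" npm package and an ES module context (top-level await).
import OpenAI from "openai";

// Point the OpenAI SDK at OpenRouter's OpenAI-compatible endpoint.
const client = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: process.env.OPENROUTER_API_KEY, // read the key from the environment
});

// Send a single-turn chat request to Gemini 2.5 Flash.
const completion = await client.chat.completions.create({
  model: "google/gemini-2.5-flash",
  messages: [{ role: "user", content: "Hello!" }],
});

// Print the text of the first returned choice.
console.log(completion.choices[0].message.content);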

Model slug: google/gemini-2.5-flash
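
Since the model accepts images alongside text, a request can carry both in one message. The sketch below only builds the request payload, using the content-part shape of the OpenAI Chat Completions format (`text` and `image_url` parts). The helper name and the example URL are hypothetical, and this page does not confirm which image sources or sizes a given provider accepts.

```javascript
// Hypothetical helper: build a single user message that pairs a question
// with an image, using OpenAI-style content parts.
function buildImageQuestion(question, imageUrl) {
  return [
    {
      role: "user",
      content: [
        { type: "text", text: question },
        { type: "image_url", image_url: { url: imageUrl } },
      ],
    },
  ];
}

// Placeholder URL for illustration only.
const request = {
  model: "google/gemini-2.5-flash",
  messages: buildImageQuestion(
    "What is shown in this image?",
    "https://example.com/photo.jpg"
  ),
};

console.log(JSON.stringify(request, null, 2));
```

The resulting object has the same shape as the argument passed to `client.chat.completions.create(...)` in the quick start, so it could be sent with the same client.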

Editor's verdict

Our take on Gemini 2.5 Flash

Gemini 2.5 Flash is Google's proprietary multimodal model with a 1,048,576-token (about 1049K) context window.

At $2.50 per 1M output tokens, it is mid-priced for its class.

It is available through Google's API and aggregators like OpenRouter.

Best suited to workloads that need broad native support for multiple input modalities and efficient handling of very large contexts.

Frequently asked questions

Pricing details are available directly from Google based on usage volume and access method.

Other Gemini models

Sibling versions in the Gemini family from Google.

Gemini 3.5 Flash

Google · Multimodal

Verified

Google's fast multimodal model for text, image, video and audio tasks.

Closed · II 54.8 · 1049K ctx · $9.00/1M out

Gemini 3.1 Flash Lite

Google · Multimodal

Verified

Google's fast multimodal model for efficient text, image, and video tasks.

Closed · II 33.5 · 1049K ctx · $1.50/1M out

Gemini 3.1 Flash Lite Preview

Google · Multimodal

Verified

Google's efficient multimodal preview for fast, large-context AI tasks.

Closed · 1049K ctx · $1.50/1M out

Gemini 3 Flash Preview

Google · Multimodal

Verified

Google's fast multimodal model for text, image, audio and video tasks.

Closed · 1049K ctx · $3.00/1M out

Gemini 2.5 Flash Lite

Google · Multimodal

Verified

Google's fast, lightweight multimodal model for text, image, audio, and video tasks.

Closed · 1049K ctx · $0.40/1M out

Gemini 3.1 Pro Preview Custom Tools

Google · Multimodal

Verified

Google's multimodal preview model with custom tools and massive context handling.

Closed · 1049K ctx · $12.00/1M out
