What input types does Gemini 2.5 Flash Lite accept?

It processes multimodal inputs including text, images, video, audio, and files.

How can developers access Gemini 2.5 Flash Lite?

Access is provided through Google AI Studio and the Gemini API.

Is Gemini 2.5 Flash Lite suitable for real-time applications?

Its fast text generation makes it appropriate for latency-sensitive tasks such as chat and quick analysis.

Does Gemini 2.5 Flash Lite support audio transcription?

Yes, it performs audio transcription and analysis alongside other modalities.

Gemini 2.5 Flash Lite

Verified

Google's fast, lightweight multimodal model for text, image, audio, and video tasks.

Google · Multimodal · Closed · Vision

Updated 2026-06-15

About Gemini 2.5 Flash Lite

Built as a streamlined member of the Gemini family, the model prioritizes speed and resource efficiency while supporting multiple input types. Its architecture handles text, images, audio, and video together within one large context window. This design suits scenarios that need rapid multimodal integration and do not require open weights.

Strengths center on balanced performance for real-time applications and large-scale input processing. Typical usage includes content analysis, media summarization, and interactive tools that combine visual, auditory, and textual data streams in a single query.

Capabilities

Multimodal input processing

Long-context reasoning

Vision and video understanding

Audio transcription and analysis

File handling and reasoning

Fast text generation

How Gemini 2.5 Flash Lite compares

Gemini 2.5 Flash Lite compared with other multimodal models on intelligence, speed, and price.

Price

USD per 1M output tokens · Lower is better · Gemini 2.5 Flash Lite ranks #16 of 124

MiMo-V2.5: $0.28
Llama 4 Scout: $0.30
Seed 1.6 Flash: $0.30
Voxtral Small 24B 2507: $0.30
Gemma 4 26B A4B: $0.33
Gemma 4 31B: $0.35
Gemini 2.5 Flash Lite: $0.40
GPT-4.1 Nano: $0.40
Gemini 2.5 Flash Lite Preview 09-2025: $0.40
GPT-5 Nano: $0.40
Seed-2.0-Mini: $0.40
Qwen3 VL 32B Instruct: $0.42
Qwen3 VL 8B Instruct: $0.50

Sources: Artificial Analysis (intelligence, speed) · OpenRouter (price).

Best for

Long-context document reasoning

Processes entire code repositories, research papers, or legal contracts up to 1M tokens to deliver coherent summaries and answers without chunking.
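
Before sending a very large document in one request, it can help to check roughly whether it fits. The sketch below is only an estimate: the 4-characters-per-token ratio is a common rule of thumb for English text, not a property of this model's tokenizer, and `safetyMarginTokens` is a hypothetical buffer to absorb that estimation error. The limit itself is the 1,048,576-token context length listed on this page.

```javascript
// Context length listed for Gemini 2.5 Flash Lite on this page.
const CONTEXT_LIMIT_TOKENS = 1_048_576;

// ASSUMPTION: rough heuristic for English text, not the model's real tokenizer.
const CHARS_PER_TOKEN = 4;

// Estimate the token count of a string from its character length.
function estimateTokens(text) {
  return Math.ceil(text.length / CHARS_PER_TOKEN);
}

// Report whether the text plausibly fits in one request.
// safetyMarginTokens is a hypothetical buffer for estimation error.
function fitsInContext(text, safetyMarginTokens = 8192) {
  const estimated = estimateTokens(text);
  return {
    estimated,
    fits: estimated + safetyMarginTokens <= CONTEXT_LIMIT_TOKENS,
  };
}

// Example: a 4,000,000-character document is estimated at 1,000,000 tokens.
const result = fitsInContext("a".repeat(4_000_000));
console.log(result.estimated, result.fits);
```

For a firm answer, count tokens with the provider's own tooling rather than relying on a character-based estimate.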

Video and vision analysis

Understands video clips and images to extract events, objects, and narratives while combining them with text or audio inputs.

Multimodal file workflows

Handles mixed files containing text, images, and audio to perform transcription, reasoning, and rapid text output in a single pass.

Strengths & limitations

Strengths

+ Optimized for speed and efficiency
+ Strong multimodal integration
+ Handles very large contexts effectively
+ Suitable for high-volume, lightweight tasks

Limitations

– Reduced depth on highly complex reasoning
– May underperform full-scale models on nuanced tasks
– Lite design trades capability for cost and latency

Cost calculator

Estimate what Gemini 2.5 Flash Lite would cost for your usage.

Based on Gemini 2.5 Flash Lite's pricing of $0.10 per 1M input tokens and $0.40 per 1M output tokens. This is an estimate only; actual cost varies by provider and caching.
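
The same arithmetic can be sketched in a few lines of JavaScript. The prices are the ones quoted on this page; the function names and the example workload (1,000 input tokens, 500 output tokens, 10,000 requests per month) are hypothetical and chosen only for illustration.

```javascript
// Prices quoted on this page, in USD per 1M tokens.
const INPUT_PRICE_PER_M = 0.10;
const OUTPUT_PRICE_PER_M = 0.40;

// Cost of a single request in USD.
function costPerRequest(inputTokens, outputTokens) {
  const total =
    inputTokens * INPUT_PRICE_PER_M + outputTokens * OUTPUT_PRICE_PER_M;
  return total / 1_000_000;
}

// Estimated monthly cost in USD for a fixed request shape.
function costPerMonth(inputTokens, outputTokens, requestsPerMonth) {
  return costPerRequest(inputTokens, outputTokens) * requestsPerMonth;
}

// Hypothetical workload: 1,000 input tokens, 500 output tokens, 10,000 requests.
console.log(costPerRequest(1000, 500).toFixed(5)); // 0.00030
console.log(costPerMonth(1000, 500, 10000).toFixed(2)); // 3.00
```

This ignores caching discounts and provider-specific pricing, so treat the result as a rough upper-level estimate.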

Quick start

OpenRouter's API is OpenAI-compatible, so most SDKs work after swapping the base URL. Only the model slug changes between models.

JavaScript · openai

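import OpenAI from "openai";

// Point the OpenAI SDK at OpenRouter instead of the default OpenAI endpoint.
const client = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: process.env.OPENROUTER_API_KEY, // read the key from the environment
});

// Top-level await requires an ES module (for example, a .mjs file).
const completion = await client.chat.completions.create({
  model: "google/gemini-2.5-flash-lite",
  messages: [{ role: "user", content: "Hello!" }],
});

// Print the text of the first returned choice.
console.log(completion.choices[0].message.content);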

Model slug: google/gemini-2.5-flash-lite
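
Since the model accepts images alongside text, a request can combine both in one user message. The sketch below only builds the request body; it assumes OpenRouter accepts OpenAI-style `image_url` content parts for this model, which should be confirmed against the provider's documentation. The helper name `buildImageQuestion` and the example URL are hypothetical.

```javascript
// Build a chat request that pairs a text question with one image.
// ASSUMPTION: the endpoint accepts OpenAI-style "image_url" content parts.
function buildImageQuestion(imageUrl, question) {
  return {
    model: "google/gemini-2.5-flash-lite",
    messages: [
      {
        role: "user",
        content: [
          { type: "text", text: question },
          { type: "image_url", image_url: { url: imageUrl } },
        ],
      },
    ],
  };
}

// Hypothetical image URL, used only to show the resulting shape.
const request = buildImageQuestion(
  "https://example.com/chart.png",
  "What does this chart show?"
);
console.log(JSON.stringify(request, null, 2));
```

The resulting object can be passed to `client.chat.completions.create(request)` using a client configured as in the quick start.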

Editor's verdict

Our take on Gemini 2.5 Flash Lite

Gemini 2.5 Flash Lite is Google's proprietary multimodal model with a 1,048,576-token (about 1049K) context window.

At $0.40 per 1M output tokens, it ranks #16 of 124 multimodal models on output price.

It is available through Google's API and aggregators like OpenRouter.

Best suited to workloads that need speed, efficiency, and strong multimodal integration.

Frequently asked questions

What is the context length of Gemini 2.5 Flash Lite?

The model supports a context length of 1,048,576 tokens.

Other Gemini models

Sibling versions in the Gemini family from Google.

Gemini 3.5 Flash

Google · Multimodal

Verified

Google's fast multimodal model for text, image, video and audio tasks.

Closed · II 54.8 · 1049K ctx · $9.00/1M out

Gemini 3.1 Flash Lite

Google · Multimodal

Verified

Google's fast multimodal model for efficient text, image, and video tasks.

Closed · II 33.5 · 1049K ctx · $1.50/1M out

Gemini 2.5 Pro

Google · Multimodal

Verified

Google's multimodal model for long-context reasoning across media types.

Closed · 1049K ctx · $10.00/1M out

Gemini 3 Flash Preview

Google · Multimodal

Verified

Google's fast multimodal model for text, image, audio and video tasks.

Closed · 1049K ctx · $3.00/1M out

Gemini 3.1 Pro Preview

Google · Multimodal

Verified

Mastering complex multimodal tasks with an enormous context window.

Closed · 1049K ctx · $12.00/1M out

Gemini 3.1 Flash Lite Preview

Google · Multimodal

Verified

Google's efficient multimodal preview for fast, large-context AI tasks.

Closed · 1049K ctx · $1.50/1M out
