Gemini 3.1 Flash Lite Preview

Verified

Google's efficient multimodal preview for fast, large-context AI tasks.

Google · Multimodal · Closed · Vision
Updated 2026-06-14

About Gemini 3.1 Flash Lite Preview

Designed with an emphasis on speed, the model balances multimodal processing across multiple input types while keeping resource demands low. Its architecture supports extended context to maintain coherence over long inputs. As a preview release, it provides early access to refinements in Google's Gemini family.

Strengths include seamless handling of mixed media and a context capacity that accommodates lengthy documents or conversations. The non-open weights allow Google to deliver controlled updates and consistent performance. This setup favors reliability over customization for end users.

Common applications range from media analysis and content generation to building interactive tools that combine audio, video, and text. Enterprises use it for scalable tasks where efficiency and broad input support matter most. Developers integrate the model into prototypes needing quick multimodal responses.

Capabilities

Multimodal understanding across text, image, video, audio and files
Long-context reasoning over large inputs
Video and audio content analysis
File interpretation and summarization
Efficient text generation and reasoning
Cross-modal integration

How Gemini 3.1 Flash Lite Preview compares

Gemini 3.1 Flash Lite Preview (striped bar) vs. other multimodal models on intelligence, speed and price.

Price

USD per 1M output tokens · Lower is better · Gemini 3.1 Flash Lite Preview ranks #39 of 122

Step 3.7 Flash: $1.1
MiniMax M3: $1.2
GPT-5.4 Nano: $1.3
ERNIE 4.5 VL 424B A47B: $1.3
Qwen3.7 Plus: $1.3
Qwen3 VL 8B Thinking: $1.4
Gemini 3.1 Flash Lite Preview: $1.5
Gemini 3.1 Flash Lite: $1.5
Mistral Large 3 2512: $1.5
Perceptron Mk1: $1.5
Qwen3.5 Plus 2026-02-15: $1.6
Qwen3.5-27B: $1.6
Qwen3 VL 30B A3B Thinking: $1.6

Sources: Artificial Analysis (intelligence, speed) · OpenRouter (price).

Best for

Long-form multimodal document analysis

Processes entire reports or books containing text and embedded images or charts within its 1,048,576-token context for comprehensive summarization or extraction.

Video content understanding tasks

Handles sequences of image frames alongside text prompts to generate descriptions, detect events, or answer questions about extended video material.

Integrated data reasoning workflows

Combines large volumes of mixed media inputs for scenarios like reviewing legal files that include scanned documents, diagrams, and transcripts in one pass.
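Before sending a very long document in one pass, it can help to check roughly whether it fits the 1,048,576-token window. The sketch below is only an approximation: the four-characters-per-token ratio and the reserved output budget are assumptions for illustration, not values from this page, and `fitsInContext` is a hypothetical helper. It also treats the reserved output as counting against the same window, and it ignores the tokens consumed by images, audio, or video.

```javascript
// Context length stated on this page.
const CONTEXT_WINDOW_TOKENS = 1_048_576;
// Assumption: a rough heuristic, not the model's real tokenizer.
const ASSUMED_CHARS_PER_TOKEN = 4;

// Hypothetical helper: estimates whether plain text fits in the window
// while leaving room for the model's reply.
function fitsInContext(text, reservedOutputTokens = 8192) {
  const estimatedTokens = Math.ceil(text.length / ASSUMED_CHARS_PER_TOKEN);
  const budget = CONTEXT_WINDOW_TOKENS - reservedOutputTokens;
  return { estimatedTokens, budget, fits: estimatedTokens <= budget };
}

// Example: a 2,000,000-character report.
const report = "x".repeat(2_000_000);
console.log(fitsInContext(report));
```

For an exact count, use the provider's own token-counting facility where one is available rather than a character heuristic.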

Strengths & limitations

Strengths

  • Broad native support for multiple modalities
  • Very large context window for document and media tasks
  • Lightweight design optimized for speed
  • Unified handling of video, audio and files

Limitations

  • Preview model may show inconsistent behavior
  • Lite variant trades depth for efficiency
  • Experimental features can be less reliable than stable releases

Cost calculator

Estimate what Gemini 3.1 Flash Lite Preview would cost for your usage.

$0.00100 per request · $10 estimated / month

Based on Gemini 3.1 Flash Lite Preview's $0.25/1M input · $1.50/1M output. Estimate only — actual cost varies by provider and caching.
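The arithmetic behind such an estimate can be sketched as follows. The two prices come from this page; the token counts and request volume are assumptions chosen for illustration (1,000 input tokens, 500 output tokens, 10,000 requests per month), since the page does not state the calculator's defaults. They do reproduce the figures shown above. `estimateCost` is a hypothetical helper name.

```javascript
// Prices from this page, in USD per 1M tokens.
const INPUT_USD_PER_MTOK = 0.25;
const OUTPUT_USD_PER_MTOK = 1.5;

// Hypothetical helper: cost per request and per month for a given usage profile.
function estimateCost(inputTokens, outputTokens, requestsPerMonth) {
  const perRequest =
    (inputTokens * INPUT_USD_PER_MTOK + outputTokens * OUTPUT_USD_PER_MTOK) /
    1_000_000;
  return { perRequest, perMonth: perRequest * requestsPerMonth };
}

// Assumed usage: 1,000 input tokens and 500 output tokens, 10,000 requests per month.
const estimate = estimateCost(1000, 500, 10000);
console.log(estimate.perRequest.toFixed(5), estimate.perMonth.toFixed(2));
// prints: 0.00100 10.00
```

Because output tokens cost six times as much as input tokens here, the output length usually dominates the bill for generation-heavy workloads.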

Quick start

OpenRouter's API is OpenAI-compatible — most SDKs work by just swapping the base URL. Only the model slug changes between models.

JavaScript · openai
import OpenAI from "openai";

// Point the OpenAI SDK at OpenRouter's OpenAI-compatible endpoint.
const client = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: process.env.OPENROUTER_API_KEY, // read the key from the environment
});

// Top-level await requires an ES module (for example, a .mjs file).
const completion = await client.chat.completions.create({
  model: "google/gemini-3.1-flash-lite-preview",
  messages: [{ role: "user", content: "Hello!" }],
});

console.log(completion.choices[0].message.content);

Model slug: google/gemini-3.1-flash-lite-preview
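For multimodal prompts, such as the image-frame use case described under "Best for", a message can mix text and images. The sketch below builds one in the OpenAI-style content-part format (`text` and `image_url` parts). That this model accepts images in this shape through OpenRouter is an assumption based on the OpenAI compatibility noted above. `buildFrameMessage` is a hypothetical helper and the frame URLs are placeholders.

```javascript
// Hypothetical helper: builds one user message that combines a text prompt
// with a sequence of image frames, using OpenAI-style content parts.
function buildFrameMessage(prompt, frameUrls) {
  return {
    role: "user",
    content: [
      { type: "text", text: prompt },
      ...frameUrls.map((url) => ({ type: "image_url", image_url: { url } })),
    ],
  };
}

// Placeholder URLs; replace with real frame locations.
const message = buildFrameMessage("Describe what happens across these frames.", [
  "https://example.com/frame-001.jpg",
  "https://example.com/frame-002.jpg",
]);

// Inspect the payload before sending it.
console.log(JSON.stringify(message, null, 2));
```

The resulting object can be passed as an element of the `messages` array in a `client.chat.completions.create` call with the model slug above.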

Editor's verdict

Our take on Gemini 3.1 Flash Lite Preview

Gemini 3.1 Flash Lite Preview is Google's proprietary multimodal model with a 1,048,576-token context window.

At $1.50 per 1M output tokens, it is mid-priced for its class.

It is available through Google's API and aggregators like OpenRouter.

Its main strengths are broad native support for multiple modalities and a very large context window for document and media tasks.

Frequently asked questions

What is the context length of Gemini 3.1 Flash Lite Preview?

The model supports a context length of 1,048,576 tokens.
