How do I access Gemini 3.1 Flash Lite Preview?

Access is provided via Google AI platforms during its preview phase.

What are the pricing details for Gemini 3.1 Flash Lite Preview?

Pricing information for preview models is listed in Google AI Studio or Vertex AI documentation.

What input types does Gemini 3.1 Flash Lite Preview accept?

As a multimodal model it processes text along with images and other media formats.

Is Gemini 3.1 Flash Lite Preview suitable for production use?

It is released as a preview, so stability and feature completeness should be verified against current Google release notes.

Gemini 3.1 Flash Lite Preview

Verified

Google's efficient multimodal preview for fast, large-context AI tasks.

GoogleMultimodalClosed

Vision

Model page

Updated 2026-06-14

About Gemini 3.1 Flash Lite Preview

Designed with an emphasis on speed, the model balances multimodal processing across multiple input types while keeping resource demands low. Its architecture supports extended context to maintain coherence over long inputs. As a preview release, it provides early access to refinements in Google's Gemini family.

Strengths include seamless handling of mixed media and a context capacity that accommodates lengthy documents or conversations. The non-open weights allow Google to deliver controlled updates and consistent performance. This setup favors reliability over customization for end users.

Common applications range from media analysis and content generation to building interactive tools that combine audio, video, and text. Enterprises use it for scalable tasks where efficiency and broad input support matter most. Developers integrate the model into prototypes needing quick multimodal responses.

Capabilities

Multimodal understanding across text, image, video, audio and files

Long-context reasoning over large inputs

Video and audio content analysis

File interpretation and summarization

Efficient text generation and reasoning

Cross-modal integration

How Gemini 3.1 Flash Lite Preview compares

Gemini 3.1 Flash Lite Preview (striped bar) vs other multimodal on intelligence, speed and price.

Price

USD per 1M output tokens · Lower is better · Gemini 3.1 Flash Lite Preview ranks #39 of 122

$1.1

Step 3.7 Flash

$1.2

MiniMax M3

$1.3

GPT-5.4 Nano

$1.3

ERNIE 4.5 VL 424B A47B

$1.3

Qwen3.7 Plus

$1.4

Qwen3 VL 8B Thinking

$1.5

Gemini 3.1 Flash Lite Preview

$1.5

Gemini 3.1 Flash Lite

$1.5

Mistral Large 3 2512

$1.5

Perceptron Mk1

$1.6

Qwen3.5 Plus 2026-02-15

$1.6

Qwen3.5-27B

$1.6

Qwen3 VL 30B A3B Thinking

Sources: Artificial Analysis (intelligence, speed) · OpenRouter (price).

Best for

Long-form multimodal document analysis

Processes entire reports or books containing text and embedded images or charts within its 1,048,576-token context for comprehensive summarization or extraction.

Video content understanding tasks

Handles sequences of image frames alongside text prompts to generate descriptions, detect events, or answer questions about extended video material.

Integrated data reasoning workflows

Combines large volumes of mixed media inputs for scenarios like reviewing legal files that include scanned documents, diagrams, and transcripts in one pass.

Strengths & limitations

Strengths

+Broad native support for multiple modalities
+Very large context window for document and media tasks
+Lightweight design optimized for speed
+Unified handling of video, audio and files

Limitations

–Preview model may show inconsistent behavior
–Lite variant trades depth for efficiency
–Experimental features can be less reliable than stable releases

Cost calculator

Estimate what Gemini 3.1 Flash Lite Preview would cost for your usage.

Input tokens / requestOutput tokens / requestRequests / month

$0.00100

per request

$10

estimated / month

Based on Gemini 3.1 Flash Lite Preview's $0.25/1M input · $1.50/1M output. Estimate only — actual cost varies by provider and caching.

Quick start

OpenRouter's API is OpenAI-compatible — most SDKs work by just swapping the base URL. Only the model slug changes between models.

JavaScript · openai

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: process.env.OPENROUTER_API_KEY,
});

const completion = await client.chat.completions.create({
  model: "google/gemini-3.1-flash-lite-preview",
  messages: [{ role: "user", content: "Hello!" }],
});

console.log(completion.choices[0].message.content);

Model slug: google/gemini-3.1-flash-lite-preview

Editor's verdict

Our take on Gemini 3.1 Flash Lite Preview

Gemini 3.1 Flash Lite Preview is Google's proprietary multimodal with a 1049K-token context window.

At $1.50 per 1M output tokens, it is mid-priced for its class.

It is available through Google's API and aggregators like OpenRouter.

Best suited to broad native support for multiple modalities and very large context window for document and media tasks.

Did you find this helpful?

Frequently asked questions

The model supports a context length of 1,048,576 tokens.

User reviews

Real, verified reviews from the community shape this model's rating.

Loading reviews…

Other Gemini models

Sibling versions in the Gemini family from Google.

Gemini 3.5 Flash

Google · Multimodal

Verified

Google's fast multimodal model for text, image, video and audio tasks.

ClosedII 54.81049K ctx$9.00/1M out