How can I access Gemma 4 26B A4B?

It is available through Google's AI developer platforms and APIs.

Is there pricing information for using Gemma 4 26B A4B?

Pricing details are provided on Google's official AI model documentation pages.

What types of tasks is Gemma 4 26B A4B best suited for?

It performs well on multimodal tasks involving long-context reasoning, image understanding, and video analysis.

Does Gemma 4 26B A4B support instruction following?

Yes, the model is designed for instruction following across text and multimodal inputs.

Gemma 4 26B A4B

Verified

Google's open multimodal model for text, image, and video with 262k context.

GoogleMultimodalOpen

Function callingJSON modeStructured outputsReasoningVision

Model page

Updated 2026-06-15

About Gemma 4 26B A4B

The model combines a 26B parameter architecture with native support for image, text, and video modalities. Its 262144-token context window allows handling of long multimodal sequences in a single pass. As an open-weight release from Google, it provides direct access to weights for customization and local inference.

Typical usage includes multimodal content analysis, video understanding, and cross-modal generation tasks. Developers leverage its open nature for fine-tuning on domain-specific image-text-video datasets. The design emphasizes broad accessibility while maintaining strong performance on mixed-modality inputs.

Capabilities

Long-context reasoning

Image understanding

Video analysis

Multimodal integration

Text generation

Instruction following

How Gemma 4 26B A4B compares

Gemma 4 26B A4B (striped bar) vs other multimodal on intelligence, speed and price.

Price

USD per 1M output tokens · Lower is better · Gemma 4 26B A4B ranks #14 of 124

$0.20

Mistral Small 3.2 24B

$0.26

Qwen3.5-Flash

$0.28

MiMo-V2.5

$0.30

Llama 4 Scout

$0.30

Seed 1.6 Flash

$0.30

Voxtral Small 24B 2507

$0.33

Gemma 4 26B A4B

$0.35

Gemma 4 31B

$0.40

Gemini 2.5 Flash Lite

$0.40

GPT-4.1 Nano

$0.40

Gemini 2.5 Flash Lite Preview 09-2025

$0.40

GPT-5 Nano

$0.40

Seed-2.0-Mini

Sources: Artificial Analysis (intelligence, speed) · OpenRouter (price).

Best for

Extended Video Content Review

Processes hours of video input alongside transcripts to identify patterns, summarize events, and answer queries spanning the full duration.

Large-Scale Multimodal Reports

Integrates text, images, and charts from lengthy documents to generate reasoned summaries and extract cross-referenced insights.

Complex Instruction Execution

Follows detailed multi-step prompts that combine visual analysis with long-context text generation for tasks like research synthesis.

Strengths & limitations

Strengths

+Large 256k-token context window
+Native support for image, text, and video inputs
+Efficient 26B-scale architecture

Limitations

–No audio modality support
–May trail larger models on complex reasoning tasks
–Higher inference cost for video processing

Pricing by provider

Live per-provider pricing & uptime, routed via OpenRouter. Prices are USD per 1M tokens.

Provider	Input /1M	Output /1M	Context	Uptime
DekaLLM(bf16)	$0.06	$0.33	262K	93.1%
DeepInfra(fp8)	$0.07	$0.34	262K	99.2%
Cloudflare	$0.10	$0.30	256K	99.8%
Ambient	$0.10	$0.30	262K	—
SiliconFlow(fp8)	$0.12	$0.40	262K	99.9%
Parasail(bf16)	$0.13	$0.40	262K	99.4%
Novita(bf16)	$0.13	$0.40	262K	99.8%
NextBit(bf16)	$0.13	$0.40	262K	99.3%
Google	$0.15	$0.60	262K	100.0%
Venice(bf16)	$0.16	$0.50	256K	99.6%

Cost calculator

Estimate what Gemma 4 26B A4B would cost for your usage.

Input tokens / requestOutput tokens / requestRequests / month

$0.00022

per request

$2.25

estimated / month

Based on Gemma 4 26B A4B's $0.06/1M input · $0.33/1M output. Estimate only — actual cost varies by provider and caching.

Quick start

OpenRouter's API is OpenAI-compatible — most SDKs work by just swapping the base URL. Only the model slug changes between models.

JavaScript · openai

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: process.env.OPENROUTER_API_KEY,
});

const completion = await client.chat.completions.create({
  model: "google/gemma-4-26b-a4b-it",
  messages: [{ role: "user", content: "Hello!" }],
});

console.log(completion.choices[0].message.content);

Model slug: google/gemma-4-26b-a4b-it

Editor's verdict

Our take on Gemma 4 26B A4B

Gemma 4 26B A4B is Google's open-weight multimodal with a 262K-token context window.

At $0.33 per 1M output tokens, it is very cost-efficient for its class, served by 10 providers.

As an open-weight model you can self-host it or call it through a hosted API.

Best suited to large 256k-token context window and native support for image, text, and video inputs.

Did you find this helpful?

Frequently asked questions

The model supports a context length of 262144 tokens.

User reviews

Real, verified reviews from the community shape this model's rating.

Loading reviews…

Other Gemma models

Sibling versions in the Gemma family from Google.

Gemma 4 31B

Google · Multimodal

Verified

Google's open multimodal model for long-context image, text and video tasks.

Open262K ctx$0.35/1M out

Gemma 3 4B

Google · Multimodal

Verified

Google's open multimodal model for efficient text and image understanding.

Open131K ctx$0.10/1M out

Gemma 3 12B

Google · Multimodal

Verified

Google's open multimodal model for text and image understanding.

Open131K ctx$0.15/1M out

Promote Gemma 4 26B A4B

Add this badge to your website, or share the tool.

DFeatured on DhanasviGemma 4 26B A4B 1

Gemma 4 26B A4B

About Gemma 4 26B A4B

Capabilities