Skip to content
Gemma 4 26B A4B logo

Gemma 4 26B A4B

Verified

Google's open multimodal model for text, image, and video with 262k context.

GoogleMultimodalOpen
Function callingJSON modeStructured outputsReasoningVision
Model page
Updated 2026-06-15

About Gemma 4 26B A4B

The model combines a 26B parameter architecture with native support for image, text, and video modalities. Its 262144-token context window allows handling of long multimodal sequences in a single pass. As an open-weight release from Google, it provides direct access to weights for customization and local inference.

Typical usage includes multimodal content analysis, video understanding, and cross-modal generation tasks. Developers leverage its open nature for fine-tuning on domain-specific image-text-video datasets. The design emphasizes broad accessibility while maintaining strong performance on mixed-modality inputs.

Capabilities

Long-context reasoning
Image understanding
Video analysis
Multimodal integration
Text generation
Instruction following

How Gemma 4 26B A4B compares

Gemma 4 26B A4B (striped bar) vs other multimodal on intelligence, speed and price.

Price

USD per 1M output tokens · Lower is better · Gemma 4 26B A4B ranks #14 of 124

$0.20
Mistral Small 3.2 24B
$0.26
Qwen3.5-Flash
$0.28
MiMo-V2.5
$0.30
Llama 4 Scout
$0.30
Seed 1.6 Flash
$0.30
Voxtral Small 24B 2507
$0.33
Gemma 4 26B A4B
$0.35
Gemma 4 31B
$0.40
Gemini 2.5 Flash Lite
$0.40
GPT-4.1 Nano
$0.40
Gemini 2.5 Flash Lite Preview 09-2025
$0.40
GPT-5 Nano
$0.40
Seed-2.0-Mini

Sources: Artificial Analysis (intelligence, speed) · OpenRouter (price).

Best for

Extended Video Content Review

Processes hours of video input alongside transcripts to identify patterns, summarize events, and answer queries spanning the full duration.

Large-Scale Multimodal Reports

Integrates text, images, and charts from lengthy documents to generate reasoned summaries and extract cross-referenced insights.

Complex Instruction Execution

Follows detailed multi-step prompts that combine visual analysis with long-context text generation for tasks like research synthesis.

Strengths & limitations

Strengths

  • +Large 256k-token context window
  • +Native support for image, text, and video inputs
  • +Efficient 26B-scale architecture

Limitations

  • No audio modality support
  • May trail larger models on complex reasoning tasks
  • Higher inference cost for video processing

Pricing by provider

Live per-provider pricing & uptime, routed via OpenRouter. Prices are USD per 1M tokens.

ProviderInput /1MOutput /1MContextUptime
DekaLLM(bf16)$0.06$0.33262K93.1%
DeepInfra(fp8)$0.07$0.34262K99.2%
Cloudflare$0.10$0.30256K99.8%
Ambient$0.10$0.30262K
SiliconFlow(fp8)$0.12$0.40262K99.9%
Parasail(bf16)$0.13$0.40262K99.4%
Novita(bf16)$0.13$0.40262K99.8%
NextBit(bf16)$0.13$0.40262K99.3%
Google$0.15$0.60262K100.0%
Venice(bf16)$0.16$0.50256K99.6%

Cost calculator

Estimate what Gemma 4 26B A4B would cost for your usage.

$0.00022
per request
$2.25
estimated / month

Based on Gemma 4 26B A4B's $0.06/1M input · $0.33/1M output. Estimate only — actual cost varies by provider and caching.

Quick start

OpenRouter's API is OpenAI-compatible — most SDKs work by just swapping the base URL. Only the model slug changes between models.

JavaScript · openai
import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: process.env.OPENROUTER_API_KEY,
});

const completion = await client.chat.completions.create({
  model: "google/gemma-4-26b-a4b-it",
  messages: [{ role: "user", content: "Hello!" }],
});

console.log(completion.choices[0].message.content);

Model slug: google/gemma-4-26b-a4b-it

Editor's verdict

Our take on Gemma 4 26B A4B

Gemma 4 26B A4B is Google's open-weight multimodal with a 262K-token context window.

At $0.33 per 1M output tokens, it is very cost-efficient for its class, served by 10 providers.

As an open-weight model you can self-host it or call it through a hosted API.

Best suited to large 256k-token context window and native support for image, text, and video inputs.

Did you find this helpful?

Frequently asked questions

The model supports a context length of 262144 tokens.

User reviews

Real, verified reviews from the community shape this model's rating.

Loading reviews…

Sign in to review

Other Gemma models

Sibling versions in the Gemma family from Google.

Promote Gemma 4 26B A4B

Add this badge to your website, or share the tool.

DFeatured on DhanasviGemma 4 26B A4B 1