Skip to content
Granite 4.0 Micro logo

Granite 4.0 Micro

Verified

IBM's compact proprietary LLM for large-context text tasks.

Ibm-graniteLanguage ModelsClosed
Model page
Updated 2026-06-14

About Granite 4.0 Micro

Designed as a non-open-weight model, Granite 4.0 Micro focuses on efficient text processing within a substantial context window. Its architecture supports coherent handling of extended inputs while maintaining proprietary control over deployment.

The model suits organizations that require secure, managed LLM access without public weights. Typical usage includes document analysis, conversational agents, and other text-centric workflows where context length and data privacy matter.

Capabilities

Long-context reasoning
Text generation
Code generation
Instruction following
Question answering
Summarization

How Granite 4.0 Micro compares

Granite 4.0 Micro (striped bar) vs other language models on intelligence, speed and price.

Price

USD per 1M output tokens · Lower is better · Granite 4.0 Micro ranks #4 of 78

$0.03
Ling-2.6-flash
$0.10
Qwen3 235B A22B Instruct 2507
$0.10
Granite 4.1 8B
$0.11
Granite 4.0 Micro
$0.12
LFM2-24B-A2B
$0.15
Trinity Mini
$0.15
Rnj 1 Instruct
$0.18
DeepSeek V4 Flash
$0.18
gpt-oss-120b
$0.19
Qwen3 30B A3B Instruct 2507
$0.20
Nemotron 3 Nano 30B A3B
$0.21
Hy3 preview
$0.24
Qwen3 14B

Sources: Artificial Analysis (intelligence, speed) · OpenRouter (price).

Best for

Long-Context Enterprise Document Processing

Granite 4.0 Micro handles extended inputs up to 131k tokens, making it effective for analyzing lengthy internal reports, contracts, and compliance documents in a single pass.

Resource-Efficient On-Premises Chat Applications

Its micro size supports deployment in constrained environments while maintaining conversation continuity across large context windows for customer support or internal knowledge tools.

Codebase Navigation in Development Workflows

The model processes substantial code repositories within its context limit, aiding tasks like refactoring suggestions and dependency tracing in enterprise codebases.

Strengths & limitations

Strengths

  • +Efficient lightweight design
  • +Strong long-context handling
  • +Enterprise-oriented safety focus
  • +Fast inference on modest hardware

Limitations

  • Text-only modality
  • Smaller scale limits depth on complex tasks
  • May require careful prompting for best results

Cost calculator

Estimate what Granite 4.0 Micro would cost for your usage.

$0.00008
per request
$0.7500
estimated / month

Based on Granite 4.0 Micro's $0.02/1M input · $0.11/1M output. Estimate only — actual cost varies by provider and caching.

Quick start

OpenRouter's API is OpenAI-compatible — most SDKs work by just swapping the base URL. Only the model slug changes between models.

JavaScript · openai
import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: process.env.OPENROUTER_API_KEY,
});

const completion = await client.chat.completions.create({
  model: "ibm-granite/granite-4.0-h-micro",
  messages: [{ role: "user", content: "Hello!" }],
});

console.log(completion.choices[0].message.content);

Model slug: ibm-granite/granite-4.0-h-micro

Editor's verdict

Our take on Granite 4.0 Micro

Granite 4.0 Micro is Ibm-granite's proprietary language models with a 131K-token context window.

At $0.11 per 1M output tokens, it is very cost-efficient for its class.

It is available through Ibm-granite's API and aggregators like OpenRouter.

Best suited to efficient lightweight design and strong long-context handling.

Did you find this helpful?

Frequently asked questions

The model supports a context window of 131000 tokens.

User reviews

Real, verified reviews from the community shape this model's rating.

Loading reviews…

Sign in to review

Other Granite models

Sibling versions in the Granite family from Ibm-granite.

Promote Granite 4.0 Micro

Add this badge to your website, or share the tool.

DFeatured on DhanasviGranite 4.0 Micro 1