Granite 4.0 Micro
VerifiedIBM's compact proprietary LLM for large-context text tasks.
About Granite 4.0 Micro
Designed as a non-open-weight model, Granite 4.0 Micro focuses on efficient text processing within a substantial context window. Its architecture supports coherent handling of extended inputs while maintaining proprietary control over deployment.
The model suits organizations that require secure, managed LLM access without public weights. Typical usage includes document analysis, conversational agents, and other text-centric workflows where context length and data privacy matter.
Capabilities
How Granite 4.0 Micro compares
Granite 4.0 Micro (striped bar) vs other language models on intelligence, speed and price.
Price
USD per 1M output tokens · Lower is better · Granite 4.0 Micro ranks #4 of 78
Sources: Artificial Analysis (intelligence, speed) · OpenRouter (price).
Best for
Long-Context Enterprise Document Processing
Granite 4.0 Micro handles extended inputs up to 131k tokens, making it effective for analyzing lengthy internal reports, contracts, and compliance documents in a single pass.
Resource-Efficient On-Premises Chat Applications
Its micro size supports deployment in constrained environments while maintaining conversation continuity across large context windows for customer support or internal knowledge tools.
Codebase Navigation in Development Workflows
The model processes substantial code repositories within its context limit, aiding tasks like refactoring suggestions and dependency tracing in enterprise codebases.
Strengths & limitations
Strengths
- +Efficient lightweight design
- +Strong long-context handling
- +Enterprise-oriented safety focus
- +Fast inference on modest hardware
Limitations
- –Text-only modality
- –Smaller scale limits depth on complex tasks
- –May require careful prompting for best results
Cost calculator
Estimate what Granite 4.0 Micro would cost for your usage.
Based on Granite 4.0 Micro's $0.02/1M input · $0.11/1M output. Estimate only — actual cost varies by provider and caching.
Quick start
OpenRouter's API is OpenAI-compatible — most SDKs work by just swapping the base URL. Only the model slug changes between models.
import OpenAI from "openai";
const client = new OpenAI({
baseURL: "https://openrouter.ai/api/v1",
apiKey: process.env.OPENROUTER_API_KEY,
});
const completion = await client.chat.completions.create({
model: "ibm-granite/granite-4.0-h-micro",
messages: [{ role: "user", content: "Hello!" }],
});
console.log(completion.choices[0].message.content);Model slug: ibm-granite/granite-4.0-h-micro
Editor's verdict
Granite 4.0 Micro is Ibm-granite's proprietary language models with a 131K-token context window.
At $0.11 per 1M output tokens, it is very cost-efficient for its class.
It is available through Ibm-granite's API and aggregators like OpenRouter.
Best suited to efficient lightweight design and strong long-context handling.
Frequently asked questions
The model supports a context window of 131000 tokens.
User reviews
Real, verified reviews from the community shape this model's rating.
Loading reviews…
Other Granite models
Sibling versions in the Granite family from Ibm-granite.