GLM 4.5 Air
Text LLM with 128k context for efficient language tasks.
About GLM 4.5 Air
GLM 4.5 Air operates exclusively on text data and maintains a fixed context capacity of 131072 tokens. As a closed model, it is distributed through Z.AI's controlled channels rather than public weight releases. This design prioritizes managed updates and consistent performance across deployments.
Its architecture focuses on reliable handling of extended text sequences without requiring users to manage model parameters. The absence of disclosed parameter counts keeps emphasis on functional context length and modality rather than scale metrics.
Typical applications include document analysis, conversational agents, and content generation where a large text window improves coherence. Developers integrate it via Z.AI APIs for tasks that benefit from stable, non-open-source inference environments.
How GLM 4.5 Air compares
Chart: GLM 4.5 Air vs other language models on intelligence, speed and price.
Price: USD per 1M output tokens (lower is better). GLM 4.5 Air ranks #43 of 87.
Sources: Artificial Analysis (intelligence, speed) · OpenRouter (price).
Best for
Long Document Analysis
Processes inputs up to 131072 tokens for in-depth reasoning over lengthy reports, research papers, or legal documents while maintaining coherence across sections.
Code Generation Projects
Generates and refines code in multiple languages based on detailed instructions, supporting iterative development of applications or scripts.
Multilingual Summarization
Handles text summarization and analysis across languages, producing concise outputs from large volumes of international content or mixed-language sources.
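Before sending a long document, it can help to check whether it is likely to fit in the 131072-token window. The following is a minimal sketch, not an official tokenizer: the ratio of about 4 characters per token is an assumption (real counts depend on the model's tokenizer and on the language of the text), and the helper names `estimateTokens`, `fitsInContext`, and `splitIntoChunks` are hypothetical.

```typescript
// Context window stated on this page for GLM 4.5 Air.
const CONTEXT_WINDOW_TOKENS = 131072;

// Assumption: roughly 4 characters per token. Real tokenizers differ, and
// non-English text may produce noticeably more tokens per character.
const ASSUMED_CHARS_PER_TOKEN = 4;

// Rough token estimate from character count (not a real tokenizer).
function estimateTokens(text: string): number {
  return Math.ceil(text.length / ASSUMED_CHARS_PER_TOKEN);
}

// True when the estimated prompt plus a reserved output budget fits the window.
function fitsInContext(text: string, reservedOutputTokens: number = 4096): boolean {
  return estimateTokens(text) + reservedOutputTokens <= CONTEXT_WINDOW_TOKENS;
}

// Splits text into pieces whose estimated size stays within maxTokens each.
function splitIntoChunks(text: string, maxTokens: number): string[] {
  if (maxTokens <= 0) {
    throw new RangeError("maxTokens must be positive");
  }
  const maxChars = maxTokens * ASSUMED_CHARS_PER_TOKEN;
  const chunks: string[] = [];
  for (let start = 0; start < text.length; start += maxChars) {
    chunks.push(text.slice(start, start + maxChars));
  }
  return chunks;
}
```

If `fitsInContext` returns false, one option is to summarize each chunk from `splitIntoChunks` separately and then summarize the summaries. Chunking on fixed character offsets can cut sentences in half, so a production version would likely split on paragraph or section boundaries instead.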
Strengths & limitations
Strengths
- Efficient inference suitable for high-volume use
- Strong Chinese language capabilities
- Supports extended 128k token contexts
Limitations
- Text-only modality
- Smaller model size may reduce depth on complex tasks
- General-purpose with no specialized domain expertise
Cost calculator
Estimate what GLM 4.5 Air would cost for your usage.
Based on GLM 4.5 Air's pricing of $0.13 per 1M input tokens and $0.85 per 1M output tokens. Estimate only; actual cost varies by provider and caching.
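The estimate is simple arithmetic on the two listed prices. Below is a minimal sketch; the function name `estimateCostUsd` and the sample workload are hypothetical, and the result ignores caching discounts and provider-specific pricing.

```typescript
// Prices quoted on this page, in USD per 1M tokens.
const INPUT_USD_PER_MILLION = 0.13;
const OUTPUT_USD_PER_MILLION = 0.85;

// Estimated cost in USD for a given number of input and output tokens.
function estimateCostUsd(inputTokens: number, outputTokens: number): number {
  return (
    (inputTokens / 1_000_000) * INPUT_USD_PER_MILLION +
    (outputTokens / 1_000_000) * OUTPUT_USD_PER_MILLION
  );
}

// Hypothetical workload: 10,000 requests, each with 2,000 input tokens
// and 500 output tokens (20M input tokens, 5M output tokens in total).
const requestCount = 10_000;
const workloadCostUsd = estimateCostUsd(requestCount * 2_000, requestCount * 500);

console.log(workloadCostUsd.toFixed(2)); // prints "6.85"
```

For this workload, input accounts for $2.60 and output for $4.25, so output tokens dominate the bill even though there are four times as many input tokens.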
Quick start
OpenRouter's API is OpenAI-compatible, so most SDKs work once the base URL is swapped. Only the model slug changes between models.

import OpenAI from "openai";

// Point the OpenAI SDK at OpenRouter instead of the default OpenAI endpoint.
const client = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: process.env.OPENROUTER_API_KEY, // read the key from the environment
});

// Only the model slug needs to change to target a different model.
const completion = await client.chat.completions.create({
  model: "z-ai/glm-4.5-air",
  messages: [{ role: "user", content: "Hello!" }],
});

console.log(completion.choices[0].message.content);

Model slug: z-ai/glm-4.5-air
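Because only the model slug differs between models, the request body can be built by a small helper that takes the slug as a parameter. This is a sketch under stated assumptions: `buildChatRequest` and the type names are hypothetical, the second slug is a placeholder rather than a real model, and the payload shape is the standard OpenAI-style chat completions body (`model` plus `messages`).

```typescript
// Minimal OpenAI-style chat completions payload types (illustrative only).
type ChatMessage = { role: "system" | "user" | "assistant"; content: string };
type ChatRequest = { model: string; messages: ChatMessage[] };

// Slug listed on this page for GLM 4.5 Air.
const GLM_45_AIR_SLUG = "z-ai/glm-4.5-air";

// Builds a single-turn request body; the model slug defaults to GLM 4.5 Air.
function buildChatRequest(prompt: string, model: string = GLM_45_AIR_SLUG): ChatRequest {
  return { model, messages: [{ role: "user", content: prompt }] };
}

const prompt = "Summarize this report in three bullet points.";

// Same messages, different slug. "example-vendor/example-model" is a placeholder.
const glmRequest = buildChatRequest(prompt);
const otherRequest = buildChatRequest(prompt, "example-vendor/example-model");

console.log(glmRequest.model);
console.log(otherRequest.model);
```

Either object can be passed to `client.chat.completions.create(...)` from the quick start, which keeps model selection in one place when comparing models.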
Editor's verdict
GLM 4.5 Air is Z.AI's proprietary language model with a 131K-token context window.
At $0.85 per 1M output tokens, it sits mid-pack on price, ranking #43 of 87 models compared.
It is available through Z.AI's API and aggregators like OpenRouter.
It is best suited to high-volume workloads that need efficient inference and to Chinese-language tasks.
Frequently asked questions
The model provides a context window of 131072 tokens for processing extended inputs.