How do I access WizardLM-2 8x22B?

It is available through Microsoft platforms as an LLM developed by Microsoft.

What are the pricing details for using WizardLM-2 8x22B?

Pricing information is provided by Microsoft on their Azure AI model catalog and may vary by usage volume.

Can WizardLM-2 8x22B follow complex multi-step instructions?

Yes, its listed capabilities include complex instruction following and multi-step problem solving.

Is WizardLM-2 8x22B suitable for code generation tasks?

The model is designed for code generation among its core capabilities.

WizardLM-2 8x22B by Microsoft — Specs, Pricing, Benchmarks (2026)

About WizardLM-2 8x22B

WizardLM-2 8x22B builds on the Mixtral 8x22B architecture and applies Microsoft's WizardLM fine-tuning approach. This results in an open-weight model that users can download and run locally or on their own infrastructure. The 65536-token context window enables processing of lengthy documents and extended conversations.

Its strengths include handling detailed instructions and producing coherent multi-turn responses across varied text domains. Because the weights are publicly available, researchers and developers can fine-tune the model further for specialized needs without relying on proprietary APIs.

Typical usage covers chat interfaces, code assistance, summarization of long texts, and exploratory research projects. The text-only modality keeps it focused on language-based workflows rather than multimodal inputs.

Capabilities

Long-context reasoning

Complex instruction following

Code generation

Multi-step problem solving

Text summarization and analysis

Creative content generation

How WizardLM-2 8x22B compares

WizardLM-2 8x22B (striped bar) vs other language models on intelligence, speed and price.

Price

USD per 1M output tokens · Lower is better · WizardLM-2 8x22B ranks #56 of 141

$0.50

Qwen3 30B A3B

$0.50

Olmo 3 32B Think

$0.57

Hunyuan A13B Instruct

$0.60

GPT-4o-mini Search Preview

$0.60

Solar Pro 3

$0.60

Command R+

$0.62

WizardLM-2 8x22B

$0.63

Ring-2.6-1T

$0.63

Ling-2.6-1T

$0.65

Gemma 2 27B

$0.65

ReMM SLERP 13B

$0.70

Hermes 3 70B Instruct

$0.74

Llama 3 70B Instruct

Sources: Artificial Analysis (intelligence, speed) · OpenRouter (price).

Best for

Long-Context Document Summarization

The model processes up to 65536 tokens to summarize and analyze lengthy reports or research papers while maintaining coherence across sections.

Complex Codebase Refactoring

It handles multi-step code generation and problem solving by following detailed instructions to refactor large software projects.

Creative Narrative Development

Users can generate extended stories or scripts that require consistent character tracking and plot progression over many tokens.

Strengths & limitations

Strengths

+Strong instruction adherence
+Effective extended context handling
+Versatile reasoning across domains

Limitations

–Text-only modality
–High inference compute demands
–Risk of factual hallucinations

Cost calculator

Estimate what WizardLM-2 8x22B would cost for your usage.

Input tokens / requestOutput tokens / requestRequests / month

$0.00093

per request

$9.3

estimated / month

Based on WizardLM-2 8x22B's $0.62/1M input · $0.62/1M output. Estimate only — actual cost varies by provider and caching.

Quick start

OpenRouter's API is OpenAI-compatible — most SDKs work by just swapping the base URL. Only the model slug changes between models.

JavaScript · openai

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: process.env.OPENROUTER_API_KEY,
});

const completion = await client.chat.completions.create({
  model: "microsoft/wizardlm-2-8x22b",
  messages: [{ role: "user", content: "Hello!" }],
});

console.log(completion.choices[0].message.content);

Model slug: microsoft/wizardlm-2-8x22b

Editor's verdict

Our take on WizardLM-2 8x22B

WizardLM-2 8x22B is Microsoft's open-weight language models with a 66K-token context window.

At $0.62 per 1M output tokens, it is very cost-efficient for its class.

As an open-weight model you can self-host it or call it through a hosted API.

Best suited to strong instruction adherence and effective extended context handling.

Did you find this helpful?

Frequently asked questions

The model supports a context length of 65536 tokens.

User reviews

Real, verified reviews from the community shape this model's rating.

Loading reviews…

Sign in to review

Similar models

Other language models worth comparing.

DeepSeek V4 Pro

DeepSeek · Language Models

Verified

Open-weight LLM built for million-token text contexts.

OpenII 51.51049K ctx$0.87/1M out

DeepSeek V4 Flash

DeepSeek · Language Models

Verified

Open-weight LLM built for million-token text context handling.

OpenII 46.51049K ctx$0.18/1M out

Qwen3 Coder Plus

Alibaba Qwen · Language Models

Verified

Open-weight coder built for million-token codebases and complex tasks.

Open1000K ctx$3.25/1M out

WizardLM-2 8x22B

About WizardLM-2 8x22B

Capabilities

How WizardLM-2 8x22B compares

Price

Best for

Long-Context Document Summarization

Complex Codebase Refactoring

Creative Narrative Development

Strengths & limitations

Strengths

Limitations

Cost calculator

Quick start

Editor's verdict

Frequently asked questions

What is the context window size for WizardLM-2 8x22B?

How do I access WizardLM-2 8x22B?

What are the pricing details for using WizardLM-2 8x22B?

Can WizardLM-2 8x22B follow complex multi-step instructions?

Is WizardLM-2 8x22B suitable for code generation tasks?

User reviews

Similar models

DeepSeek V4 Pro

DeepSeek V4 Flash

Qwen3 Coder Plus

Promote WizardLM-2 8x22B