Skip to content
Qwen2.5 72B Instruct logo

Qwen2.5 72B Instruct

Verified

Open-weight LLM optimized for instruction following and long-context reasoning.

Alibaba QwenLanguage ModelsOpenII 15.6
Model page
Updated 2026-06-15

About Qwen2.5 72B Instruct

Built on a transformer architecture, Qwen2.5 72B Instruct was trained on diverse multilingual and code corpora. Its open-weight release enables researchers and developers to inspect, fine-tune, and deploy the model locally or in private environments.

The 131K token context supports extended documents and multi-turn conversations without truncation. Typical applications include coding assistance, technical writing, data summarization, and building domain-specific chat agents.

Users commonly integrate the model via frameworks such as Hugging Face Transformers or vLLM for both research experiments and production inference workloads.

Capabilities

Long-context reasoning
Code generation and debugging
Mathematical problem solving
Multilingual text generation
Instruction following
Tool use and function calling

Benchmarks & performance

Independent evaluation scores and measured speed.

15.6
Intelligence Index
11.9
Coding Index

Source: Artificial Analysis

How Qwen2.5 72B Instruct compares

Qwen2.5 72B Instruct (striped bar) vs other language models on intelligence, speed and price.

Intelligence

Artificial Analysis Intelligence Index · Higher is better · Qwen2.5 72B Instruct ranks #50 of 67

20
Qwen3 Next 80B A3B Instruct
20
Qwen3 Coder 30B A3B Instruct
20
Qwen3 235B A22B
17
R1 Distill Qwen 32B
17
DeepSeek V3
16
R1 Distill Llama 70B
16
Qwen2.5 72B Instruct
15
Qwen3 32B
14
Command A
13
Mistral Large 2407
13
Qwen2.5 Coder 32B Instruct
13
Qwen3 14B
13
Mistral Small 3

Price

USD per 1M output tokens · Lower is better · Qwen2.5 72B Instruct ranks #44 of 141

$0.40
Llama 3.1 70B Instruct
$0.40
GLM 4.7 Flash
$0.40
Qwen3 30B A3B Thinking 2507
$0.40
Qwen3 8B
$0.40
Llama 3.3 Nemotron Super 49B V1.5
$0.40
Hermes 4 70B
$0.40
Qwen2.5 72B Instruct
$0.40
UnslopNemo 12B
$0.41
DeepSeek V3.2 Exp
$0.43
Rocinante 12B
$0.45
Nemotron 3 Super
$0.50
Cydonia 24B V4.1
$0.50
Qwen3 30B A3B

Sources: Artificial Analysis (intelligence, speed) · OpenRouter (price).

Best for

Long-Document Analysis

The model handles inputs up to 131072 tokens for in-depth reasoning over lengthy texts such as research papers, legal contracts, or technical documentation.

Software Development Assistance

It supports code generation, debugging, and tool use to help developers write, test, and refine programs in multiple languages.

Multilingual Math and Instruction Tasks

Strong performance in mathematical problem solving combined with multilingual text generation and instruction following enables accurate responses across languages and complex calculations.

Strengths & limitations

Strengths

  • +Strong coding and technical reasoning
  • +Robust multilingual support including Chinese
  • +Effective long-context handling
  • +Clear and structured responses

Limitations

  • Text-only modality
  • No native vision or multimodal input
  • Knowledge cutoff limits real-time information

Cost calculator

Estimate what Qwen2.5 72B Instruct would cost for your usage.

$0.00056
per request
$5.6
estimated / month

Based on Qwen2.5 72B Instruct's $0.36/1M input · $0.40/1M output. Estimate only — actual cost varies by provider and caching.

Quick start

OpenRouter's API is OpenAI-compatible — most SDKs work by just swapping the base URL. Only the model slug changes between models.

JavaScript · openai
import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: process.env.OPENROUTER_API_KEY,
});

const completion = await client.chat.completions.create({
  model: "qwen/qwen-2.5-72b-instruct",
  messages: [{ role: "user", content: "Hello!" }],
});

console.log(completion.choices[0].message.content);

Model slug: qwen/qwen-2.5-72b-instruct

Editor's verdict

Our take on Qwen2.5 72B Instruct

Qwen2.5 72B Instruct is Alibaba Qwen's open-weight language models with a 131K-token context window.

On independent testing it scores 15.6 on the Artificial Analysis Intelligence Index.

At $0.40 per 1M output tokens, it is very cost-efficient for its class.

As an open-weight model you can self-host it or call it through a hosted API.

Best suited to strong coding and technical reasoning and robust multilingual support including chinese.

Did you find this helpful?

Frequently asked questions

The model provides a context window of 131072 tokens.

User reviews

Real, verified reviews from the community shape this model's rating.

Loading reviews…

Sign in to review

Other Qwen models

Sibling versions in the Qwen family from Alibaba Qwen.

Promote Qwen2.5 72B Instruct

Add this badge to your website, or share the tool.

DFeatured on DhanasviQwen2.5 72B Instruct 1