How do I access the model?

It is available through Alibaba Cloud APIs and the Qwen platform for developers.

Is there usage pricing information?

Pricing follows Alibaba Cloud's token-based billing; check the official console for current rates.

Can it handle code-related tasks?

Yes, the model is suitable for code generation, review, and repository-scale analysis due to its context length.

Does it support function calling?

Qwen Plus variants include tool-use and function-calling capabilities for agentic applications.
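
As a rough illustration of what a function-calling request could look like, the sketch below builds an OpenAI-style request body with a `tools` array and parses any tool calls returned. The `get_weather` tool, its parameters, and the helper names `buildToolRequest` and `parseToolCalls` are hypothetical and chosen only for illustration. Whether a given provider exposes tool use for this exact slug is an assumption to verify before relying on it.

```javascript
// Hypothetical tool definition in the OpenAI-style "tools" format.
// The tool name and its parameters are illustrative, not part of any Qwen API.
const weatherTool = {
  type: "function",
  function: {
    name: "get_weather",
    description: "Look up the current weather for a city.",
    parameters: {
      type: "object",
      properties: { city: { type: "string", description: "City name" } },
      required: ["city"],
    },
  },
};

// Build a chat-completion request body that offers the tool to the model.
function buildToolRequest(userPrompt) {
  return {
    model: "qwen/qwen-plus-2025-07-28:thinking",
    messages: [{ role: "user", content: userPrompt }],
    tools: [weatherTool],
  };
}

// Extract tool calls from a response message, parsing the JSON-encoded arguments.
// Returns an empty array when the model answered in plain text.
function parseToolCalls(message) {
  return (message.tool_calls ?? []).map((call) => ({
    name: call.function.name,
    args: JSON.parse(call.function.arguments),
  }));
}
```

With an OpenAI-compatible client, the body would typically be passed to `client.chat.completions.create(buildToolRequest("..."))`, and `parseToolCalls(completion.choices[0].message)` would list the calls the application needs to execute.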

Qwen Plus 0728 (thinking)

Verified

Handles complex reasoning across one million tokens of context.

Alibaba Qwen · Language Models · Open

Updated 2026-06-14

About Qwen Plus 0728 (thinking)

Built on the Qwen architecture, this release provides publicly available weights for local or cloud deployment. Its one-million-token context window allows processing of entire books, large codebases, or lengthy documents in a single pass. The added thinking designation indicates optimization for structured, multi-step reasoning chains.

Strengths include retention of information over very long sequences and coherent generation across extended outputs. Because the weights are open, developers can fine-tune the model for domain-specific tasks without vendor lock-in. Text-only modality keeps inference efficient compared with multimodal variants.

Typical usage covers research synthesis, legal document review, software engineering with large repositories, and any workflow that benefits from maintaining context across tens or hundreds of thousands of tokens.
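
Before sending a very long document, it can help to estimate whether it is likely to fit in the context window. The sketch below uses a rule of thumb of about four characters per token. That ratio is an assumption for English text, not a figure from the Qwen tokenizer, and it will be less accurate for Chinese text or source code. The `reservedForOutput` default is likewise an illustrative choice.

```javascript
const CONTEXT_WINDOW_TOKENS = 1_000_000; // context length stated for this model
const CHARS_PER_TOKEN = 4; // assumption: rough heuristic, not a tokenizer measurement

// Approximate the token count of a string from its character length.
function estimateTokens(text) {
  return Math.ceil(text.length / CHARS_PER_TOKEN);
}

// Check whether the text plus a reserved output budget fits in the window.
function fitsInContext(text, reservedForOutput = 8_000) {
  const estimated = estimateTokens(text);
  return {
    estimated,
    fits: estimated + reservedForOutput <= CONTEXT_WINDOW_TOKENS,
  };
}
```

For an exact count, a real tokenizer for the model would be needed. The heuristic is only meant to flag inputs that are clearly too large before a request is made.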

Capabilities

Long-context reasoning

Code generation

Mathematical problem solving

Multilingual text generation

Complex instruction following

Logical analysis

How Qwen Plus 0728 (thinking) compares

Qwen Plus 0728 (thinking) compared with other language models on intelligence, speed, and price.

Price

USD per 1M output tokens · Lower is better · Qwen Plus 0728 (thinking) ranks #29 of 63

Cydonia 24B V4.1 · $0.50
Olmo 3 32B Think · $0.50
Solar Pro 3 · $0.60
Ring-2.6-1T · $0.63
Ling-2.6-1T · $0.63
Mercury 2 · $0.75
Qwen Plus 0728 · $0.78
Qwen Plus 0728 · $0.78
DeepSeek V3.1 · $0.79
Trinity Large Thinking · $0.80
Qwen3 Coder Next · $0.80
DeepSeek V4 Pro · $0.87
MiMo-V2.5-Pro · $0.87

Sources: Artificial Analysis (intelligence, speed) · OpenRouter (price).

Best for

Long-document analysis

The 1M-token context window enables processing and reasoning over entire books, code repositories, or lengthy reports in a single pass.

Complex multi-step reasoning

The 'thinking' variant supports extended chain-of-thought processes for tasks like advanced math, logic puzzles, or strategic planning.

Multilingual enterprise workflows

Qwen models excel at Chinese-English bilingual tasks such as translating technical documentation or handling cross-language customer support at scale.

Strengths & limitations

Strengths

+ Strong Chinese-English bilingual performance
+ Effective handling of very long inputs
+ Solid technical and coding assistance
+ Clear step-by-step reasoning style

Limitations

– Text-only modality
– May still hallucinate on niche facts
– Performance varies across domains

Cost calculator

Estimate what Qwen Plus 0728 (thinking) would cost for your usage.

Input tokens / request · Output tokens / request · Requests / month

$0.00065 per request

$6.50 estimated / month

Based on Qwen Plus 0728 (thinking)'s $0.26/1M input · $0.78/1M output. Estimate only — actual cost varies by provider and caching.
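
The estimate above is plain per-token arithmetic, sketched below using the listed rates. The example inputs (1,000 input tokens, 500 output tokens, 10,000 requests per month) are assumptions, chosen because they reproduce the figures shown. The page does not state which values the calculator uses. The function name `estimateCost` is illustrative.

```javascript
// Listed rates for Qwen Plus 0728 (thinking), in USD per 1M tokens.
const INPUT_USD_PER_MILLION = 0.26;
const OUTPUT_USD_PER_MILLION = 0.78;

// Estimate the cost of one request and of a month of identical requests.
function estimateCost({ inputTokens, outputTokens, requestsPerMonth }) {
  const perRequest =
    (inputTokens * INPUT_USD_PER_MILLION + outputTokens * OUTPUT_USD_PER_MILLION) /
    1_000_000;
  return { perRequest, perMonth: perRequest * requestsPerMonth };
}

// Assumed example inputs (not stated on the page).
const estimate = estimateCost({
  inputTokens: 1000,
  outputTokens: 500,
  requestsPerMonth: 10000,
});

console.log(estimate.perRequest.toFixed(5)); // "0.00065"
console.log(estimate.perMonth.toFixed(2)); // "6.50"
```

Because output tokens cost three times as much as input tokens at these rates, long responses affect the total more than long prompts of the same size. Thinking-style models may also emit reasoning tokens that are billed as output, depending on the provider.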

Quick start

OpenRouter's API is OpenAI-compatible, so most SDKs work once you swap the base URL. Only the model slug changes between models.

JavaScript · openai

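// Requires the "openai" npm package and an ES module context (top-level await).
import OpenAI from "openai";

// Point the OpenAI client at OpenRouter's OpenAI-compatible endpoint.
const client = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: process.env.OPENROUTER_API_KEY, // read the API key from the environment
});

// Send a single-turn chat request using this model's slug.
const completion = await client.chat.completions.create({
  model: "qwen/qwen-plus-2025-07-28:thinking",
  messages: [{ role: "user", content: "Hello!" }],
});

// Print the assistant's reply text.
console.log(completion.choices[0].message.content);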

Model slug: qwen/qwen-plus-2025-07-28:thinking

Editor's verdict

Our take on Qwen Plus 0728 (thinking)

Qwen Plus 0728 (thinking) is Alibaba Qwen's open-weight language model with a 1M-token context window.

At $0.78 per 1M output tokens, it ranks #29 of 63 models on output price, which puts it in the middle of the comparison set.

Because it is an open-weight model, you can self-host it or call it through a hosted API.

It is best suited to Chinese-English bilingual work and to tasks with very long inputs.

Frequently asked questions

What context length does it support?

Qwen Plus 0728 (thinking) supports up to 1,000,000 tokens of context.

Other Qwen models

Sibling versions in the Qwen family from Alibaba Qwen.

Qwen3.7 Max

Alibaba Qwen · Language Models

Verified

Qwen3.7 Max processes up to one million tokens in a single pass.

Open · II 56.6 · 1000K ctx · $3.75/1M out

Qwen3.7 Plus

Alibaba Qwen · Multimodal

Verified

Open-weight multimodal model for million-token text and image tasks.

Open · II 53.3 · 1000K ctx · $1.28/1M out

Qwen3.6 Max Preview

Alibaba Qwen · Language Models

Verified

Open-weight LLM optimized for long-context text reasoning and analysis.

Open · II 51.8 · 262K ctx · $6.24/1M out

Qwen3.6 27B

Alibaba Qwen · Multimodal

Verified

Multimodal model for long-context text, image, and video processing.

Open · II 45.8 · 262K ctx · $2.00/1M out

Qwen3.6 35B A3B

Alibaba Qwen · Multimodal

Verified

Multimodal model for long-context text, image, and video analysis.

Open · II 43.5 · 262K ctx · $0.97/1M out

Qwen3.6 Plus

Alibaba Qwen · Multimodal

Verified

Qwen3.6 Plus handles long multimodal sequences across text, images, and video.

Open · 1000K ctx · $1.95/1M out
