What is the pricing for Qwen3 Next 80B A3B Thinking?

Pricing details are not specified in the available model information.

How can users access Qwen3 Next 80B A3B Thinking?

The model is developed by Alibaba Qwen and may be accessed through their official platforms or APIs.

What types of tasks is Qwen3 Next 80B A3B Thinking best suited for?

It is designed for long-context reasoning, code generation, mathematical problem solving, and multilingual text generation with complex instructions.

Does Qwen3 Next 80B A3B Thinking support chain-of-thought reasoning?

Yes, the model includes explicit step-by-step chain-of-thought capabilities.

Qwen3 Next 80B A3B Thinking

Verified

Open-weight LLM optimized for extended reasoning over long texts.

Alibaba QwenLanguage ModelsOpen

Model page

Updated 2026-06-15

About Qwen3 Next 80B A3B Thinking

Its design centers on efficient handling of lengthy input sequences enabled by the large context capacity. Released as open weights, the model supports customization and local deployment by the community. The inclusion of 'Thinking' in its name indicates a focus on structured, step-by-step processing.

Key strengths lie in maintaining coherence across extensive documents and conversations. It processes text inputs exclusively without support for other modalities. This makes it suitable for scenarios where context preservation is critical.

Users typically apply it to research analysis, technical writing, and multi-turn dialogue systems. Developers integrate the model into tools requiring detailed textual comprehension and generation.

Capabilities

Long-context reasoning

Step-by-step chain-of-thought

Code generation and analysis

Mathematical problem solving

Multilingual text generation

Complex instruction following

How Qwen3 Next 80B A3B Thinking compares

Qwen3 Next 80B A3B Thinking (striped bar) vs other language models on intelligence, speed and price.

Price

USD per 1M output tokens · Lower is better · Qwen3 Next 80B A3B Thinking ranks #44 of 98

$0.63

Ring-2.6-1T

$0.63

Ling-2.6-1T

$0.75

Mercury 2

$0.77

DeepSeek V3 0324

$0.78

Qwen Plus 0728

$0.78

Qwen Plus 0728

$0.78

Qwen3 Next 80B A3B Thinking

$0.79

DeepSeek V3.1

$0.80

Qwen3 Coder Next

$0.80

Skyfall 36B V2

$0.80

Coder Large

$0.85

Trinity Large Thinking

$0.85

GLM 4.5 Air

Sources: Artificial Analysis (intelligence, speed) · OpenRouter (price).

Best for

Long Document Reasoning

The model excels at processing and reasoning over extensive inputs up to 262144 tokens, such as analyzing full research papers or lengthy legal documents with sustained coherence.

Mathematical Problem Solving

It performs well on complex math tasks by applying step-by-step chain-of-thought to break down equations, proofs, and quantitative problems accurately.

Code Generation and Analysis

The model handles code-related work including writing, debugging, and reviewing large codebases while following detailed technical instructions.

Strengths & limitations

Strengths

+Strong reasoning via thinking-focused design
+Effective 256k context utilization
+Competitive coding and math performance
+Efficient MoE-style inference

Limitations

–Text-only modality
–High compute requirements for full model
–May over-reason on simple tasks

Cost calculator

Estimate what Qwen3 Next 80B A3B Thinking would cost for your usage.

Input tokens / requestOutput tokens / requestRequests / month

$0.00049

per request

$4.9

estimated / month

Based on Qwen3 Next 80B A3B Thinking's $0.10/1M input · $0.78/1M output. Estimate only — actual cost varies by provider and caching.

Download & self-host Qwen3 Next 80B A3B Thinking

This is an open-weight model. Download the weights from Hugging Face or load it directly with Transformers.

81B

Parameters (safetensors)

21,529

Monthly downloads

489

Hugging Face likes

Download · transformers

# Install the Hugging Face CLI
pip install -U "huggingface_hub[cli]"

# Download the model weights
hf download Qwen/Qwen3-Next-80B-A3B-Thinking

# Or load it directly in Python
from transformers import AutoModelForCausalLM, AutoTokenizer
tok = AutoTokenizer.from_pretrained("Qwen/Qwen3-Next-80B-A3B-Thinking")
model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen3-Next-80B-A3B-Thinking", device_map="auto")

View Qwen/Qwen3-Next-80B-A3B-Thinking on Hugging Face

Inference providers

Hosted APIs that serve Qwen3 Next 80B A3B Thinking (via Hugging Face Inference Providers).

novita

Quick start

OpenRouter's API is OpenAI-compatible — most SDKs work by just swapping the base URL. Only the model slug changes between models.

JavaScript · openai

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: process.env.OPENROUTER_API_KEY,
});

const completion = await client.chat.completions.create({
  model: "qwen/qwen3-next-80b-a3b-thinking",
  messages: [{ role: "user", content: "Hello!" }],
});

console.log(completion.choices[0].message.content);

Model slug: qwen/qwen3-next-80b-a3b-thinking

Editor's verdict

Our take on Qwen3 Next 80B A3B Thinking

Qwen3 Next 80B A3B Thinking is Alibaba Qwen's open-weight language models with a 262K-token context window.

At $0.78 per 1M output tokens, it is very cost-efficient for its class.

As an open-weight model you can self-host it (81B parameters) or call it through a hosted API.

Best suited to strong reasoning via thinking-focused design and effective 256k context utilization.

Did you find this helpful?

Frequently asked questions

The model provides a context window of 262144 tokens.

User reviews

Real, verified reviews from the community shape this model's rating.

Loading reviews…

Other Qwen models

Sibling versions in the Qwen family from Alibaba Qwen.

Qwen3.7 Max

Alibaba Qwen · Language Models

Verified

Qwen3.7 Max processes up to one million tokens in a single pass.

OpenII 56.61000K ctx$3.75/1M out

Qwen3.7 Plus

Alibaba Qwen · Multimodal

Verified

Open-weight multimodal model for million-token text and image tasks.

OpenII 53.31000K ctx$1.28/1M out

Qwen3.6 Max Preview

Alibaba Qwen · Language Models

Verified

Open-weight LLM optimized for long-context text reasoning and analysis.

OpenII 51.8262K ctx$6.24/1M out

Qwen3.6 27B

Alibaba Qwen · Multimodal

Verified

Multimodal model for long-context text, image, and video processing.

OpenII 45.8262K ctx$2.00/1M out

Qwen3.6 35B A3B

Alibaba Qwen · Multimodal

Verified

Multimodal model for long-context text, image, and video analysis.

OpenII 43.5262K ctx$0.97/1M out

Qwen3.5-Flash

Alibaba Qwen · Multimodal

Verified

Fast open-weight multimodal model for million-token text, image, and video tasks.

Open1000K ctx$0.26/1M out

Promote Qwen3 Next 80B A3B Thinking

Add this badge to your website, or share the tool.

DFeatured on DhanasviQwen3 Next 80B A3B Thinking 1

Qwen3 Next 80B A3B Thinking

About Qwen3 Next 80B A3B Thinking

Capabilities