Who developed this model?

R1 Distill Qwen 32B was created by DeepSeek.

What capabilities does it offer for coding?

The model provides code generation, debugging, and multi-step problem solving.

Is it effective for mathematical reasoning?

Yes, it includes dedicated strengths in mathematical reasoning and knowledge synthesis.

R1 Distill Qwen 32B

Verified

Distilled 32B reasoning model with extended context for efficient inference.

DeepSeekLanguage ModelsOpenII 17.2

Model page

Updated 2026-06-15

About R1 Distill Qwen 32B

The model uses knowledge distillation techniques to transfer capabilities from DeepSeek's larger R1 system into the Qwen 32B base. This approach preserves core reasoning behaviors while reducing computational requirements for deployment. The resulting weights remain fully open for research and commercial use.

Its 128k token context enables handling of extended inputs such as lengthy codebases, technical documentation, or multi-turn conversations without truncation. The text-only modality focuses resources on language understanding and generation tasks. Users typically run the model locally or via APIs for applications requiring strong logical inference.

Common usage includes coding assistance, mathematical problem solving, and analysis of long-form content. The open-weight release allows fine-tuning on domain-specific data while maintaining the distilled reasoning strengths.

Capabilities

Long-context reasoning

Multi-step problem solving

Code generation and debugging

Mathematical reasoning

Instruction following

Knowledge synthesis

Benchmarks & performance

Independent evaluation scores and measured speed.

17.2

Intelligence Index

Source: Artificial Analysis

How R1 Distill Qwen 32B compares

R1 Distill Qwen 32B (striped bar) vs other language models on intelligence, speed and price.

Intelligence

Artificial Analysis Intelligence Index · Higher is better · R1 Distill Qwen 32B ranks #47 of 67

DeepSeek V3 0324

INTELLECT-3

GLM 4.7 Flash

Qwen3 Next 80B A3B Instruct

Qwen3 Coder 30B A3B Instruct

Qwen3 235B A22B

R1 Distill Qwen 32B

DeepSeek V3

R1 Distill Llama 70B

Qwen2.5 72B Instruct

Qwen3 32B

Command A

Mistral Large 2407

Price

USD per 1M output tokens · Lower is better · R1 Distill Qwen 32B ranks #30 of 141

$0.20

Llama 3.2 1B Instruct

$0.20

Reka Flash 3

$0.21

Hy3 preview

$0.24

Qwen3 14B

$0.27

Qwen3 Coder 30B A3B Instruct

$0.28

Qwen3 32B

$0.29

R1 Distill Qwen 32B

$0.30

MiMo-V2-Flash

$0.30

Step 3.5 Flash

$0.30

gpt-oss-safeguard-20b

$0.32

Llama 3.3 70B Instruct

$0.34

Llama 3.2 3B Instruct

$0.34

DeepSeek V3.2

Sources: Artificial Analysis (intelligence, speed) · OpenRouter (price).

Best for

Long document analysis

The model processes and reasons over texts up to 128000 tokens, enabling synthesis of insights from extensive reports or research collections.

Complex software development

It generates, debugs, and iterates on code through multi-step problem solving for large-scale programming projects.

Advanced quantitative tasks

Strong mathematical reasoning supports step-by-step solutions to intricate problems in science and engineering.

Strengths & limitations

Strengths

+Strong chain-of-thought reasoning from R1 distillation
+Efficient performance for 32B scale
+Handles extended contexts effectively
+Competent across STEM tasks

Limitations

–Text-only modality
–Distilled model may trail full-scale R1 on hardest problems
–Standard LLM risks of hallucination on niche topics

Cost calculator

Estimate what R1 Distill Qwen 32B would cost for your usage.

Input tokens / requestOutput tokens / requestRequests / month

$0.00044

per request

$4.35

estimated / month

Based on R1 Distill Qwen 32B's $0.29/1M input · $0.29/1M output. Estimate only — actual cost varies by provider and caching.

Quick start

OpenRouter's API is OpenAI-compatible — most SDKs work by just swapping the base URL. Only the model slug changes between models.

JavaScript · openai

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: process.env.OPENROUTER_API_KEY,
});

const completion = await client.chat.completions.create({
  model: "deepseek/deepseek-r1-distill-qwen-32b",
  messages: [{ role: "user", content: "Hello!" }],
});

console.log(completion.choices[0].message.content);

Model slug: deepseek/deepseek-r1-distill-qwen-32b

Editor's verdict

Our take on R1 Distill Qwen 32B

R1 Distill Qwen 32B is DeepSeek's open-weight language models with a 128K-token context window.

On independent testing it scores 17.2 on the Artificial Analysis Intelligence Index.

At $0.29 per 1M output tokens, it is very cost-efficient for its class.

As an open-weight model you can self-host it or call it through a hosted API.

Best suited to strong chain-of-thought reasoning from r1 distillation and efficient performance for 32b scale.

Did you find this helpful?

Frequently asked questions

It supports a context window of 128000 tokens.

User reviews

Real, verified reviews from the community shape this model's rating.

Loading reviews…

Other R models

Sibling versions in the R family from DeepSeek.

R1 0528

DeepSeek · Language Models

Verified

DeepSeek's open LLM handles extensive text with a 163k-token context.

OpenII 27.1164K ctx$2.15/1M out

R1

DeepSeek · Language Models

Verified

DeepSeek R1 handles massive text contexts as an open LLM.

OpenII 27.1164K ctx$2.50/1M out

R1 Distill Llama 70B

DeepSeek · Language Models

Verified

DeepSeek's distilled Llama 70B for advanced reasoning.

OpenII 16128K ctx$0.80/1M out

Promote R1 Distill Qwen 32B

Add this badge to your website, or share the tool.

DFeatured on DhanasviR1 Distill Qwen 32B 1

R1 Distill Qwen 32B

About R1 Distill Qwen 32B

Capabilities

Benchmarks & performance