Mistral Small 3

Open-weight LLM for efficient, versatile text tasks.

Mistral · Language Models · Open · Intelligence Index 12.7
Updated 2026-06-15

About Mistral Small 3

Mistral Small 3 follows a transformer-based design focused on text inputs and outputs. Its open-weight release allows full access to model parameters for inspection and modification. The 32,768-token (32K) context window accommodates documents up to roughly that length, prompt and output combined, without truncation.

Strengths include deployment flexibility on varied hardware and support for fine-tuning on domain-specific data. Typical usage covers chat interfaces, summarization pipelines, and code assistance tools. Users integrate it into applications where transparency and local control matter most.

Capabilities

Instruction following
Text generation and summarization
Code generation and completion
Logical reasoning
Multilingual text processing
Conversational responses

Benchmarks & performance

Independent evaluation scores and measured speed.

Intelligence Index: 12.7
Tokens / sec: 163
Time to first token: 0.74s

Source: Artificial Analysis
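
As a rough illustration of what these speed figures mean in practice, the sketch below estimates total response time as time to first token plus generation time. The function name and the simple additive model are assumptions made for illustration, not part of Artificial Analysis' methodology, and real latency varies by provider and load.

```javascript
// Rough latency model (an assumption for illustration):
// total time = time to first token + output tokens / throughput.
// Defaults are the measured values quoted on this page.
function estimateResponseSeconds(
  outputTokens,
  tokensPerSecond = 163,
  timeToFirstTokenSeconds = 0.74
) {
  if (outputTokens < 0 || tokensPerSecond <= 0) {
    throw new RangeError("outputTokens must be >= 0 and tokensPerSecond > 0");
  }
  return timeToFirstTokenSeconds + outputTokens / tokensPerSecond;
}

// A 500-token answer: 0.74 + 500 / 163 seconds.
console.log(estimateResponseSeconds(500).toFixed(2)); // prints "3.81"
```

Under this model, throughput dominates for long outputs, while time to first token dominates for short ones.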

How Mistral Small 3 compares

Mistral Small 3 vs other language models on intelligence, speed and price.

Intelligence

Artificial Analysis Intelligence Index · Higher is better · Mistral Small 3 ranks #56 of 67

  • Qwen2.5 72B Instruct: 16
  • Qwen3 32B: 15
  • Command A: 14
  • Mistral Large 2407: 13
  • Qwen2.5 Coder 32B Instruct: 13
  • Qwen3 14B: 13
  • Mistral Small 3: 13
  • Qwen3 30B A3B: 13
  • Granite 4.1 8B: 12
  • Qwen3 8B: 11
  • LFM2-24B-A2B: 11
  • Phi 4: 10
  • Mistral Large: 10

Speed

Output tokens per second · Higher is better · Mistral Small 3 ranks #9 of 45

  • MiniMax M2.5: 234
  • MiniMax M2.1: 233
  • gpt-oss-20b: 218
  • Qwen3.7 Max: 197
  • Step 3.5 Flash: 191
  • Trinity Large Thinking: 171
  • Mistral Small 3: 163
  • MiMo-V2-Flash: 157
  • Qwen3 Next 80B A3B Instruct: 153
  • Ring-2.6-1T: 128
  • Hy3 preview: 124
  • MiniMax M2: 120
  • Granite 4.1 8B: 119

Price

USD per 1M output tokens · Lower is better · Mistral Small 3 ranks #5 of 141

  • Ling-2.6-flash: $0.03
  • Mistral Nemo: $0.03
  • Llama 3.1 8B Instruct: $0.03
  • Llama 3 8B Lunaris: $0.05
  • Mistral Small 3: $0.08
  • Qwen3 235B A22B Thinking 2507: $0.10
  • Qwen3 235B A22B Instruct 2507: $0.10
  • Qwen2.5 7B Instruct: $0.10
  • Granite 4.1 8B: $0.10
  • Granite 4.0 Micro: $0.11
  • LFM2-24B-A2B: $0.12
  • Gemma 3n 4B: $0.12
  • gpt-oss-20b: $0.14

Sources: Artificial Analysis (intelligence, speed) · OpenRouter (price).

Best for

Code Generation and Completion

Mistral Small 3 performs well on code generation and completion tasks, producing functional snippets and completing partial code in multiple programming languages.

Multilingual Text Processing

The model handles multilingual text processing effectively, supporting generation, translation, and analysis across various languages in a single workflow.

Document Summarization and Reasoning

It delivers strong results in text summarization and logical reasoning, condensing long inputs while maintaining key arguments and conclusions.

Strengths & limitations

Strengths

  • Efficient and fast inference
  • Cost-effective for general use
  • Solid performance relative to model size
  • Reliable on everyday language tasks

Limitations

  • Text-only modality
  • Less capable on highly complex or specialized reasoning
  • 32k context limits very long-document handling
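
One common way to work around the 32K limit is to split long inputs into pieces that fit the window and process them one at a time. The sketch below is a minimal illustration under stated assumptions: the 4-characters-per-token ratio is a rough heuristic rather than the model's actual tokenizer, `reservedTokens` is a hypothetical allowance for instructions and output, and the split happens at raw character offsets, so it can cut through words.

```javascript
// Context window size quoted on this page.
const CONTEXT_TOKENS = 32768;

// Crude token estimate. ASSUMPTION: about 4 characters per token;
// the real tokenizer will give different counts.
function estimateTokens(text, charsPerToken = 4) {
  return Math.ceil(text.length / charsPerToken);
}

// Split text into chunks whose estimated size fits the context window,
// leaving reservedTokens free for the prompt wrapper and the reply.
function splitToFit(
  text,
  { contextTokens = CONTEXT_TOKENS, reservedTokens = 2048, charsPerToken = 4 } = {}
) {
  const budgetTokens = contextTokens - reservedTokens;
  if (budgetTokens <= 0) {
    throw new RangeError("reservedTokens must be smaller than contextTokens");
  }
  const maxChars = budgetTokens * charsPerToken;
  const chunks = [];
  for (let start = 0; start < text.length; start += maxChars) {
    chunks.push(text.slice(start, start + maxChars));
  }
  return chunks;
}

// Example: a 300,000-character document needs several chunks.
const chunks = splitToFit("a".repeat(300000));
console.log(chunks.length, estimateTokens(chunks[0]));
```

For production use, counting tokens with the model's own tokenizer and splitting on paragraph or sentence boundaries would be more reliable than this character-based estimate.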

Cost calculator

Estimate what Mistral Small 3 would cost for your usage.

$0.00009 per request
$0.9000 estimated per month

Based on Mistral Small 3's pricing of $0.05 per 1M input tokens and $0.08 per 1M output tokens. This is an estimate only; actual cost varies by provider and caching.
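
The arithmetic behind such an estimate can be sketched as follows. The page does not show the usage inputs behind the figures above; 1,000 input tokens, 500 output tokens, and 10,000 requests per month are assumed here only because they reproduce $0.00009 per request and $0.90 per month. The function names are illustrative.

```javascript
// Prices quoted on this page, in USD per 1M tokens.
const INPUT_USD_PER_MILLION = 0.05;
const OUTPUT_USD_PER_MILLION = 0.08;

// Cost of one request given its input and output token counts.
function costPerRequest(inputTokens, outputTokens) {
  return (
    (inputTokens * INPUT_USD_PER_MILLION +
      outputTokens * OUTPUT_USD_PER_MILLION) / 1e6
  );
}

// Monthly cost for a fixed request size and volume.
function monthlyCost(inputTokens, outputTokens, requestsPerMonth) {
  return costPerRequest(inputTokens, outputTokens) * requestsPerMonth;
}

// ASSUMED usage: 1,000 input tokens, 500 output tokens, 10,000 requests/month.
console.log(costPerRequest(1000, 500).toFixed(5)); // prints "0.00009"
console.log(monthlyCost(1000, 500, 10000).toFixed(4)); // prints "0.9000"
```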

Quick start

OpenRouter's API is OpenAI-compatible, so most SDKs work once you swap the base URL. Only the model slug changes between models.

JavaScript · openai
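// Requires an ES module context (for `import` and top-level `await`).
import OpenAI from "openai";

// Point the OpenAI SDK at OpenRouter's OpenAI-compatible endpoint.
const client = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: process.env.OPENROUTER_API_KEY,
});

// Send a single-turn chat request to Mistral Small 3.
const completion = await client.chat.completions.create({
  model: "mistralai/mistral-small-24b-instruct-2501",
  messages: [{ role: "user", content: "Hello!" }],
});

// Print the text of the first returned choice.
console.log(completion.choices[0].message.content);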

Model slug: mistralai/mistral-small-24b-instruct-2501
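
If you prefer not to use an SDK, the same request can be sent over plain HTTP. The sketch below assumes a runtime with a global `fetch` (for example Node 18 or later); `buildChatRequest` and `askOnce` are hypothetical helper names, and the `max_tokens` default of 256 is an arbitrary choice for illustration.

```javascript
// Chat completions endpoint under the base URL used in the SDK example.
const OPENROUTER_CHAT_URL = "https://openrouter.ai/api/v1/chat/completions";
const MODEL_SLUG = "mistralai/mistral-small-24b-instruct-2501";

// Build the URL and fetch options for a single-turn chat request.
// Kept separate from the network call so it can be inspected or tested offline.
function buildChatRequest(prompt, apiKey, maxTokens = 256) {
  return {
    url: OPENROUTER_CHAT_URL,
    options: {
      method: "POST",
      headers: {
        Authorization: `Bearer ${apiKey}`,
        "Content-Type": "application/json",
      },
      body: JSON.stringify({
        model: MODEL_SLUG,
        messages: [{ role: "user", content: prompt }],
        max_tokens: maxTokens, // cap on generated tokens (illustrative default)
      }),
    },
  };
}

// Send the request and return the reply text of the first choice.
async function askOnce(prompt, apiKey) {
  const { url, options } = buildChatRequest(prompt, apiKey);
  const response = await fetch(url, options);
  if (!response.ok) {
    throw new Error(`Request failed with status ${response.status}`);
  }
  const data = await response.json();
  return data.choices[0].message.content;
}
```

A call such as `askOnce("Hello!", process.env.OPENROUTER_API_KEY)` returns a promise that resolves to the reply text.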

Editor's verdict

Our take on Mistral Small 3

Mistral Small 3 is Mistral's open-weight language model with a 32K-token (32,768) context window.

In independent testing it scores 12.7 on the Artificial Analysis Intelligence Index, running at roughly 163 tokens per second with about 0.74s to first token.

At $0.08 per 1M output tokens, it is very cost-efficient for its class.

Because it is an open-weight model, you can self-host it or call it through a hosted API.

It is best suited to general-purpose workloads where fast, efficient inference and low cost matter most.

Frequently asked questions

The model provides a context window of 32768 tokens.
