Where can users access Mistral Small 3?

It is available via Mistral AI's official API and partner platforms that host Mistral models.

What is the pricing for Mistral Small 3?

Current pricing details are listed on Mistral AI's official documentation and billing pages.

Is Mistral Small 3 suitable for conversational applications?

Yes, its conversational response and instruction-following capabilities make it appropriate for chat-based interfaces.

Can Mistral Small 3 handle code-related tasks?

The model supports code generation and completion as one of its core capabilities.

Mistral Small 3

Verified

Open-weight LLM for efficient, versatile text tasks.

MistralLanguage ModelsOpenII 12.7

Model page

Updated 2026-06-15

About Mistral Small 3

Mistral Small 3 follows a transformer-based design focused on text inputs and outputs. Its open-weight release allows full access to model parameters for inspection and modification. The 32768-token context enables handling of longer documents without truncation.

Strengths include deployment flexibility on varied hardware and support for fine-tuning on domain-specific data. Typical usage covers chat interfaces, summarization pipelines, and code assistance tools. Users integrate it into applications where transparency and local control matter most.

Capabilities

Instruction following

Text generation and summarization

Code generation and completion

Logical reasoning

Multilingual text processing

Conversational responses

Benchmarks & performance

Independent evaluation scores and measured speed.

12.7

Intelligence Index

163

Tokens / sec

0.74s

Time to first token

Source: Artificial Analysis

How Mistral Small 3 compares

Mistral Small 3 (striped bar) vs other language models on intelligence, speed and price.

Intelligence

Artificial Analysis Intelligence Index · Higher is better · Mistral Small 3 ranks #56 of 67

Qwen2.5 72B Instruct

Qwen3 32B

Command A

Mistral Large 2407

Qwen2.5 Coder 32B Instruct

Qwen3 14B

Mistral Small 3

Qwen3 30B A3B

Granite 4.1 8B

Qwen3 8B

LFM2-24B-A2B

Phi 4

Mistral Large

Speed

Output tokens per second · Higher is better · Mistral Small 3 ranks #9 of 45

234

MiniMax M2.5

233

MiniMax M2.1

218

gpt-oss-20b

197

Qwen3.7 Max

191

Step 3.5 Flash

171

Trinity Large Thinking

163

Mistral Small 3

157

MiMo-V2-Flash

153

Qwen3 Next 80B A3B Instruct

128

Ring-2.6-1T

124

Hy3 preview

120

MiniMax M2

119

Granite 4.1 8B

Price

USD per 1M output tokens · Lower is better · Mistral Small 3 ranks #5 of 141

$0.03

Ling-2.6-flash

$0.03

Mistral Nemo

$0.03

Llama 3.1 8B Instruct

$0.05

Llama 3 8B Lunaris

$0.08

Mistral Small 3

$0.10

Qwen3 235B A22B Thinking 2507

$0.10

Qwen3 235B A22B Instruct 2507

$0.10

Qwen2.5 7B Instruct

$0.10

Granite 4.1 8B

$0.11

Granite 4.0 Micro

$0.12

LFM2-24B-A2B

$0.12

Gemma 3n 4B

$0.14

gpt-oss-20b

Sources: Artificial Analysis (intelligence, speed) · OpenRouter (price).

Best for

Code Generation and Completion

Mistral Small 3 performs well on code generation and completion tasks, producing functional snippets and completing partial code in multiple programming languages.

Multilingual Text Processing

The model handles multilingual text processing effectively, supporting generation, translation, and analysis across various languages in a single workflow.

Document Summarization and Reasoning

It delivers strong results in text summarization and logical reasoning, condensing long inputs while maintaining key arguments and conclusions.

Strengths & limitations

Strengths

+Efficient and fast inference
+Cost-effective for general use
+Solid performance relative to model size
+Reliable on everyday language tasks

Limitations

–Text-only modality
–Less capable on highly complex or specialized reasoning
–32k context limits very long-document handling

Cost calculator

Estimate what Mistral Small 3 would cost for your usage.

Input tokens / requestOutput tokens / requestRequests / month

$0.00009

per request

$0.9000

estimated / month

Based on Mistral Small 3's $0.05/1M input · $0.08/1M output. Estimate only — actual cost varies by provider and caching.

Quick start

OpenRouter's API is OpenAI-compatible — most SDKs work by just swapping the base URL. Only the model slug changes between models.

JavaScript · openai

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: process.env.OPENROUTER_API_KEY,
});

const completion = await client.chat.completions.create({
  model: "mistralai/mistral-small-24b-instruct-2501",
  messages: [{ role: "user", content: "Hello!" }],
});

console.log(completion.choices[0].message.content);

Model slug: mistralai/mistral-small-24b-instruct-2501

Editor's verdict

Our take on Mistral Small 3

Mistral Small 3 is Mistral's open-weight language models with a 33K-token context window.

On independent testing it scores 12.7 on the Artificial Analysis Intelligence Index, running at roughly 163 tokens per second with about 0.74s to first token.

At $0.08 per 1M output tokens, it is very cost-efficient for its class.

As an open-weight model you can self-host it or call it through a hosted API.

Best suited to efficient and fast inference and cost-effective for general use.

Did you find this helpful?

Frequently asked questions

The model provides a context window of 32768 tokens.

User reviews

Real, verified reviews from the community shape this model's rating.

Loading reviews…

Other Mistral models

Sibling versions in the Mistral family from Mistral.

Mistral Medium 3.5

Mistral · Multimodal

Verified

Mistral's closed multimodal model for long-context text, image, and file tasks.

ClosedII 39.2262K ctx$7.50/1M out

Mistral Small 4

Mistral · Multimodal

Verified

Open-weight multimodal model for long-context text and image tasks.

OpenII 27.8262K ctx$0.60/1M out

Mistral Medium 3.1

Mistral · Multimodal

Verified

Open-weight multimodal model for text, image, and file tasks.

ClosedII 21.3131K ctx$2.00/1M out

Mistral Medium 3

Mistral · Multimodal

Verified

Open-weight multimodal model for text, image, and file tasks.

ClosedII 18.8131K ctx$2.00/1M out

Mistral Large 2407

Mistral · Language Models

Verified

Mistral Large 2407 offers open-weight multimodal capabilities for text and file tasks.

ClosedII 13131K ctx$6.00/1M out

Mistral Large

Mistral · Language Models

Verified

Mistral Large processes text and files with a 128k context window.

ClosedII 9.9128K ctx$6.00/1M out

Promote Mistral Small 3

Add this badge to your website, or share the tool.

DFeatured on DhanasviMistral Small 3 0

Mistral Small 3

About Mistral Small 3

Capabilities

Benchmarks & performance