Skip to content
Mistral Small 3.2 24B logo

Mistral Small 3.2 24B

Verified

Open-weight multimodal model for efficient image and text tasks.

MistralMultimodalOpen
Vision
Model page
Updated 2026-06-15

About Mistral Small 3.2 24B

Mistral Small 3.2 24B combines vision and language processing in a single 24B parameter architecture. It is released as open weights to allow full inspection, fine-tuning, and local deployment. The design supports both image and text modalities through a shared context window of 128000 tokens.

Its strengths center on flexibility for multimodal workloads without requiring proprietary APIs. The model balances parameter count and capability to run on accessible hardware while retaining strong performance across vision-language tasks. Open availability encourages community-driven adaptations and research.

Typical usage includes visual question answering, document understanding, and image captioning pipelines. Developers integrate it into applications needing simultaneous analysis of text and images. Fine-tuning on domain-specific datasets is a common workflow for production deployments.

Capabilities

Multimodal image-text understanding
Long-context reasoning
Code generation and assistance
Instruction following
Multilingual text generation

How Mistral Small 3.2 24B compares

Mistral Small 3.2 24B (striped bar) vs other multimodal on intelligence, speed and price.

Price

USD per 1M output tokens · Lower is better · Mistral Small 3.2 24B ranks #10 of 139

$0.15
Ministral 3 8B 2512
$0.15
Qwen3.5-9B
$0.15
Gemma 3 12B
$0.16
Gemma 3 27B
$0.18
Llama Guard 4 12B
$0.20
Ministral 3 14B 2512
$0.20
Mistral Small 3.2 24B
$0.20
UI-TARS 7B
$0.26
Qwen3.5-Flash
$0.28
MiMo-V2.5
$0.30
Llama 4 Scout
$0.30
Seed 1.6 Flash
$0.30
Voxtral Small 24B 2507

Sources: Artificial Analysis (intelligence, speed) · OpenRouter (price).

Best for

Long Multimodal Document Analysis

Processes extensive reports or articles containing both text and embedded images within its 128,000-token context for summarization and extraction tasks.

Extended Visual Reasoning Conversations

Maintains coherence across lengthy dialogues that combine image inputs with detailed textual follow-up questions and instructions.

Multimodal Content Creation Workflows

Supports iterative generation and refinement of text paired with visual elements over long sessions without losing earlier context.

Strengths & limitations

Strengths

  • +Efficient inference suitable for deployment
  • +Solid integration of vision and language
  • +Supports extended 128k context windows
  • +Good balance of capability and speed

Limitations

  • Smaller model size limits performance on hardest tasks
  • Restricted to image and text modalities only
  • May underperform larger models in nuanced reasoning

Cost calculator

Estimate what Mistral Small 3.2 24B would cost for your usage.

$0.00017
per request
$1.7
estimated / month

Based on Mistral Small 3.2 24B's $0.07/1M input · $0.20/1M output. Estimate only — actual cost varies by provider and caching.

Quick start

OpenRouter's API is OpenAI-compatible — most SDKs work by just swapping the base URL. Only the model slug changes between models.

JavaScript · openai
import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: process.env.OPENROUTER_API_KEY,
});

const completion = await client.chat.completions.create({
  model: "mistralai/mistral-small-3.2-24b-instruct",
  messages: [{ role: "user", content: "Hello!" }],
});

console.log(completion.choices[0].message.content);

Model slug: mistralai/mistral-small-3.2-24b-instruct

Editor's verdict

Our take on Mistral Small 3.2 24B

Mistral Small 3.2 24B is Mistral's open-weight multimodal with a 128K-token context window.

At $0.20 per 1M output tokens, it is very cost-efficient for its class.

As an open-weight model you can self-host it or call it through a hosted API.

Best suited to efficient inference suitable for deployment and solid integration of vision and language.

Did you find this helpful?

Frequently asked questions

The model supports a context window of 128,000 tokens.

User reviews

Real, verified reviews from the community shape this model's rating.

Loading reviews…

Sign in to review

Other Mistral models

Sibling versions in the Mistral family from Mistral.

Promote Mistral Small 3.2 24B

Add this badge to your website, or share the tool.

DFeatured on DhanasviMistral Small 3.2 24B 1