Is Mistral Small 3.1 24B a multimodal model?

Yes, it is specified as a multimodal model from Mistral.

How do I access Mistral Small 3.1 24B?

Access is available via Mistral's official API and developer platforms.

What is the pricing for Mistral Small 3.1 24B?

Current pricing details are listed on Mistral's website and depend on usage volume.

What are common use cases for this model?

It is suited for applications combining text and image understanding over long contexts.

Mistral Small 3.1 24B

Verified

Open multimodal model for integrated text and image tasks.

MistralMultimodalOpen

Vision

Model page

Updated 2026-06-15

About Mistral Small 3.1 24B

Mistral Small 3.1 24B features a multimodal architecture that accepts both textual and visual inputs. Released with open weights by Mistral, it supports broad research and customization. Its 128000-token context window accommodates lengthy documents or multi-turn interactions.

The design enables joint reasoning across images and text without requiring separate pipelines. Open availability removes licensing barriers and allows fine-tuning for specialized domains. This approach delivers practical performance for cross-modal workloads.

Common uses include visual question answering, image captioning, and multimodal assistants. Teams integrate it into content analysis tools, educational platforms, and research prototypes. The model suits both individual experimentation and production deployments.

Capabilities

Multimodal text and image understanding

Long-context reasoning

Instruction following

Document and image analysis

Text generation and summarization

Conversational responses

How Mistral Small 3.1 24B compares

Mistral Small 3.1 24B (striped bar) vs other multimodal on intelligence, speed and price.

Price

USD per 1M output tokens · Lower is better · Mistral Small 3.1 24B ranks #24 of 124

$0.40

Gemini 2.5 Flash Lite Preview 09-2025

$0.40

GPT-5 Nano

$0.40

Seed-2.0-Mini

$0.42

Qwen3 VL 32B Instruct

$0.50

Qwen3 VL 8B Instruct

$0.52

Qwen3 VL 30B A3B Instruct

$0.55

Mistral Small 3.1 24B

$0.60

Llama 4 Maverick

$0.60

Mistral Small 4

$0.60

Saba

$0.88

Qwen3 VL 235B A22B Instruct

$0.90

Codestral 2508

$0.90

GLM 4.6V

Sources: Artificial Analysis (intelligence, speed) · OpenRouter (price).

Best for

Multimodal document processing

Handles combined text and image inputs within its 128k token context for tasks like analyzing illustrated reports or mixed-media articles.

Long-context visual Q&A

Supports detailed question answering over extended sequences that include both textual passages and associated images.

Efficient multimodal inference

Provides balanced performance for production applications needing vision-language capabilities without excessive resource demands.

Strengths & limitations

Strengths

+Efficient handling of combined text and image inputs
+Strong long-context processing up to 128k tokens
+Balanced performance for a 24B model
+Good instruction adherence

Limitations

–Smaller scale may limit depth on complex reasoning tasks
–Multimodal support restricted to text and images
–Context length capped at 128k tokens

Cost calculator

Estimate what Mistral Small 3.1 24B would cost for your usage.

Input tokens / requestOutput tokens / requestRequests / month

$0.00063

per request

$6.25

estimated / month

Based on Mistral Small 3.1 24B's $0.35/1M input · $0.55/1M output. Estimate only — actual cost varies by provider and caching.

Quick start

OpenRouter's API is OpenAI-compatible — most SDKs work by just swapping the base URL. Only the model slug changes between models.

JavaScript · openai

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: process.env.OPENROUTER_API_KEY,
});

const completion = await client.chat.completions.create({
  model: "mistralai/mistral-small-3.1-24b-instruct",
  messages: [{ role: "user", content: "Hello!" }],
});

console.log(completion.choices[0].message.content);

Model slug: mistralai/mistral-small-3.1-24b-instruct

Editor's verdict

Our take on Mistral Small 3.1 24B

Mistral Small 3.1 24B is Mistral's open-weight multimodal with a 128K-token context window.

At $0.55 per 1M output tokens, it is very cost-efficient for its class.

As an open-weight model you can self-host it or call it through a hosted API.

Best suited to efficient handling of combined text and image inputs and strong long-context processing up to 128k tokens.

Did you find this helpful?

Frequently asked questions

The model supports a context window of 128000 tokens.

User reviews

Real, verified reviews from the community shape this model's rating.

Loading reviews…

Other Mistral models

Sibling versions in the Mistral family from Mistral.

Mistral Medium 3.5

Mistral · Multimodal

Verified

Mistral's closed multimodal model for long-context text, image, and file tasks.

ClosedII 39.2262K ctx$7.50/1M out

Mistral Large 3 2512

Mistral · Multimodal

Verified

Open-weight multimodal model for long-context text, image, and file tasks.

Closed262K ctx$1.50/1M out

Mistral Small 4

Mistral · Multimodal

Verified

Open-weight multimodal model for long-context text and image tasks.

Open262K ctx$0.60/1M out

Mistral Medium 3.1

Mistral · Multimodal

Verified

Open-weight multimodal model for text, image, and file tasks.

Closed131K ctx$2.00/1M out

Mistral Small 3.2 24B

Mistral · Multimodal

Verified

Open-weight multimodal model for efficient image and text tasks.

Open128K ctx$0.20/1M out

Promote Mistral Small 3.1 24B

Add this badge to your website, or share the tool.

DFeatured on DhanasviMistral Small 3.1 24B 1

Mistral Small 3.1 24B

About Mistral Small 3.1 24B

Capabilities