Back to models
Mistral Small
Cost-efficient reasoning model for low-latency workloads
| Context window | 32,000 tokens |
| Max output tokens | 8,192 tokens |
Pricing
| Type | Input | Output |
|---|---|---|
| Standard | $0.10 / 1M tokens | $0.30 / 1M tokens |
Parameters
| Parameter | Default | Range |
|---|---|---|
| Max stop sequences | 0 | — |
Supported features
Capabilities
- Streaming
- Function Calling
- Structured Output
- Json Mode
- System Prompt
Endpoints
- Chat Completions
Use cases
- text-chat
- function-calling
Model specifications
| Model ID | mistral-small-latest |
| Family | Mistral Small |
| Access type | Closed |
| Input modalities | text |
| Output modalities | text |
| Last updated | December 31, 2025 |