Back to models
o4-mini
Fast, cost-efficient reasoning model, succeeded by GPT-5 mini
| Context window | 128,000 tokens |
| Max output tokens | 4,096 tokens |
Pricing
| Type | Input | Output |
|---|---|---|
| Standard | $2.00 / 1M tokens | $8.00 / 1M tokens |
| Cached input | $0.50 / 1M tokens | |
Parameters
| Parameter | Default | Range |
|---|---|---|
| Max stop sequences | 0 | — |
Supported features
Capabilities
- Streaming
- Reasoning
- System Prompt
Endpoints
- Chat Completions
- Batch
Use cases
- text-chat
- reasoning
Model specifications
| Model ID | o4-mini |
| Family | Other |
| Access type | Closed |
| Input modalities | text |
| Output modalities | text |
| Last updated | January 3, 2026 |