Back to models
Qwen3-Omni-Flash
Qwen3-Omni model discovered from Alibaba Cloud Model Studio official documentation. Category: fast. Capabilities: reasoning, audio, audio.
| Context window | 65,536 tokens |
| Max output tokens | 16,384 tokens |
| Max input tokens | 49,152 tokens |
Pricing
| Type | Input | Output |
|---|---|---|
| Standard | $0.25 / 1M tokens | $0.96 / 1M tokens |
Parameters
| Parameter | Default | Range |
|---|---|---|
| Temperature | 1 | 0 to 2 |
| Top P | 1 | 0 to 1 |
| Frequency penalty | 0 | -2 to 2 |
| Presence penalty | 0 | -2 to 2 |
| Max stop sequences | 4 | — |
Supported features
Capabilities
- Streaming
- Function Calling
- Json Mode
- Reasoning
- System Prompt
Endpoints
- Chat Completions
- Fine Tuning
Use cases
- text-chat
- function-calling
- reasoning
Model specifications
| Model ID | qwen3-omni-flash |
| Family | qwen3-omni |
| Access type | Open Source |
| Input modalities | text, audio, video |
| Output modalities | text, audio |