Qwen•qwen3-omni

Qwen3-Omni-Flash

Name: Qwen3-Omni-Flash
Brand: Qwen
Price: 0.25 USD
Availability: InStock

Active

Qwen3-Omni model discovered from Alibaba Cloud Model Studio official documentation. Category: fast. Capabilities: reasoning, audio, audio.

Open provider page Official docs Pricing source

Context window

66K

65,536 tokens

Max output

16K

16,384 tokens

Input price

$0.25

Per 1M input tokens

Output price

$0.96

Per 1M output tokens

Modalities

Text + Audio

Input: Text, Audio, Video; output: Text, Audio

API surface

Chat Completions

2 supported endpoints

Overview

Where this model fits best

Use this section to quickly decide whether the model belongs in chat, coding, reasoning, embedding, rerank, vision, audio, or agent workflows.

Use cases

What this model should be considered for

Selection signal

text-chat

function-calling

reasoning

Best fit

Use this model when you need a well-documented, structured option inside the registry and want a single place to inspect pricing, capabilities, and operational limits.

Capabilities

What you can actually do with it

Feature flags, API endpoints, and tool support are separated so integration constraints are easy to scan.

Capabilities

High-level features exposed by the model runtime.

StreamingFunction CallingJson ModeReasoningSystem Prompt

Endpoints

APIs and surfaces this model can be called through.

Chat CompletionsFine Tuning

Pricing

Pricing and billing signals

Known prices are shown per 1M tokens. Missing official prices are marked as not published instead of being treated as free.

Standard API

Input

$0.25 / 1M

Output

$0.96 / 1M

Default request pricing when using the primary endpoint.

Controls

Runtime knobs worth knowing

Supported request parameters help developers understand sampling, output limits, reasoning controls, and structured output behavior.

Temperature

0 to 2

Top P

0 to 1

Presence Penalty

-2 to 2

Frequency Penalty

-2 to 2

Response Format

text

text, json_object

Specifications

Technical reference

Canonical identifiers, family, modalities, token limits, training information, and update metadata.

Model ID

qwen3-omni-flash

Use this exact identifier in API calls and SDK configuration.

Provider

Qwen

Family

qwen3-omni

Access type

Open Source

Input modalities

text, audio, video

Output modalities

text, audio

Max input tokens

49,152

Max output tokens

16,384