M
ModelMeta
</> API
499 Models·19 Providers·Updated 2h ago
All systems operational
Back to models

Qwen3-Omni-Flash

Qwen3-Omni model discovered from Alibaba Cloud Model Studio official documentation. Category: fast. Capabilities: reasoning, audio, audio.

Context window65,536 tokens
Max output tokens16,384 tokens
Max input tokens49,152 tokens

Pricing

TypeInputOutput
Standard$0.25 / 1M tokens$0.96 / 1M tokens

Parameters

ParameterDefaultRange
Temperature10 to 2
Top P10 to 1
Frequency penalty0-2 to 2
Presence penalty0-2 to 2
Max stop sequences4

Supported features

Capabilities

  • Streaming
  • Function Calling
  • Json Mode
  • Reasoning
  • System Prompt

Endpoints

  • Chat Completions
  • Fine Tuning

Use cases

  • text-chat
  • function-calling
  • reasoning

Model specifications

Model IDqwen3-omni-flash
Familyqwen3-omni
Access typeOpen Source
Input modalitiestext, audio, video
Output modalitiestext, audio