Back to models

Qwen-VL-Max

Multimodal vision-language model with advanced image understanding capabilities.

Context Window
33K
Input Price
$14.00
per 1M tokens
Output Price
$14.00
per 1M tokens
Family
Qwen-VL

Capabilities

Vision
Function Calling
JSON Mode
Streaming
Reasoning
Image Generation

Input Modalities

textimage

Output Modalities

text

Model Information

Status
active
Access Type
open source
License
Proprietary
Last Updated
12/31/2025