Qwen-VL-Max
Multimodal vision-language model with advanced image understanding capabilities.
Context Window: 33K
Input Price: $14.00 per 1M tokens
Output Price: $14.00 per 1M tokens
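As a rough illustration of how the per-token prices translate into a per-request cost, the arithmetic can be sketched as below. The two price constants are taken from the listing; the function name `estimate_cost` and the example token counts are hypothetical. The listing does not say how image inputs are converted to tokens, so the sketch assumes the token counts are already known.

```python
# Prices as listed for Qwen-VL-Max (USD per 1M tokens).
INPUT_PRICE_PER_M = 14.00
OUTPUT_PRICE_PER_M = 14.00


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for one request.

    Hypothetical helper: token counts must be supplied by the caller,
    since the listing does not describe how images are tokenized.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Example figures (assumed): 20,000 input tokens and 1,000 output tokens.
    cost = estimate_cost(20_000, 1_000)
    print(f"${cost:.4f}")
```

Because the input and output prices are identical here, the cost depends only on the total token count, but keeping the two rates separate lets the same sketch be reused for models that price them differently.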
Family: Qwen-VL
Capabilities: Vision, Function Calling, JSON Mode, Streaming, Reasoning, Image Generation
Input Modalities: text, image
Output Modalities: text
Model Information
Status: active
Access Type: open source
License: Proprietary
Last Updated: 12/31/2025