Yi-Vision
Multimodal model that accepts text and image input and returns text, intended for image understanding and analysis.
Context Window: 16K
Input Price: $0.85 per 1M tokens
Output Price: $1.70 per 1M tokens
Family: Yi
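As a rough illustration of how the per-1M-token prices listed above translate into a per-request cost, here is a minimal sketch. The function name `estimate_cost` and the sample token counts are hypothetical. The sketch also assumes that image inputs are billed as ordinary input tokens, which this listing does not state.

```python
# Prices copied from the listing (USD per 1M tokens).
INPUT_PRICE_PER_M = 0.85
OUTPUT_PRICE_PER_M = 1.70


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request from its token counts.

    Assumption: every input token (text or image-derived) is billed at the
    input rate; the listing does not say how images are counted.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: 12,000 input tokens and 800 output tokens.
    print(f"Estimated cost: ${estimate_cost(12_000, 800):.5f}")
```

Because the output rate is twice the input rate, a long response costs more than an equally long prompt.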
Capabilities
Vision
Function Calling
JSON Mode
Streaming
Reasoning
Image Generation
Input Modalities: text, image
Output Modalities: text
Model Information
Status: active
Access Type: closed
License: Proprietary
Last Updated: 12/31/2025