Yi-Vision

Multimodal model capable of understanding and analyzing images with high accuracy.

Context Window
16K
Input Price
$0.85
per 1M tokens
Output Price
$1.70
per 1M tokens
Family
Yi
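
The per-token prices listed above can be turned into a rough per-request cost estimate. The following is a minimal sketch, not an official billing formula: the function name `estimate_cost` and the constant names are hypothetical, and it assumes billing is strictly linear in token count with no minimum charge, rounding, or separate pricing for image inputs.

```python
# Prices copied from the listing above, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.85
OUTPUT_PRICE_PER_M = 1.70


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request.

    Assumption: cost scales linearly with token counts. How image
    inputs are converted to tokens is not stated in the listing.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000
```

Under these assumptions, a request with 10,000 input tokens and 2,000 output tokens would come to roughly $0.0119 (0.0085 for input plus 0.0034 for output).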

Capabilities

Vision
Function Calling
JSON Mode
Streaming
Reasoning
Image Generation

Input Modalities

text
image

Output Modalities

text

Model Information

Status
active
Access Type
closed
License
Proprietary
Last Updated
12/31/2025