Back to models
Llama 3.2 90B Vision Instruct
Meta's largest vision model, optimized for document understanding and visual reasoning.
⭐ Flagship
Context Window
128K
Input Price
$N/A
per 1M tokens
Output Price
$N/A
per 1M tokens
Family
Llama 3.2
Capabilities
Vision
Function Calling
JSON Mode
Streaming
Reasoning
Image Generation
Input Modalities
text
image
Output Modalities
text
Model Information
Status
active
Access Type
open weights
License
Proprietary
Last Updated
12/30/2025