Qwen3-VL-32B-Instruct
Qwen3 Vision-Language 32B model with multimodal capabilities.
Input
$0.2
per 1M tokens
Output
$0.6
per 1M tokens
Context
262K
window
Max output
Not listed
tokens
SiliconFlow (硅基流动) is a leading AI infrastructure platform in China, providing optimized inference for 100+ open-source models including DeepSeek, Qwen, GLM, and more. Offers competitive pricing, high performance, and comprehensive model support including LLMs, vision models, image/video generation, embeddings, and audio models.
Indexed for this provider.
Country metadata when available.
Qwen3 Vision-Language 32B model with multimodal capabilities.
Input
$0.2
per 1M tokens
Output
$0.6
per 1M tokens
Context
262K
window
Max output
Not listed
tokens
DeepSeek V3.2 model on SiliconFlow platform with optimized inference.
Input
$0.27
per 1M tokens
Output
$0.42
per 1M tokens
Context
164K
window
Max output
Not listed
tokens
MiniMax model on SiliconFlow platform.
Input
$0.3
per 1M tokens
Output
$1.2
per 1M tokens
Context
1M
window
Max output
Not listed
tokens
Kimi model on SiliconFlow platform.
Input
$0.23
per 1M tokens
Output
$3
per 1M tokens
Context
200K
window
Max output
Not listed
tokens
Kimi model on SiliconFlow platform.
Input
$0.4
per 1M tokens
Output
$2
per 1M tokens
Context
200K
window
Max output
Not listed
tokens
Kimi model on SiliconFlow platform.
Input
$0.4
per 1M tokens
Output
$2
per 1M tokens
Context
200K
window
Max output
Not listed
tokens
Kimi model on SiliconFlow platform.
Input
$0.58
per 1M tokens
Output
$3
per 1M tokens
Context
200K
window
Max output
Not listed
tokens
Kimi model on SiliconFlow platform.
Input
$0.58
per 1M tokens
Output
$3.5
per 1M tokens
Context
200K
window
Max output
Not listed
tokens
Experimental version of DeepSeek V3.2 with latest features.
Input
$0.27
per 1M tokens
Output
$0.41
per 1M tokens
Context
164K
window
Max output
Not listed
tokens
Pro version of DeepSeek V3.2 with enhanced performance on SiliconFlow.
Input
$0.27
per 1M tokens
Output
$0.42
per 1M tokens
Context
164K
window
Max output
Not listed
tokens
GLM-4.7 Pro model on SiliconFlow platform.
Input
$0.42
per 1M tokens
Output
$2.2
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
GLM-5 Pro model on SiliconFlow platform.
Input
$0.3
per 1M tokens
Output
$2.55
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
DeepSeek R1 reasoning model with extended thinking capabilities.
Input
$0
per 1M tokens
Output
$0
per 1M tokens
Context
262K
window
Max output
Not listed
tokens
DeepSeek R1 reasoning model with extended thinking capabilities.
Input
$0.18
per 1M tokens
Output
$0.18
per 1M tokens
Context
262K
window
Max output
Not listed
tokens
DeepSeek R1 reasoning model with extended thinking capabilities.
Input
$0
per 1M tokens
Output
$0
per 1M tokens
Context
262K
window
Max output
Not listed
tokens
Qwen Vision-Language model with multimodal capabilities.
Input
$0.27
per 1M tokens
Output
$0.27
per 1M tokens
Context
262K
window
Max output
Not listed
tokens
Qwen Vision-Language model with multimodal capabilities.
Input
$0.59
per 1M tokens
Output
$0.59
per 1M tokens
Context
262K
window
Max output
Not listed
tokens
Qwen Vision-Language model with multimodal capabilities.
Input
$0.59
per 1M tokens
Output
$0.59
per 1M tokens
Context
262K
window
Max output
Not listed
tokens
Structured profile for pricing, context windows, runtime controls, and model capabilities.
Input
$0.07
per 1M tokens
Output
$0.28
per 1M tokens
Context
262K
window
Max output
Not listed
tokens
Structured profile for pricing, context windows, runtime controls, and model capabilities.
Input
$0.09
per 1M tokens
Output
$0.6
per 1M tokens
Context
262K
window
Max output
Not listed
tokens
Structured profile for pricing, context windows, runtime controls, and model capabilities.
Input
$0.13
per 1M tokens
Output
$0.6
per 1M tokens
Context
262K
window
Max output
Not listed
tokens
Structured profile for pricing, context windows, runtime controls, and model capabilities.
Input
$0.09
per 1M tokens
Output
$0.3
per 1M tokens
Context
262K
window
Max output
Not listed
tokens
Structured profile for pricing, context windows, runtime controls, and model capabilities.
Input
$0.09
per 1M tokens
Output
$0.3
per 1M tokens
Context
262K
window
Max output
Not listed
tokens
Structured profile for pricing, context windows, runtime controls, and model capabilities.
Input
$0.14
per 1M tokens
Output
$0.57
per 1M tokens
Context
262K
window
Max output
Not listed
tokens
Structured profile for pricing, context windows, runtime controls, and model capabilities.
Input
$0.06
per 1M tokens
Output
$0.06
per 1M tokens
Context
262K
window
Max output
Not listed
tokens
Structured profile for pricing, context windows, runtime controls, and model capabilities.
Input
$0.07
per 1M tokens
Output
$0.28
per 1M tokens
Context
262K
window
Max output
Not listed
tokens
Structured profile for pricing, context windows, runtime controls, and model capabilities.
Input
$0.25
per 1M tokens
Output
$1
per 1M tokens
Context
262K
window
Max output
Not listed
tokens
Qwen Omni multimodal model on SiliconFlow platform.
Input
$0.29
per 1M tokens
Output
$0.5
per 1M tokens
Context
262K
window
Max output
Not listed
tokens
Qwen Omni multimodal model on SiliconFlow platform.
Input
$0.29
per 1M tokens
Output
$1
per 1M tokens
Context
262K
window
Max output
Not listed
tokens
Qwen Omni multimodal model on SiliconFlow platform.
Input
$0.29
per 1M tokens
Output
$1.5
per 1M tokens
Context
262K
window
Max output
Not listed
tokens
Qwen Vision-Language model with multimodal capabilities.
Input
$0.3
per 1M tokens
Output
$1.5
per 1M tokens
Context
262K
window
Max output
Not listed
tokens
Qwen Vision-Language model with multimodal capabilities.
Input
$0.45
per 1M tokens
Output
$3.5
per 1M tokens
Context
262K
window
Max output
Not listed
tokens
Qwen3 Vision-Language 30B active 3B model with efficient inference.
Input
$0.29
per 1M tokens
Output
$1
per 1M tokens
Context
262K
window
Max output
Not listed
tokens
Qwen3 Vision-Language 30B active 3B with reasoning capabilities.
Input
$0.29
per 1M tokens
Output
$1
per 1M tokens
Context
262K
window
Max output
Not listed
tokens
Qwen3 Vision-Language 32B with enhanced reasoning capabilities.
Input
$0.2
per 1M tokens
Output
$1.5
per 1M tokens
Context
262K
window
Max output
Not listed
tokens
Qwen3 Vision-Language 8B model, efficient and capable.
Input
$0.18
per 1M tokens
Output
$0.68
per 1M tokens
Context
262K
window
Max output
Not listed
tokens
Qwen Vision-Language model with multimodal capabilities.
Input
$0.18
per 1M tokens
Output
$1
per 1M tokens
Context
262K
window
Max output
Not listed
tokens
Structured profile for pricing, context windows, runtime controls, and model capabilities.
Input
$0.15
per 1M tokens
Output
$0.58
per 1M tokens
Context
262K
window
Max output
Not listed
tokens
DeepSeek R1 reasoning model with extended thinking capabilities.
Input
$0.5
per 1M tokens
Output
$2.18
per 1M tokens
Context
164K
window
Max output
Not listed
tokens
Pro version with enhanced performance on SiliconFlow.
Input
$0.5
per 1M tokens
Output
$2.18
per 1M tokens
Context
164K
window
Max output
Not listed
tokens
DeepSeek V3 model on SiliconFlow platform.
Input
$0.25
per 1M tokens
Output
$1
per 1M tokens
Context
164K
window
Max output
Not listed
tokens
DeepSeek V3.1 model on SiliconFlow platform.
Input
$0.27
per 1M tokens
Output
$1
per 1M tokens
Context
164K
window
Max output
Not listed
tokens
DeepSeek V3.1 model on SiliconFlow platform.
Input
$0.27
per 1M tokens
Output
$1
per 1M tokens
Context
164K
window
Max output
Not listed
tokens
Pro version with enhanced performance on SiliconFlow.
Input
$0.27
per 1M tokens
Output
$1
per 1M tokens
Context
164K
window
Max output
Not listed
tokens
Pro version with enhanced performance on SiliconFlow.
Input
$0.25
per 1M tokens
Output
$1
per 1M tokens
Context
164K
window
Max output
Not listed
tokens
Structured profile for pricing, context windows, runtime controls, and model capabilities.
Input
$0.14
per 1M tokens
Output
$0.28
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
Baidu ERNIE model on SiliconFlow platform.
Input
$0.28
per 1M tokens
Output
$1.1
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
Structured profile for pricing, context windows, runtime controls, and model capabilities.
Input
$0
per 1M tokens
Output
$0
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
Structured profile for pricing, context windows, runtime controls, and model capabilities.
Input
$0
per 1M tokens
Output
$0
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
Structured profile for pricing, context windows, runtime controls, and model capabilities.
Input
$0.14
per 1M tokens
Output
$0.86
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
Structured profile for pricing, context windows, runtime controls, and model capabilities.
Input
$0.3
per 1M tokens
Output
$0.9
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
Structured profile for pricing, context windows, runtime controls, and model capabilities.
Input
$0.39
per 1M tokens
Output
$1.9
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
Structured profile for pricing, context windows, runtime controls, and model capabilities.
Input
$0.3
per 1M tokens
Output
$0.9
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
Free GLM-4 9B model on SiliconFlow platform.
Input
$0
per 1M tokens
Output
$0
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
THUDM GLM-Z1 model on SiliconFlow platform.
Input
$0
per 1M tokens
Output
$0
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
THUDM GLM-Z1 model on SiliconFlow platform.
Input
$0
per 1M tokens
Output
$0
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
Tencent Hunyuan model on SiliconFlow platform.
Input
$0.14
per 1M tokens
Output
$0.57
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
Tencent Hunyuan machine translation model on SiliconFlow platform.
Input
$0
per 1M tokens
Output
$0
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
InternLM chat model on SiliconFlow platform.
Input
$0
per 1M tokens
Output
$0
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
InclusionAI model on SiliconFlow platform.
Input
$0.14
per 1M tokens
Output
$0.57
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
InclusionAI model on SiliconFlow platform.
Input
$0
per 1M tokens
Output
$0
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
InclusionAI model on SiliconFlow platform.
Input
$0.14
per 1M tokens
Output
$0.57
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
ByteDance Seed model on SiliconFlow platform.
Input
$0.21
per 1M tokens
Output
$0.57
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
Step AI model on SiliconFlow platform.
Input
$0.1
per 1M tokens
Output
$0.3
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
Free Qwen2.5 14B model on SiliconFlow platform.
Input
$0
per 1M tokens
Output
$0
per 1M tokens
Context
32K
window
Max output
Not listed
tokens
Free Qwen 2.5 model on SiliconFlow platform.
Input
$0
per 1M tokens
Output
$0
per 1M tokens
Context
32K
window
Max output
Not listed
tokens
Free Qwen 2.5 model on SiliconFlow platform.
Input
$0
per 1M tokens
Output
$0
per 1M tokens
Context
32K
window
Max output
Not listed
tokens
Free Qwen 2.5 model on SiliconFlow platform.
Input
$0
per 1M tokens
Output
$0
per 1M tokens
Context
32K
window
Max output
Not listed
tokens
Free Qwen 2.5 model on SiliconFlow platform.
Input
$0.05
per 1M tokens
Output
$0.05
per 1M tokens
Context
32K
window
Max output
Not listed
tokens
Free Qwen2.5 7B model on SiliconFlow platform.
Input
$0.05
per 1M tokens
Output
$0.05
per 1M tokens
Context
32K
window
Max output
Not listed
tokens
Free Qwen 2.5 model on SiliconFlow platform.
Input
$0
per 1M tokens
Output
$0
per 1M tokens
Context
32K
window
Max output
Not listed
tokens
Qwen 3.5 series model on SiliconFlow platform.
Input
$0.3
per 1M tokens
Output
$1.2
per 1M tokens
Context
32K
window
Max output
Not listed
tokens
Qwen 3.5 series model on SiliconFlow platform.
Input
$0
per 1M tokens
Output
$0
per 1M tokens
Context
32K
window
Max output
Not listed
tokens
Qwen 3.5 series model on SiliconFlow platform.
Input
$0.15
per 1M tokens
Output
$0.6
per 1M tokens
Context
32K
window
Max output
Not listed
tokens
Qwen 3.5 series model on SiliconFlow platform.
Input
$0.5
per 1M tokens
Output
$2
per 1M tokens
Context
32K
window
Max output
Not listed
tokens
Qwen 3.5 series model on SiliconFlow platform.
Input
$0
per 1M tokens
Output
$0
per 1M tokens
Context
32K
window
Max output
Not listed
tokens
Qwen 3.5 series model on SiliconFlow platform.
Input
$0
per 1M tokens
Output
$0
per 1M tokens
Context
32K
window
Max output
Not listed
tokens
NetEase Youdao embedding model on SiliconFlow platform.
Input
$0
per 1M tokens
Output
$0
per 1M tokens
Context
8.2K
window
Max output
Not listed
tokens
NetEase Youdao reranker model on SiliconFlow platform.
Input
$0
per 1M tokens
Output
$0
per 1M tokens
Context
8.2K
window
Max output
Not listed
tokens
BAAI BGE embedding model on SiliconFlow platform.
Input
$0
per 1M tokens
Output
$0
per 1M tokens
Context
8.2K
window
Max output
Not listed
tokens
BAAI BGE embedding model on SiliconFlow platform.
Input
$0
per 1M tokens
Output
$0
per 1M tokens
Context
8.2K
window
Max output
Not listed
tokens
BAAI BGE embedding model on SiliconFlow platform.
Input
$0
per 1M tokens
Output
$0
per 1M tokens
Context
8.2K
window
Max output
Not listed
tokens
BAAI BGE embedding model on SiliconFlow platform.
Input
$0
per 1M tokens
Output
$0
per 1M tokens
Context
8.2K
window
Max output
Not listed
tokens
BAAI BGE reranker model on SiliconFlow platform.
Input
$0
per 1M tokens
Output
$0
per 1M tokens
Context
8.2K
window
Max output
Not listed
tokens
BAAI BGE reranker model on SiliconFlow platform.
Input
$0
per 1M tokens
Output
$0
per 1M tokens
Context
8.2K
window
Max output
Not listed
tokens
Qwen embedding model on SiliconFlow platform.
Input
$0
per 1M tokens
Output
$0
per 1M tokens
Context
8.2K
window
Max output
Not listed
tokens
Qwen embedding model on SiliconFlow platform.
Input
$0
per 1M tokens
Output
$0
per 1M tokens
Context
8.2K
window
Max output
Not listed
tokens
Qwen embedding model on SiliconFlow platform.
Input
$0
per 1M tokens
Output
$0
per 1M tokens
Context
8.2K
window
Max output
Not listed
tokens
Qwen reranker model on SiliconFlow platform.
Input
$0
per 1M tokens
Output
$0
per 1M tokens
Context
8.2K
window
Max output
Not listed
tokens
Qwen reranker model on SiliconFlow platform.
Input
$0
per 1M tokens
Output
$0
per 1M tokens
Context
8.2K
window
Max output
Not listed
tokens
Qwen reranker model on SiliconFlow platform.
Input
$0
per 1M tokens
Output
$0
per 1M tokens
Context
8.2K
window
Max output
Not listed
tokens
FunAudioLLM CosyVoice TTS model on SiliconFlow platform.
Input
$7.15
per 1M tokens
Output
$7.15
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
IndexTeam TTS model on SiliconFlow platform.
Input
$7.15
per 1M tokens
Output
$7.15
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
Kwai Kolors image generation model on SiliconFlow platform.
Input
$0.04
per 1M tokens
Output
$0.04
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
MOSS TTS model on SiliconFlow platform.
Input
$15
per 1M tokens
Output
$15
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
Qwen image generation and editing model on SiliconFlow platform.
Input
$0
per 1M tokens
Output
$0
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
Qwen image generation and editing model on SiliconFlow platform.
Input
$0
per 1M tokens
Output
$0
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
Qwen image generation and editing model on SiliconFlow platform.
Input
$0
per 1M tokens
Output
$0
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
Wan AI video generation model on SiliconFlow platform.
Input
$0.29
per 1M tokens
Output
$0.29
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
Wan AI video generation model on SiliconFlow platform.
Input
$0.29
per 1M tokens
Output
$0.29
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens