ModelMeta
452 model profiles·22 providers·Refresh cadence hourly
Back to providers

SiliconFlow

SiliconFlow (硅基流动) is a leading AI infrastructure platform in China, providing optimized inference for 100+ open-source models including DeepSeek, Qwen, GLM, and more. Offers competitive pricing, high performance, and comprehensive model support including LLMs, vision models, image/video generation, embeddings, and audio models.

Models
100

Indexed for this provider.

Region
CN

Country metadata when available.

This page is now route-specific, so provider discovery is indexable and shareable instead of living behind a client-only subview on the homepage.
SiliconFlow
Flagship

Qwen3-VL-32B-Instruct

Qwen3 Vision-Language 32B model with multimodal capabilities.

Input

$0.2

per 1M tokens

Output

$0.6

per 1M tokens

Context

262K

window

Max output

Not listed

tokens

SiliconFlow
Flagship

DeepSeek-V3.2

DeepSeek V3.2 model on SiliconFlow platform with optimized inference.

Input

$0.27

per 1M tokens

Output

$0.42

per 1M tokens

Context

164K

window

Max output

Not listed

tokens

SiliconFlow
Flagship

MiniMax-M2.5

MiniMax model on SiliconFlow platform.

Input

$0.3

per 1M tokens

Output

$1.2

per 1M tokens

Context

1M

window

Max output

Not listed

tokens

SiliconFlow
Flagship

Kimi-K2.5

Kimi model on SiliconFlow platform.

Input

$0.23

per 1M tokens

Output

$3

per 1M tokens

Context

200K

window

Max output

Not listed

tokens

SiliconFlow
Flagship

Kimi-K2-Instruct-0905

Kimi model on SiliconFlow platform.

Input

$0.4

per 1M tokens

Output

$2

per 1M tokens

Context

200K

window

Max output

Not listed

tokens

SiliconFlow
Flagship

Kimi-K2-Instruct-0905

Kimi model on SiliconFlow platform.

Input

$0.4

per 1M tokens

Output

$2

per 1M tokens

Context

200K

window

Max output

Not listed

tokens

SiliconFlow
Flagship

Kimi-K2-Thinking

Kimi model on SiliconFlow platform.

Input

$0.58

per 1M tokens

Output

$3

per 1M tokens

Context

200K

window

Max output

Not listed

tokens

SiliconFlow
Flagship

Kimi-K2-Thinking

Kimi model on SiliconFlow platform.

Input

$0.58

per 1M tokens

Output

$3.5

per 1M tokens

Context

200K

window

Max output

Not listed

tokens

SiliconFlow
Flagship

DeepSeek-V3.2-Exp

Experimental version of DeepSeek V3.2 with latest features.

Input

$0.27

per 1M tokens

Output

$0.41

per 1M tokens

Context

164K

window

Max output

Not listed

tokens

SiliconFlow
Flagship

DeepSeek-V3.2 Pro

Pro version of DeepSeek V3.2 with enhanced performance on SiliconFlow.

Input

$0.27

per 1M tokens

Output

$0.42

per 1M tokens

Context

164K

window

Max output

Not listed

tokens

SiliconFlow
Flagship

GLM-4.7

GLM-4.7 Pro model on SiliconFlow platform.

Input

$0.42

per 1M tokens

Output

$2.2

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

SiliconFlow
Flagship

GLM-5

GLM-5 Pro model on SiliconFlow platform.

Input

$0.3

per 1M tokens

Output

$2.55

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

SiliconFlow

DeepSeek-R1-Distill-Qwen-14B

DeepSeek R1 reasoning model with extended thinking capabilities.

Input

$0

per 1M tokens

Output

$0

per 1M tokens

Context

262K

window

Max output

Not listed

tokens

SiliconFlow

DeepSeek-R1-Distill-Qwen-32B

DeepSeek R1 reasoning model with extended thinking capabilities.

Input

$0.18

per 1M tokens

Output

$0.18

per 1M tokens

Context

262K

window

Max output

Not listed

tokens

SiliconFlow

DeepSeek-R1-Distill-Qwen-7B

DeepSeek R1 reasoning model with extended thinking capabilities.

Input

$0

per 1M tokens

Output

$0

per 1M tokens

Context

262K

window

Max output

Not listed

tokens

SiliconFlow

Qwen2.5-VL-32B-Instruct

Qwen Vision-Language model with multimodal capabilities.

Input

$0.27

per 1M tokens

Output

$0.27

per 1M tokens

Context

262K

window

Max output

Not listed

tokens

SiliconFlow

Qwen2.5-VL-72B-Instruct

Qwen Vision-Language model with multimodal capabilities.

Input

$0.59

per 1M tokens

Output

$0.59

per 1M tokens

Context

262K

window

Max output

Not listed

tokens

SiliconFlow

Qwen2-VL-72B-Instruct

Qwen Vision-Language model with multimodal capabilities.

Input

$0.59

per 1M tokens

Output

$0.59

per 1M tokens

Context

262K

window

Max output

Not listed

tokens

SiliconFlow

Qwen3-14B

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.07

per 1M tokens

Output

$0.28

per 1M tokens

Context

262K

window

Max output

Not listed

tokens

SiliconFlow

Qwen3-235B-A22B-Instruct-2507

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.09

per 1M tokens

Output

$0.6

per 1M tokens

Context

262K

window

Max output

Not listed

tokens

SiliconFlow

Qwen3-235B-A22B-Thinking-2507

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.13

per 1M tokens

Output

$0.6

per 1M tokens

Context

262K

window

Max output

Not listed

tokens

SiliconFlow

Qwen3-30B-A3B-Instruct-2507

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.09

per 1M tokens

Output

$0.3

per 1M tokens

Context

262K

window

Max output

Not listed

tokens

SiliconFlow

Qwen3-30B-A3B-Thinking-2507

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.09

per 1M tokens

Output

$0.3

per 1M tokens

Context

262K

window

Max output

Not listed

tokens

SiliconFlow

Qwen3-32B

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.14

per 1M tokens

Output

$0.57

per 1M tokens

Context

262K

window

Max output

Not listed

tokens

SiliconFlow

Qwen3-8B

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.06

per 1M tokens

Output

$0.06

per 1M tokens

Context

262K

window

Max output

Not listed

tokens

SiliconFlow

Qwen3-Coder-30B-A3B-Instruct

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.07

per 1M tokens

Output

$0.28

per 1M tokens

Context

262K

window

Max output

Not listed

tokens

SiliconFlow

Qwen3-Coder-480B-A35B-Instruct

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.25

per 1M tokens

Output

$1

per 1M tokens

Context

262K

window

Max output

Not listed

tokens

SiliconFlow

Qwen3-Omni-30B-A3B-Captioner

Qwen Omni multimodal model on SiliconFlow platform.

Input

$0.29

per 1M tokens

Output

$0.5

per 1M tokens

Context

262K

window

Max output

Not listed

tokens

SiliconFlow

Qwen3-Omni-30B-A3B-Instruct

Qwen Omni multimodal model on SiliconFlow platform.

Input

$0.29

per 1M tokens

Output

$1

per 1M tokens

Context

262K

window

Max output

Not listed

tokens

SiliconFlow

Qwen3-Omni-30B-A3B-Thinking

Qwen Omni multimodal model on SiliconFlow platform.

Input

$0.29

per 1M tokens

Output

$1.5

per 1M tokens

Context

262K

window

Max output

Not listed

tokens

SiliconFlow

Qwen3-VL-235B-A22B-Instruct

Qwen Vision-Language model with multimodal capabilities.

Input

$0.3

per 1M tokens

Output

$1.5

per 1M tokens

Context

262K

window

Max output

Not listed

tokens

SiliconFlow

Qwen3-VL-235B-A22B-Thinking

Qwen Vision-Language model with multimodal capabilities.

Input

$0.45

per 1M tokens

Output

$3.5

per 1M tokens

Context

262K

window

Max output

Not listed

tokens

SiliconFlow

Qwen3-VL-30B-A3B-Instruct

Qwen3 Vision-Language 30B active 3B model with efficient inference.

Input

$0.29

per 1M tokens

Output

$1

per 1M tokens

Context

262K

window

Max output

Not listed

tokens

SiliconFlow

Qwen3-VL-30B-A3B-Thinking

Qwen3 Vision-Language 30B active 3B with reasoning capabilities.

Input

$0.29

per 1M tokens

Output

$1

per 1M tokens

Context

262K

window

Max output

Not listed

tokens

SiliconFlow

Qwen3-VL-32B-Thinking

Qwen3 Vision-Language 32B with enhanced reasoning capabilities.

Input

$0.2

per 1M tokens

Output

$1.5

per 1M tokens

Context

262K

window

Max output

Not listed

tokens

SiliconFlow

Qwen3-VL-8B-Instruct

Qwen3 Vision-Language 8B model, efficient and capable.

Input

$0.18

per 1M tokens

Output

$0.68

per 1M tokens

Context

262K

window

Max output

Not listed

tokens

SiliconFlow

Qwen3-VL-8B-Thinking

Qwen Vision-Language model with multimodal capabilities.

Input

$0.18

per 1M tokens

Output

$1

per 1M tokens

Context

262K

window

Max output

Not listed

tokens

SiliconFlow

QwQ-32B

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.15

per 1M tokens

Output

$0.58

per 1M tokens

Context

262K

window

Max output

Not listed

tokens

SiliconFlow

DeepSeek-R1

DeepSeek R1 reasoning model with extended thinking capabilities.

Input

$0.5

per 1M tokens

Output

$2.18

per 1M tokens

Context

164K

window

Max output

Not listed

tokens

SiliconFlow

DeepSeek-R1 Pro

Pro version with enhanced performance on SiliconFlow.

Input

$0.5

per 1M tokens

Output

$2.18

per 1M tokens

Context

164K

window

Max output

Not listed

tokens

SiliconFlow

DeepSeek-V3

DeepSeek V3 model on SiliconFlow platform.

Input

$0.25

per 1M tokens

Output

$1

per 1M tokens

Context

164K

window

Max output

Not listed

tokens

SiliconFlow

DeepSeek-V3.1

DeepSeek V3.1 model on SiliconFlow platform.

Input

$0.27

per 1M tokens

Output

$1

per 1M tokens

Context

164K

window

Max output

Not listed

tokens

SiliconFlow

DeepSeek-V3.1-Terminus

DeepSeek V3.1 model on SiliconFlow platform.

Input

$0.27

per 1M tokens

Output

$1

per 1M tokens

Context

164K

window

Max output

Not listed

tokens

SiliconFlow

DeepSeek-V3.1-Terminus Pro

Pro version with enhanced performance on SiliconFlow.

Input

$0.27

per 1M tokens

Output

$1

per 1M tokens

Context

164K

window

Max output

Not listed

tokens

SiliconFlow

DeepSeek-V3 Pro

Pro version with enhanced performance on SiliconFlow.

Input

$0.25

per 1M tokens

Output

$1

per 1M tokens

Context

164K

window

Max output

Not listed

tokens

SiliconFlow

DeepSeek-V2.5

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.14

per 1M tokens

Output

$0.28

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

SiliconFlow

ERNIE-4.5-300B-A47B

Baidu ERNIE model on SiliconFlow platform.

Input

$0.28

per 1M tokens

Output

$1.1

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

SiliconFlow

GLM-4.1V-9B-Thinking

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0

per 1M tokens

Output

$0

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

SiliconFlow

GLM-4-32B-0414

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0

per 1M tokens

Output

$0

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

SiliconFlow

GLM-4.5-Air

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.14

per 1M tokens

Output

$0.86

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

SiliconFlow

GLM-4.5V

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.3

per 1M tokens

Output

$0.9

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

SiliconFlow

GLM-4.6

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.39

per 1M tokens

Output

$1.9

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

SiliconFlow

GLM-4.6V

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.3

per 1M tokens

Output

$0.9

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

SiliconFlow

GLM-4-9B

Free GLM-4 9B model on SiliconFlow platform.

Input

$0

per 1M tokens

Output

$0

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

SiliconFlow

GLM-Z1-32B-0414

THUDM GLM-Z1 model on SiliconFlow platform.

Input

$0

per 1M tokens

Output

$0

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

SiliconFlow

GLM-Z1-9B-0414

THUDM GLM-Z1 model on SiliconFlow platform.

Input

$0

per 1M tokens

Output

$0

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

SiliconFlow

Hunyuan-A13B-Instruct

Tencent Hunyuan model on SiliconFlow platform.

Input

$0.14

per 1M tokens

Output

$0.57

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

SiliconFlow

Hunyuan-MT-7B

Tencent Hunyuan machine translation model on SiliconFlow platform.

Input

$0

per 1M tokens

Output

$0

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

SiliconFlow

internlm2_5-7b-chat

InternLM chat model on SiliconFlow platform.

Input

$0

per 1M tokens

Output

$0

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

SiliconFlow

Ling-flash-2.0

InclusionAI model on SiliconFlow platform.

Input

$0.14

per 1M tokens

Output

$0.57

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

SiliconFlow

Ling-mini-2.0

InclusionAI model on SiliconFlow platform.

Input

$0

per 1M tokens

Output

$0

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

SiliconFlow

Ring-flash-2.0

InclusionAI model on SiliconFlow platform.

Input

$0.14

per 1M tokens

Output

$0.57

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

SiliconFlow

Seed-OSS-36B-Instruct

ByteDance Seed model on SiliconFlow platform.

Input

$0.21

per 1M tokens

Output

$0.57

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

SiliconFlow

Step-3.5-Flash

Step AI model on SiliconFlow platform.

Input

$0.1

per 1M tokens

Output

$0.3

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

SiliconFlow

Qwen2.5-14B-Instruct

Free Qwen2.5 14B model on SiliconFlow platform.

Input

$0

per 1M tokens

Output

$0

per 1M tokens

Context

32K

window

Max output

Not listed

tokens

SiliconFlow

Qwen2.5-32B-Instruct

Free Qwen 2.5 model on SiliconFlow platform.

Input

$0

per 1M tokens

Output

$0

per 1M tokens

Context

32K

window

Max output

Not listed

tokens

SiliconFlow

Qwen2.5-72B-Instruct

Free Qwen 2.5 model on SiliconFlow platform.

Input

$0

per 1M tokens

Output

$0

per 1M tokens

Context

32K

window

Max output

Not listed

tokens

SiliconFlow

Qwen2.5-72B-Instruct-128K

Free Qwen 2.5 model on SiliconFlow platform.

Input

$0

per 1M tokens

Output

$0

per 1M tokens

Context

32K

window

Max output

Not listed

tokens

SiliconFlow

Qwen2.5-7B-Instruct

Free Qwen 2.5 model on SiliconFlow platform.

Input

$0.05

per 1M tokens

Output

$0.05

per 1M tokens

Context

32K

window

Max output

Not listed

tokens

SiliconFlow

Qwen2.5-7B-Instruct

Free Qwen2.5 7B model on SiliconFlow platform.

Input

$0.05

per 1M tokens

Output

$0.05

per 1M tokens

Context

32K

window

Max output

Not listed

tokens

SiliconFlow

Qwen2.5-Coder-32B-Instruct

Free Qwen 2.5 model on SiliconFlow platform.

Input

$0

per 1M tokens

Output

$0

per 1M tokens

Context

32K

window

Max output

Not listed

tokens

SiliconFlow

Qwen3.5-122B-A10B

Qwen 3.5 series model on SiliconFlow platform.

Input

$0.3

per 1M tokens

Output

$1.2

per 1M tokens

Context

32K

window

Max output

Not listed

tokens

SiliconFlow

Qwen3.5-27B

Qwen 3.5 series model on SiliconFlow platform.

Input

$0

per 1M tokens

Output

$0

per 1M tokens

Context

32K

window

Max output

Not listed

tokens

SiliconFlow

Qwen3.5-35B-A3B

Qwen 3.5 series model on SiliconFlow platform.

Input

$0.15

per 1M tokens

Output

$0.6

per 1M tokens

Context

32K

window

Max output

Not listed

tokens

SiliconFlow

Qwen3.5-397B-A17B

Qwen 3.5 series model on SiliconFlow platform.

Input

$0.5

per 1M tokens

Output

$2

per 1M tokens

Context

32K

window

Max output

Not listed

tokens

SiliconFlow

Qwen3.5-4B

Qwen 3.5 series model on SiliconFlow platform.

Input

$0

per 1M tokens

Output

$0

per 1M tokens

Context

32K

window

Max output

Not listed

tokens

SiliconFlow

Qwen3.5-9B

Qwen 3.5 series model on SiliconFlow platform.

Input

$0

per 1M tokens

Output

$0

per 1M tokens

Context

32K

window

Max output

Not listed

tokens

SiliconFlow

bce-embedding-base_v1

NetEase Youdao embedding model on SiliconFlow platform.

Input

$0

per 1M tokens

Output

$0

per 1M tokens

Context

8.2K

window

Max output

Not listed

tokens

SiliconFlow

bce-reranker-base_v1

NetEase Youdao reranker model on SiliconFlow platform.

Input

$0

per 1M tokens

Output

$0

per 1M tokens

Context

8.2K

window

Max output

Not listed

tokens

SiliconFlow

bge-large-en-v1.5

BAAI BGE embedding model on SiliconFlow platform.

Input

$0

per 1M tokens

Output

$0

per 1M tokens

Context

8.2K

window

Max output

Not listed

tokens

SiliconFlow

bge-large-zh-v1.5

BAAI BGE embedding model on SiliconFlow platform.

Input

$0

per 1M tokens

Output

$0

per 1M tokens

Context

8.2K

window

Max output

Not listed

tokens

SiliconFlow

bge-m3

BAAI BGE embedding model on SiliconFlow platform.

Input

$0

per 1M tokens

Output

$0

per 1M tokens

Context

8.2K

window

Max output

Not listed

tokens

SiliconFlow

bge-m3

BAAI BGE embedding model on SiliconFlow platform.

Input

$0

per 1M tokens

Output

$0

per 1M tokens

Context

8.2K

window

Max output

Not listed

tokens

SiliconFlow

bge-reranker-v2-m3

BAAI BGE reranker model on SiliconFlow platform.

Input

$0

per 1M tokens

Output

$0

per 1M tokens

Context

8.2K

window

Max output

Not listed

tokens

SiliconFlow

bge-reranker-v2-m3

BAAI BGE reranker model on SiliconFlow platform.

Input

$0

per 1M tokens

Output

$0

per 1M tokens

Context

8.2K

window

Max output

Not listed

tokens

SiliconFlow

Qwen3-Embedding-0.6B

Qwen embedding model on SiliconFlow platform.

Input

$0

per 1M tokens

Output

$0

per 1M tokens

Context

8.2K

window

Max output

Not listed

tokens

SiliconFlow

Qwen3-Embedding-4B

Qwen embedding model on SiliconFlow platform.

Input

$0

per 1M tokens

Output

$0

per 1M tokens

Context

8.2K

window

Max output

Not listed

tokens

SiliconFlow

Qwen3-Embedding-8B

Qwen embedding model on SiliconFlow platform.

Input

$0

per 1M tokens

Output

$0

per 1M tokens

Context

8.2K

window

Max output

Not listed

tokens

SiliconFlow

Qwen3-Reranker-0.6B

Qwen reranker model on SiliconFlow platform.

Input

$0

per 1M tokens

Output

$0

per 1M tokens

Context

8.2K

window

Max output

Not listed

tokens

SiliconFlow

Qwen3-Reranker-4B

Qwen reranker model on SiliconFlow platform.

Input

$0

per 1M tokens

Output

$0

per 1M tokens

Context

8.2K

window

Max output

Not listed

tokens

SiliconFlow

Qwen3-Reranker-8B

Qwen reranker model on SiliconFlow platform.

Input

$0

per 1M tokens

Output

$0

per 1M tokens

Context

8.2K

window

Max output

Not listed

tokens

SiliconFlow

CosyVoice2-0.5B

FunAudioLLM CosyVoice TTS model on SiliconFlow platform.

Input

$7.15

per 1M tokens

Output

$7.15

per 1M tokens

Context

Not listed

window

Max output

Not listed

tokens

SiliconFlow

IndexTTS-2

IndexTeam TTS model on SiliconFlow platform.

Input

$7.15

per 1M tokens

Output

$7.15

per 1M tokens

Context

Not listed

window

Max output

Not listed

tokens

SiliconFlow

Kolors

Kwai Kolors image generation model on SiliconFlow platform.

Input

$0.04

per 1M tokens

Output

$0.04

per 1M tokens

Context

Not listed

window

Max output

Not listed

tokens

SiliconFlow

MOSS-TTSD-v0.5

MOSS TTS model on SiliconFlow platform.

Input

$15

per 1M tokens

Output

$15

per 1M tokens

Context

Not listed

window

Max output

Not listed

tokens

SiliconFlow

Qwen-Image

Qwen image generation and editing model on SiliconFlow platform.

Input

$0

per 1M tokens

Output

$0

per 1M tokens

Context

Not listed

window

Max output

Not listed

tokens

SiliconFlow

Qwen-Image-Edit

Qwen image generation and editing model on SiliconFlow platform.

Input

$0

per 1M tokens

Output

$0

per 1M tokens

Context

Not listed

window

Max output

Not listed

tokens

SiliconFlow

Qwen-Image-Edit-2509

Qwen image generation and editing model on SiliconFlow platform.

Input

$0

per 1M tokens

Output

$0

per 1M tokens

Context

Not listed

window

Max output

Not listed

tokens

SiliconFlow

Wan2.2-I2V-A14B

Wan AI video generation model on SiliconFlow platform.

Input

$0.29

per 1M tokens

Output

$0.29

per 1M tokens

Context

Not listed

window

Max output

Not listed

tokens

SiliconFlow

Wan2.2-T2V-A14B

Wan AI video generation model on SiliconFlow platform.

Input

$0.29

per 1M tokens

Output

$0.29

per 1M tokens

Context

Not listed

window

Max output

Not listed

tokens