ModelMeta
452 model profiles·22 providers·Refresh cadence hourly
Back to providers

Infini-AI

Infini-AI (无问芯穹) provides AI computing optimization and large model deployment on diverse chips. They offer GenStudio API service with multiple pre-built LLM models from various vendors.

Models
57

Indexed for this provider.

Region
Global

Country metadata when available.

This page is now route-specific, so provider discovery is indexable and shareable instead of living behind a client-only subview on the homepage.
Infini-AI

Kimi K2.5

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.571

per 1M tokens

Output

$3

per 1M tokens

Context

256K

window

Max output

Not listed

tokens

Infini-AI

Kimi K2 Instruct

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.571

per 1M tokens

Output

$2.29

per 1M tokens

Context

256K

window

Max output

Not listed

tokens

Infini-AI

Kimi K2 Thinking

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.571

per 1M tokens

Output

$2.29

per 1M tokens

Context

256K

window

Max output

Not listed

tokens

Infini-AI

Baichuan M2 32b

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.414

per 1M tokens

Output

$1.66

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Deepseek R1

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.571

per 1M tokens

Output

$2.29

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Deepseek R1 Distill Qwen 32b

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.214

per 1M tokens

Output

$0.857

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Deepseek V3

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.286

per 1M tokens

Output

$1.14

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Deepseek V3.1

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.571

per 1M tokens

Output

$1.71

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Deepseek V3.1 Terminus

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.571

per 1M tokens

Output

$1.71

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Deepseek V3.2

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.286

per 1M tokens

Output

$0.429

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Deepseek V3.2 Exp

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.286

per 1M tokens

Output

$0.429

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Deepseek V3.2 Thinking

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.286

per 1M tokens

Output

$0.429

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Ernie 4.5 21b A3b

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.071

per 1M tokens

Output

$0.286

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Ernie 4.5 300b A47b

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.114

per 1M tokens

Output

$0.457

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Glm 4.5

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.429

per 1M tokens

Output

$2

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Glm 4.5 Air

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.114

per 1M tokens

Output

$0.857

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Glm 4.5v

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.571

per 1M tokens

Output

$1.71

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Glm 4.6

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.429

per 1M tokens

Output

$2

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Glm 4.6v

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.286

per 1M tokens

Output

$0.857

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Glm 4.7

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.429

per 1M tokens

Output

$2

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Glm 5

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.857

per 1M tokens

Output

$3.14

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Megrez 3b Instruct

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

Contact

per 1M tokens

Output

Contact

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Minimax M2

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.3

per 1M tokens

Output

$1.2

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Minimax M2.1

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.3

per 1M tokens

Output

$1.2

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Minimax M2.5

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.3

per 1M tokens

Output

$1.2

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Minimax M2.7

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.3

per 1M tokens

Output

$1.2

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Qwen2.5 14b Instruct

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.143

per 1M tokens

Output

$0.429

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Qwen2.5 32b Instruct

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.286

per 1M tokens

Output

$0.857

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Qwen2.5 72b Instruct

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.571

per 1M tokens

Output

$1.71

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Qwen2.5 7b Instruct

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.071

per 1M tokens

Output

$0.143

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Qwen2.5 Coder 32b Instruct

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.286

per 1M tokens

Output

$0.857

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Qwen2.5 Vl 32b Instruct

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$1.01

per 1M tokens

Output

$3.02

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Qwen2.5 Vl 72b Instruct

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$2.01

per 1M tokens

Output

$6.03

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Qwen2.5 Vl 7b Instruct

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.251

per 1M tokens

Output

$0.629

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Qwen3 14b

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.143

per 1M tokens

Output

$1.43

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Qwen3 235b A22b

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.286

per 1M tokens

Output

$2.86

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Qwen3 235b A22b Instruct 2507

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.286

per 1M tokens

Output

$1.14

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Qwen3 30b A3b

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.107

per 1M tokens

Output

$1.07

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Qwen3 32b

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.286

per 1M tokens

Output

$2.86

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Qwen3 8b

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.071

per 1M tokens

Output

$0.714

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Qwen3 Coder 480b A35b Instruct

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$1.29

per 1M tokens

Output

$5.14

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Qwen3 Next 80b A3b Instruct

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.143

per 1M tokens

Output

$0.571

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Qwen3 Next 80b A3b Thinking

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.143

per 1M tokens

Output

$1.43

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Qwen3 Vl 235b A22b Instruct

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.286

per 1M tokens

Output

$1.14

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Qwen3 Vl 235b A22b Thinking

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.286

per 1M tokens

Output

$2.86

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Qwq 32b

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

$0.286

per 1M tokens

Output

$0.857

per 1M tokens

Context

128K

window

Max output

Not listed

tokens

Infini-AI

Bge M3

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

Contact

per 1M tokens

Output

Contact

per 1M tokens

Context

Not listed

window

Max output

Not listed

tokens

Infini-AI

Bge Reranker V2 M3

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

Contact

per 1M tokens

Output

Contact

per 1M tokens

Context

Not listed

window

Max output

Not listed

tokens

Infini-AI

Deepseek Ocr

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

Contact

per 1M tokens

Output

Contact

per 1M tokens

Context

Not listed

window

Max output

Not listed

tokens

Infini-AI

Deepseek Ocr 2

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

Contact

per 1M tokens

Output

Contact

per 1M tokens

Context

Not listed

window

Max output

Not listed

tokens

Infini-AI

Doubao Seedream 4 0 250828

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

Contact

per 1M tokens

Output

Contact

per 1M tokens

Context

Not listed

window

Max output

Not listed

tokens

Infini-AI

Doubao Seedream 5 0 260128

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

Contact

per 1M tokens

Output

Contact

per 1M tokens

Context

Not listed

window

Max output

Not listed

tokens

Infini-AI

Hailuo

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

Contact

per 1M tokens

Output

Contact

per 1M tokens

Context

Not listed

window

Max output

Not listed

tokens

Infini-AI

Jina Embeddings V2 Base Code

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

Contact

per 1M tokens

Output

Contact

per 1M tokens

Context

Not listed

window

Max output

Not listed

tokens

Infini-AI

Jina Embeddings V2 Base Zh

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

Contact

per 1M tokens

Output

Contact

per 1M tokens

Context

Not listed

window

Max output

Not listed

tokens

Infini-AI

Seedance 1.0

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

Contact

per 1M tokens

Output

Contact

per 1M tokens

Context

Not listed

window

Max output

Not listed

tokens

Infini-AI

Vidu

Structured profile for pricing, context windows, runtime controls, and model capabilities.

Input

Contact

per 1M tokens

Output

Contact

per 1M tokens

Context

Not listed

window

Max output

Not listed

tokens