Command R+
Cohere's most powerful model, optimized for RAG and tool use
Input: $3 per 1M tokens
Output: $15 per 1M tokens
Context window: 128K
Max output tokens: Not listed
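The per-1M-token prices listed above can be turned into a rough per-request cost with simple arithmetic. The following is a minimal sketch, not part of any Cohere SDK: the function name `estimate_cost_usd` and the token counts in the example are hypothetical, and the prices are the Command R+ figures shown on this page.

```python
def estimate_cost_usd(input_tokens, output_tokens,
                      input_price_per_m, output_price_per_m):
    """Estimate the USD cost of one request from per-1M-token prices.

    Hypothetical helper for illustration only; prices are passed in
    explicitly so the same function works for any model on this page.
    """
    input_cost = (input_tokens / 1_000_000) * input_price_per_m
    output_cost = (output_tokens / 1_000_000) * output_price_per_m
    return input_cost + output_cost


# Command R+ as listed here: $3 input and $15 output per 1M tokens.
# The token counts below are made-up example values.
cost = estimate_cost_usd(10_000, 2_000, 3.0, 15.0)
print(f"${cost:.4f}")
```

Models whose output price is shown as "Contact" have no listed output rate, so only the input term can be estimated for them this way.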
Cohere focuses on enterprise AI solutions, specializing in retrieval-augmented generation (RAG) and complex tool use. Its Command R series is optimized for long-context industrial applications.
Cohere reasoning-optimized model
Input: $2.5 per 1M tokens
Output: $10 per 1M tokens
Context window: 289K
Max output tokens: Not listed
Cohere multimodal model with vision capabilities
Input: $2.5 per 1M tokens
Output: $10 per 1M tokens
Context window: 128K
Max output tokens: Not listed
Cohere specialized translation model
Input: $2.5 per 1M tokens
Output: $10 per 1M tokens
Context window: 9K
Max output tokens: Not listed
High-performance model for long-context tasks and RAG
Input: $0.5 per 1M tokens
Output: $1.5 per 1M tokens
Context window: 128K
Max output tokens: Not listed
Compact 7B parameter model for efficient inference
Input: $0.038 per 1M tokens
Output: $0.15 per 1M tokens
Context window: 128K
Max output tokens: Not listed
Fast reranking model
Input: $0.2 per 1M tokens
Output: Contact for pricing
Context window: 32.8K
Max output tokens: Not listed
Professional reranking model
Input: $2 per 1M tokens
Output: Contact for pricing
Context window: 32.8K
Max output tokens: Not listed
Multimodal embedding model
Input: $0.1 per 1M tokens
Output: Contact for pricing
Context window: 8.2K
Max output tokens: Not listed
Semantic search reranking model for English
Input: $2 per 1M tokens
Output: Contact for pricing
Context window: 4.1K
Max output tokens: Not listed
State-of-the-art English text embedding model
Input: $0.1 per 1M tokens
Output: Contact for pricing
Context window: 512
Max output tokens: Not listed
High-performance multilingual embedding model supporting 100+ languages
Input: $0.1 per 1M tokens
Output: Contact for pricing
Context window: 512
Max output tokens: Not listed
Compact 7B parameter model for efficient inference
Input: $0.038 per 1M tokens
Output: $0.15 per 1M tokens
Context window: 132K
Max output tokens: Not listed
Open-source multilingual vision-language model
Input: $0.5 per 1M tokens
Output: $1.5 per 1M tokens
Context window: 16.4K
Max output tokens: Not listed
Open-source multilingual vision-language model
Input: $0.5 per 1M tokens
Output: $1.5 per 1M tokens
Context window: 16.4K
Max output tokens: Not listed
Instruction-following model for business applications
Input: $1 per 1M tokens
Output: $2 per 1M tokens
Context window: 4.1K
Max output tokens: Not listed
Semantic reranking model
Input: $2 per 1M tokens
Output: Contact for pricing
Context window: 4.1K
Max output tokens: Not listed
Semantic reranking model
Input: $2 per 1M tokens
Output: Contact for pricing
Context window: 4.1K
Max output tokens: Not listed
Lightweight embedding model
Input: $0.1 per 1M tokens
Output: Contact for pricing
Context window: 512
Max output tokens: Not listed
Lightweight embedding model
Input: $0.1 per 1M tokens
Output: Contact for pricing
Context window: 512
Max output tokens: Not listed
Text embedding model
Input: $0.1 per 1M tokens
Output: Contact for pricing
Context window: 512
Max output tokens: Not listed
Multilingual embedding model
Input: $0.1 per 1M tokens
Output: Contact for pricing
Context window: 512
Max output tokens: Not listed
Multilingual embedding model
Input: $0.1 per 1M tokens
Output: Contact for pricing
Context window: 256
Max output tokens: Not listed
Lightweight embedding model
Input: $0.1 per 1M tokens
Output: Contact for pricing
Context window: Not listed
Max output tokens: Not listed
Text embedding model
Input: $0.1 per 1M tokens
Output: Contact for pricing
Context window: Not listed
Max output tokens: Not listed
Multilingual embedding model
Input: $0.1 per 1M tokens
Output: Contact for pricing
Context window: Not listed
Max output tokens: Not listed
Multilingual embedding model
Input: $0.1 per 1M tokens
Output: Contact for pricing
Context window: Not listed
Max output tokens: Not listed