O1
Previous full o-series reasoning model
Input
$15
per 1M tokens
Output
$60
per 1M tokens
Context
200K
window
Max output
Not listed
tokens
Structured provider page with indexed model coverage, pricing entry points and catalog detail.
Indexed for this provider.
Country metadata when available.
Previous full o-series reasoning model
Input
$15
per 1M tokens
Output
$60
per 1M tokens
Context
200K
window
Max output
Not listed
tokens
A small model alternative to o1
Input
$1.1
per 1M tokens
Output
$4.4
per 1M tokens
Context
200K
window
Max output
Not listed
tokens
Reasoning model for complex tasks, succeeded by GPT-5
Input
$3.5
per 1M tokens
Output
$14
per 1M tokens
Context
200K
window
Max output
Not listed
tokens
A small model alternative to o3
Input
$1.1
per 1M tokens
Output
$4.4
per 1M tokens
Context
200K
window
Max output
Not listed
tokens
OFast, intelligent, flexible GPT model
Input
$4.25
per 1M tokens
Output
$17
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
O miniFast, affordable small model for focused tasks
Input
$0.25
per 1M tokens
Output
$1
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
PreviewDeprecatedPreview of our first o-series reasoning model
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
32.8K
window
Max output
Not listed
tokens
-2024-12-17
Input
$15
per 1M tokens
Output
$60
per 1M tokens
Context
200K
window
Max output
Not listed
tokens
O-2024-05-13
Input
$8.75
per 1M tokens
Output
$26.25
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
O-2024-08-06
Input
$3.75
per 1M tokens
Output
$15
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
O-2024-11-20
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
-turbo-2024-04-09
Input
$10
per 1M tokens
Output
$30
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
NewThe best model for coding and agentic tasks across industries
Input
$3.5
per 1M tokens
Output
$28
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
Structured profile for pricing, context windows, runtime controls, and model capabilities.
Input
$30
per 1M tokens
Output
$60
per 1M tokens
Context
8.2K
window
Max output
Not listed
tokens
-0613
Input
$30
per 1M tokens
Output
$60
per 1M tokens
Context
8.2K
window
Max output
Not listed
tokens
Structured profile for pricing, context windows, runtime controls, and model capabilities.
Input
$10
per 1M tokens
Output
$30
per 1M tokens
Context
8.2K
window
Max output
Not listed
tokens
Structured profile for pricing, context windows, runtime controls, and model capabilities.
Input
$10
per 1M tokens
Output
$30
per 1M tokens
Context
8.2K
window
Max output
Not listed
tokens
An older high-intelligence GPT model
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
Structured profile for pricing, context windows, runtime controls, and model capabilities.
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
200K
window
Max output
Not listed
tokens
OGPT-4o model used in ChatGPT
Input
$5
per 1M tokens
Output
$15
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
O-mini-2024-07-18
Input
$0.3
per 1M tokens
Output
$1.2
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
O mini TTSText-to-speech model powered by GPT-4o mini
Input
Contact
per 1M tokens
Output
$12
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
Version of o1 with more compute for better responses
Input
$150
per 1M tokens
Output
$600
per 1M tokens
Context
100K
window
Max output
Not listed
tokens
-deep-researchOur most powerful deep research model
Input
$10
per 1M tokens
Output
$40
per 1M tokens
Context
100K
window
Max output
Not listed
tokens
Version of o3 with more compute for better responses
Input
$20
per 1M tokens
Output
$80
per 1M tokens
Context
100K
window
Max output
Not listed
tokens
O AudioGPT-4o models capable of audio inputs and outputs
Input
$40
per 1M tokens
Output
$80
per 1M tokens
Context
16.4K
window
Max output
Not listed
tokens
O mini AudioSmaller model capable of audio inputs and outputs
Input
$10
per 1M tokens
Output
$20
per 1M tokens
Context
16.4K
window
Max output
Not listed
tokens
O mini Search PreviewFast, affordable small model for web search
Input
$0.15
per 1M tokens
Output
$0.6
per 1M tokens
Context
16.4K
window
Max output
Not listed
tokens
O Search PreviewGPT model for web search in Chat Completions
Input
$2.5
per 1M tokens
Output
$10
per 1M tokens
Context
16.4K
window
Max output
Not listed
tokens
O mini RealtimeSmaller realtime model for text and audio inputs and outputs
Input
$10
per 1M tokens
Output
$20
per 1M tokens
Context
4.1K
window
Max output
Not listed
tokens
O RealtimeModel capable of realtime text and audio inputs and outputs
Input
$40
per 1M tokens
Output
$80
per 1M tokens
Context
4.1K
window
Max output
Not listed
tokens
O mini TranscribeSpeech-to-text model powered by GPT-4o mini
Input
$3
per 1M tokens
Output
Contact
per 1M tokens
Context
2K
window
Max output
Not listed
tokens
O TranscribeSpeech-to-text model powered by GPT-4o
Input
$6
per 1M tokens
Output
Contact
per 1M tokens
Context
2K
window
Max output
Not listed
tokens
O Transcribe DiarizeTranscription model that identifies who's speaking when
Input
$6
per 1M tokens
Output
Contact
per 1M tokens
Context
2K
window
Max output
Not listed
tokens
O-audio-preview-2024-12-17
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
O-audio-preview-2025-06-03
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
O-mini-audio-preview-2024-12-17
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
O-mini-realtime-preview-2024-12-17
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
O-mini-search-preview-2025-03-11
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
O-mini-transcribe-2025-03-20
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
O-mini-transcribe-2025-12-15
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
O-mini-tts-2025-03-20
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
O-mini-tts-2025-12-15
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
O-realtime-preview-2024-12-17
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
O-realtime-preview-2025-06-03
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
O-search-preview-2025-03-11
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
-2025-03-19
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
Version of GPT-5 that produces smarter and more precise responses
Input
$15
per 1M tokens
Output
$120
per 1M tokens
Context
272K
window
Max output
Not listed
tokens
T powerful open-weight model, fits into an H100 GPU
Input
$0.039
per 1M tokens
Output
$0.19
per 1M tokens
Context
131K
window
Max output
Not listed
tokens
D open-weight model for low latency
Input
$0.03
per 1M tokens
Output
$0.11
per 1M tokens
Context
131K
window
Max output
Not listed
tokens
Replacement for the GPT-3 ada and babbage base models
Input
$0.4
per 1M tokens
Output
$0.4
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
Image model used in ChatGPT.
Input
$8
per 1M tokens
Output
$32
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
Replacement for the GPT-3 curie and davinci base models
Input
$2
per 1M tokens
Output
$2
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
Structured profile for pricing, context windows, runtime controls, and model capabilities.
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
Preview (Deprecated)Deprecated large model.
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
Previous intelligent reasoning model for coding and agentic tasks with configurable reasoning effort
Input
$2.5
per 1M tokens
Output
$20
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
The best model for coding and agentic tasks with configurable reasoning effort
Input
$2.5
per 1M tokens
Output
$20
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
A version of GPT-5.1 optimized for agentic coding in Codex.
Input
$2.5
per 1M tokens
Output
$20
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
-Codex-MaxOur most intelligent coding model optimized for long-horizon, agentic coding tasks.
Input
$2.5
per 1M tokens
Output
$20
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
MiniSmaller, more cost-effective, less-capable version of GPT-5.1-Codex
Input
$0.25
per 1M tokens
Output
$2
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
Version of GPT-5.2 that produces smarter and more precise responses.
Input
$21
per 1M tokens
Output
$168
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
-CodexA version of GPT-5 optimized for agentic coding in Codex
Input
$2.5
per 1M tokens
Output
$20
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
A faster, cost-efficient version of GPT-5 for well-defined tasks
Input
$0.45
per 1M tokens
Output
$3.6
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
Fastest, most cost-efficient version of GPT-5
Input
$0.05
per 1M tokens
Output
$0.4
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
Image 1Our previous image generation model
Input
Contact
per 1M tokens
Output
$0.011
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
Image 1.5State-of-the-art image generation model.
Input
Contact
per 1M tokens
Output
$0.009
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
A cost-efficient version of GPT Image 1
Input
Contact
per 1M tokens
Output
$0.005
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
Potentially harmful content in text and images
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
Generation text-only moderation model
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
Generation text-only moderation model
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
128K
window
Max output
Not listed
tokens
Reasoning model optimized for the Codex CLI
Input
$1.5
per 1M tokens
Output
$6
per 1M tokens
Context
100K
window
Max output
Not listed
tokens
-deep-researchFaster, more affordable deep research model
Input
$2
per 1M tokens
Output
$8
per 1M tokens
Context
100K
window
Max output
Not listed
tokens
Fast, cost-efficient reasoning model, succeeded by GPT-5 mini
Input
$2
per 1M tokens
Output
$8
per 1M tokens
Context
100K
window
Max output
Not listed
tokens
Smartest non-reasoning model
Input
$3.5
per 1M tokens
Output
$14
per 1M tokens
Context
32.8K
window
Max output
Not listed
tokens
Smaller, faster version of GPT-4.1
Input
$0.7
per 1M tokens
Output
$2.8
per 1M tokens
Context
32.8K
window
Max output
Not listed
tokens
Fastest, most cost-efficient version of GPT-4.1
Input
$0.2
per 1M tokens
Output
$0.8
per 1M tokens
Context
32.8K
window
Max output
Not listed
tokens
-turbo-0125
Input
$0.5
per 1M tokens
Output
$1.5
per 1M tokens
Context
16.4K
window
Max output
Not listed
tokens
Structured profile for pricing, context windows, runtime controls, and model capabilities.
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
16.4K
window
Max output
Not listed
tokens
Structured profile for pricing, context windows, runtime controls, and model capabilities.
Input
$1.5
per 1M tokens
Output
$2
per 1M tokens
Context
16.4K
window
Max output
Not listed
tokens
-turbo-1106
Input
$1
per 1M tokens
Output
$2
per 1M tokens
Context
16.4K
window
Max output
Not listed
tokens
GPT-5.1 model used in ChatGPT
Input
$1.25
per 1M tokens
Output
$10
per 1M tokens
Context
16.4K
window
Max output
Not listed
tokens
GPT-5.2 model used in ChatGPT
Input
$1.75
per 1M tokens
Output
$14
per 1M tokens
Context
16.4K
window
Max output
Not listed
tokens
GPT-5 model used in ChatGPT
Input
$1.25
per 1M tokens
Output
$10
per 1M tokens
Context
16.4K
window
Max output
Not listed
tokens
R audio inputs and outputs with Chat Completions API
Input
$32
per 1M tokens
Output
$64
per 1M tokens
Context
16.4K
window
Max output
Not listed
tokens
A cost-efficient version of GPT Audio
Input
$10
per 1M tokens
Output
$20
per 1M tokens
Context
16.4K
window
Max output
Not listed
tokens
An older high-intelligence GPT model
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
8.2K
window
Max output
Not listed
tokens
Capable embedding model
Input
$0.13
per 1M tokens
Output
$0.065
per 1M tokens
Context
8.2K
window
Max output
Not listed
tokens
Embedding model
Input
$0.02
per 1M tokens
Output
$0.01
per 1M tokens
Context
8.2K
window
Max output
Not listed
tokens
Embedding model
Input
$0.1
per 1M tokens
Output
$0.05
per 1M tokens
Context
8.2K
window
Max output
Not listed
tokens
PreviewDeprecatedAn older fast GPT model
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
4.1K
window
Max output
Not listed
tokens
L capable of realtime text and audio inputs and outputs
Input
$32
per 1M tokens
Output
$64
per 1M tokens
Context
4.1K
window
Max output
Not listed
tokens
A cost-efficient version of GPT Realtime
Input
$10
per 1M tokens
Output
$20
per 1M tokens
Context
4.1K
window
Max output
Not listed
tokens
Model optimized for speed
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
4.1K
window
Max output
Not listed
tokens
HDText-to-speech model optimized for quality
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
4.1K
window
Max output
Not listed
tokens
Model for computer use tool
Input
$3
per 1M tokens
Output
$12
per 1M tokens
Context
1K
window
Max output
Not listed
tokens
Our first image generation model
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
Previous generation image generation model
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
-turbo-16k
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
-turbo-instruct
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
-turbo-instruct-0914
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
-2025-04-14
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
-mini-2025-04-14
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
-nano-2025-04-14
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
-2025-11-13
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
-2025-08-07
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
-2025-12-11
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
-codex
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
-pro-2025-12-11
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
-chat-latest
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
-codex
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
Structured profile for pricing, context windows, runtime controls, and model capabilities.
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
-2026-03-05
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
-mini
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
-mini-2026-03-17
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
-nano
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
-nano-2026-03-17
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
-pro
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
-pro-2026-03-05
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
-mini-2025-08-07
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
-nano-2025-08-07
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
-pro-2025-10-06
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
-search-api
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
-search-api-2025-10-14
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
O-1.5
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
O-2025-08-28
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
I-2025-10-06
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
I-2025-12-15
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
Structured profile for pricing, context windows, runtime controls, and model capabilities.
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
E-1.5
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
E-2025-08-28
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
I-2025-10-06
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
I-2025-12-15
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
-2025-01-31
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
Structured profile for pricing, context windows, runtime controls, and model capabilities.
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
2Flagship video generation with synced audio
Input
Contact
per 1M tokens
Output
$0.1
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
Most advanced synced-audio video generation
Input
Contact
per 1M tokens
Output
$0.5
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
Structured profile for pricing, context windows, runtime controls, and model capabilities.
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
Structured profile for pricing, context windows, runtime controls, and model capabilities.
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens
General-purpose speech recognition model
Input
Contact
per 1M tokens
Output
Contact
per 1M tokens
Context
Not listed
window
Max output
Not listed
tokens