ModelMeta
890 model profiles·26 providers·Refresh cadence hourly
Back to models
OpenAI

gpt-image-2

Active

Context window

N/A

Not published

Max output

4K

4,096 tokens

Input price

$0.0429

Per 1M input tokens

Output price

Free

Per 1M output tokens

Modalities

Text + Image

Input: Text, Image; output: Image

API surface

Image Generation

1 supported endpoint

Overview

Where this model fits best

Use this section to quickly decide whether the model belongs in chat, coding, reasoning, embedding, rerank, vision, audio, or agent workflows.

Use cases

What this model should be considered for

Selection signal
image-gen

Best fit

Use this model when you need a well-documented, structured option inside the registry and want a single place to inspect pricing, capabilities, and operational limits.

Capabilities

What you can actually do with it

Feature flags, API endpoints, and tool support are separated so integration constraints are easy to scan.

Endpoints

APIs and surfaces this model can be called through.

Image Generation

Tools

Execution and orchestration tooling supported by the model.

Image Generation

Pricing

Pricing and billing signals

Known prices are shown per 1M tokens. Missing official prices are marked as not published instead of being treated as free.

Standard API

Input

$0.0429 / 1M

Output

Free

Default request pricing when using the primary endpoint.

Cached input

Input

$1.25 / 1M

Output

Free

Useful when prompt caching changes the economics of repeated requests.

Controls

Runtime knobs worth knowing

Supported request parameters help developers understand sampling, output limits, reasoning controls, and structured output behavior.

Temperature

1

0 to 2

Top P

1

0 to 1

Presence Penalty

0

-2 to 2

Frequency Penalty

0

-2 to 2

Specifications

Technical reference

Canonical identifiers, family, modalities, token limits, training information, and update metadata.

Model ID

gpt-image-2

Use this exact identifier in API calls and SDK configuration.

Provider

OpenAI

Access type

Closed

Input modalities

text, image

Output modalities

image

Max output tokens

4,096