Back to models

GPT-4.1

Smartest non-reasoning model

Context window32,768 tokens
Max output tokens32,768 tokens

Pricing

TypeInputOutput
Standard$3.50 / 1M tokens$14.00 / 1M tokens
Cached input$0.88 / 1M tokens

Parameters

ParameterDefaultRange
Max stop sequences0

Supported features

Capabilities

  • Streaming
  • Function Calling
  • Structured Output
  • Json Mode
  • Reasoning
  • System Prompt

Endpoints

  • Chat Completions
  • Batch

Use cases

  • text-chat
  • function-calling
  • reasoning

Model specifications

Model IDgpt-4.1
FamilyGPT-4
Access typeClosed
Input modalitiestext
Output modalitiestext
Last updatedJanuary 3, 2026
GPT-4.1 | ModelHub | ModelHub