OpenAI · model

o4 Mini

$1.10/1M in · $4.40/1M out · 200K context. That's cheaper than 27% of the 298 models we track, by output price. Here's what it costs, how its price has moved, and where it fits.

output price · per 1M tokens
$4.40
input $1.10/1M · list
Context
200K
Input / 1M
$1.10
Output / 1M
$4.40
Modality
text + image + file
Provider
OpenAI
Tokenizer
GPT

Snapshot · source: provider list ↗ · how we measure →

cost at a typical workload
$770 / mo

For 1,500 input + 500 output tokens across 200,000 requests/month. That's 128× the cheapest tracked option (Ling-2.6-flash, $6.00/mo).

Tune the workload in the calculator →

output price history
output / 1M

Flat at $4.40 since tracking began 2026-06-14. Full history →

Where it fits

  • Multimodal prompts — accepts image input alongside text.
  • Quality-first tasks where capability matters more than rate — a first-party flagship rather than a budget pick.

Watch for

    These notes are derived from price, context and modality — structural facts, not measured quality. Measured cost-to-finish-a-task lives on the real-cost index.

    Related models

    See how the leading models compare head-to-head →

    How to read o4 Mini's pricing

    Two numbers decide most of the bill: $1.10/1M for input (everything you send — prompt, context, attachments) and $4.40/1M for output (everything it generates, including hidden reasoning tokens). Output is priced 4.0× the input rate here, so the shape of your workload — how much it reads versus writes — matters as much as the headline figure. This is the provider's first-party list price, reconciled against the routed market each run.

    Frequently asked questions

    How much does o4 Mini cost per million tokens?

    o4 Mini is priced at $1.10 per 1M input tokens and $4.40 per 1M output tokens (first-party list price) as of the 2026-06-15 snapshot. Output is the figure that usually drives the bill. Enter your own token volumes in the cost calculator for a monthly estimate.

    What is o4 Mini's context window?

    o4 Mini has a 200K-token context window — roughly 300 pages of text. Prompt, attachments, conversation and the model's own output all share that budget, and every token you send is billed at the input rate.

    Related