Inflection · model

Inflection 3 Productivity

$2.50/1M in · $10.00/1M out · 8K context. That's cheaper than 15% of the 298 models we track, by output price. Here's what it costs, how its price has moved, and where it fits.

output price · per 1M tokens
$10.00
input $2.50/1M · routed
Context
8K
Input / 1M
$2.50
Output / 1M
$10.00
Modality
Text only
Provider
Inflection
Tokenizer
Other

Snapshot · source: OpenRouter ↗ · how we measure →

cost at a typical workload
$1,750 / mo

For 1,500 input + 500 output tokens across 200,000 requests/month. That's 292× the cheapest tracked option (Ling-2.6-flash, $6.00/mo).

Tune the workload in the calculator →

output price history
output / 1M

Flat at $10.00 since tracking began 2026-06-14. Full history →

Where it fits

  • A mid-range option — $2.50 in / $10.00 out per 1M with a 8K-token context.

Watch for

  • Small 8K-token context — not for long documents or large codebases.
  • Output at $10.00/1M is among the priciest tracked — output-heavy or reasoning-heavy workloads add up fast.
  • Priced from OpenRouter's routed market, not a first-party list price — availability and rate can shift without notice.

These notes are derived from price, context and modality — structural facts, not measured quality. Measured cost-to-finish-a-task lives on the real-cost index.

Related models

See how the leading models compare head-to-head →

How to read Inflection 3 Productivity's pricing

Two numbers decide most of the bill: $2.50/1M for input (everything you send — prompt, context, attachments) and $10.00/1M for output (everything it generates, including hidden reasoning tokens). Output is priced 4.0× the input rate here, so the shape of your workload — how much it reads versus writes — matters as much as the headline figure. This row is OpenRouter's routed market price; it can move as providers and routing change.

Frequently asked questions

How much does Inflection 3 Productivity cost per million tokens?

Inflection 3 Productivity is priced at $2.50 per 1M input tokens and $10.00 per 1M output tokens, from OpenRouter's routed market price as of the 2026-06-15 snapshot. Output is the figure that usually drives the bill. Enter your own token volumes in the cost calculator for a monthly estimate.

What is Inflection 3 Productivity's context window?

Inflection 3 Productivity has a 8K-token context window — roughly 12 pages of text. Prompt, attachments, conversation and the model's own output all share that budget, and every token you send is billed at the input rate.

Related