DeepSeek V4 Flash

$0.14/1M in · $0.28/1M out · 1.0M context. That's cheaper than 86% of the 298 models we track, by output price. Here's what it costs, how its price has moved, and where it fits.

Output price: $0.28 per 1M tokens
Input price: $0.14 per 1M tokens (first-party)
Context: 1.0M tokens
Modality: Text only
Provider: DeepSeek
Tokenizer: DeepSeek

Snapshot source: provider list.

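Cost at a typical workload: $70.00/mo

That assumes 1,500 input + 500 output tokens per request across 200,000 requests/month: 300M input tokens at $0.14/1M ($42.00) plus 100M output tokens at $0.28/1M ($28.00). That's about 12× the cheapest tracked option (Ling-2.6-flash, $6.00/mo).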
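
The arithmetic behind that monthly figure can be sketched in a few lines of Python. This is a minimal illustration, not the site's calculator: the function name `monthly_cost` and its parameters are hypothetical, and the two prices are the list rates quoted on this page.

```python
# Hypothetical helper; prices are the list rates quoted on this page (USD per 1M tokens).
INPUT_PRICE_PER_M = 0.14
OUTPUT_PRICE_PER_M = 0.28


def monthly_cost(input_tokens_per_request, output_tokens_per_request, requests_per_month):
    """Return the estimated monthly bill in USD, rounded to cents."""
    input_millions = input_tokens_per_request * requests_per_month / 1_000_000
    output_millions = output_tokens_per_request * requests_per_month / 1_000_000
    total = input_millions * INPUT_PRICE_PER_M + output_millions * OUTPUT_PRICE_PER_M
    return round(total, 2)


# The typical workload: 1,500 input + 500 output tokens per request, 200,000 requests/month.
print(monthly_cost(1_500, 500, 200_000))  # 70.0
```

Changing any of the three arguments shows how the estimate scales with request size and volume.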

Output price history (per 1M tokens)

Flat at $0.28 since tracking began 2026-06-14.

Where it fits

  • High-volume, cost-sensitive work — at $0.28/1M output it sits in the cheapest quarter of tracked models.
  • Long inputs — a 1.0M-token window holds roughly 1,573 pages of text at once.

Watch for

  • OpenRouter's routed price for this model is below DeepSeek's own list price; we show the first-party list rate per our reconciliation rule.
  • Text-only — no image, audio, or file input.

These notes are derived from price, context and modality — structural facts, not measured quality. Measured cost-to-finish-a-task lives on the real-cost index.

How to read DeepSeek V4 Flash's pricing

Two numbers decide most of the bill: $0.14/1M for input (everything you send: prompt, context, and any documents pasted in as text) and $0.28/1M for output (everything it generates, including hidden reasoning tokens). Output is priced at 2.0× the input rate, so the shape of your workload, meaning how much the model reads versus writes, matters as much as the headline figure. These are the provider's first-party list prices, reconciled against the routed market each run.
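
To see how the read/write mix shifts the bill, here is a small Python sketch that splits one request's cost into its input and output shares. The helper name `cost_shares` and the three sample workloads are illustrative assumptions; only the two prices come from this page.

```python
# Illustrative sketch; prices are the list rates quoted on this page (USD per 1M tokens).
INPUT_PRICE_PER_M = 0.14
OUTPUT_PRICE_PER_M = 0.28


def cost_shares(input_tokens, output_tokens):
    """Return (input_share, output_share) of one request's cost, each between 0 and 1."""
    input_cost = input_tokens * INPUT_PRICE_PER_M
    output_cost = output_tokens * OUTPUT_PRICE_PER_M
    total = input_cost + output_cost
    if total == 0:
        raise ValueError("request has no tokens to bill")
    return input_cost / total, output_cost / total


# Hypothetical workload shapes, as (label, input tokens, output tokens).
for label, tokens_in, tokens_out in [
    ("read-heavy", 1_500, 500),
    ("balanced", 1_000, 1_000),
    ("write-heavy", 500, 1_500),
]:
    input_share, output_share = cost_shares(tokens_in, tokens_out)
    print(f"{label}: input {input_share:.0%}, output {output_share:.0%}")
```

Because output costs twice as much per token, the two shares are equal only when a request reads about twice as many tokens as it writes.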

Frequently asked questions

How much does DeepSeek V4 Flash cost per million tokens?

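DeepSeek V4 Flash is priced at $0.14 per 1M input tokens and $0.28 per 1M output tokens (first-party list price) as of the 2026-06-15 snapshot. Output is the higher rate, but input can still dominate the bill: at the typical workload of 1,500 input + 500 output tokens per request, input accounts for $42.00 of the $70.00 monthly total. Enter your own token volumes in the cost calculator for a monthly estimate.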

What is DeepSeek V4 Flash's context window?

DeepSeek V4 Flash has a 1.0M-token context window, roughly 1,573 pages of text. Prompt, pasted documents, conversation history and the model's own output all share that budget, and every token you send is billed at the input rate.
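
Since input and output share one window, a quick budget check before sending a request can be sketched as follows. This is a rough illustration under stated assumptions: "1.0M" is treated as exactly 1,000,000 tokens (the provider's exact limit may differ), the pages figure is this page's own estimate, and the function names are hypothetical.

```python
# Assumption: "1.0M" is treated as exactly 1,000,000 tokens; the exact limit may differ.
CONTEXT_WINDOW = 1_000_000
# This page's own estimate of how many pages of text fill the window.
PAGES_PER_WINDOW = 1_573


def fits_in_context(prompt_tokens, conversation_tokens, max_output_tokens):
    """True if the input plus the reserved output stays within the shared window."""
    return prompt_tokens + conversation_tokens + max_output_tokens <= CONTEXT_WINDOW


def max_prompt_pages(conversation_tokens, max_output_tokens):
    """Rough number of pages that still fit after reserving room for history and output."""
    remaining = CONTEXT_WINDOW - conversation_tokens - max_output_tokens
    # Integer arithmetic avoids floating-point rounding at the boundaries.
    return max(remaining * PAGES_PER_WINDOW // CONTEXT_WINDOW, 0)


print(fits_in_context(900_000, 50_000, 50_000))  # True: exactly at the limit
print(fits_in_context(900_000, 80_000, 50_000))  # False: 30,000 tokens over
```

Token counts depend on the tokenizer, so the page estimate is only a planning aid; an actual token count is the safer input to `fits_in_context`.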
