Google · model
Gemini 3.5 Flash
$1.50/1M in · $9.00/1M out · 1.0M context. That's cheaper than 20% of the 298 models we track, by output price. Here's what it costs, how its price has moved, and where it fits.
- Context
- 1.0M
- Input / 1M
- $1.50
- Output / 1M
- $9.00
- Modality
- text + image + audio + video + file
- Provider
- Tokenizer
- Gemini
Snapshot · source: provider list ↗ · how we measure →
For 1,500 input + 500 output tokens across 200,000 requests/month. That's 225× the cheapest tracked option (Ling-2.6-flash, $6.00/mo).
Flat at $9.00 since tracking began 2026-06-14. Full history →
Where it fits
- Long inputs — a 1.0M-token window holds roughly 1,573 pages of text at once.
- Multimodal prompts — accepts image input alongside text (and audio).
- Quality-first tasks where capability matters more than rate — a first-party flagship rather than a budget pick.
Watch for
These notes are derived from price, context and modality — structural facts, not measured quality. Measured cost-to-finish-a-task lives on the real-cost index.
Related models
How to read Gemini 3.5 Flash's pricing
Two numbers decide most of the bill: $1.50/1M for input (everything you send — prompt, context, attachments) and $9.00/1M for output (everything it generates, including hidden reasoning tokens). Output is priced 6.0× the input rate here, so the shape of your workload — how much it reads versus writes — matters as much as the headline figure. This is the provider's first-party list price, reconciled against the routed market each run.
Frequently asked questions
How much does Gemini 3.5 Flash cost per million tokens?
Gemini 3.5 Flash is priced at $1.50 per 1M input tokens and $9.00 per 1M output tokens (first-party list price) as of the 2026-06-15 snapshot. Output is the figure that usually drives the bill. Enter your own token volumes in the cost calculator for a monthly estimate.
What is Gemini 3.5 Flash's context window?
Gemini 3.5 Flash has a 1.0M-token context window — roughly 1,573 pages of text. Prompt, attachments, conversation and the model's own output all share that budget, and every token you send is billed at the input rate.
Related
- API cost calculator — your monthly cost for Gemini 3.5 Flash and every other model.
- Model comparison — the full sortable table.
- Price history — how token prices move over time.
- LLM pricing explained — input, output, cached and reasoning tokens.