Google · model
Gemini 3.1 Flash Lite
$0.25/1M in · $1.50/1M out · 1.0M context. That's cheaper than 49% of the 298 models we track, by output price. Here's what it costs, how its price has moved, and where it fits.
- Context
- 1.0M
- Input / 1M
- $0.25
- Output / 1M
- $1.50
- Modality
- text + image + audio + video + file
- Provider
- Tokenizer
- Gemini
Snapshot · source: provider list ↗ · how we measure →
For 1,500 input + 500 output tokens across 200,000 requests/month. That's 38× the cheapest tracked option (Ling-2.6-flash, $6.00/mo).
Flat at $1.50 since tracking began 2026-06-14. Full history →
Where it fits
- Long inputs — a 1.0M-token window holds roughly 1,573 pages of text at once.
- Multimodal prompts — accepts image input alongside text (and audio).
Watch for
These notes are derived from price, context and modality — structural facts, not measured quality. Measured cost-to-finish-a-task lives on the real-cost index.
Related models
How to read Gemini 3.1 Flash Lite's pricing
Two numbers decide most of the bill: $0.25/1M for input (everything you send — prompt, context, attachments) and $1.50/1M for output (everything it generates, including hidden reasoning tokens). Output is priced 6.0× the input rate here, so the shape of your workload — how much it reads versus writes — matters as much as the headline figure. This is the provider's first-party list price, reconciled against the routed market each run.
Frequently asked questions
How much does Gemini 3.1 Flash Lite cost per million tokens?
Gemini 3.1 Flash Lite is priced at $0.25 per 1M input tokens and $1.50 per 1M output tokens (first-party list price) as of the 2026-06-15 snapshot. Output is the figure that usually drives the bill. Enter your own token volumes in the cost calculator for a monthly estimate.
What is Gemini 3.1 Flash Lite's context window?
Gemini 3.1 Flash Lite has a 1.0M-token context window — roughly 1,573 pages of text. Prompt, attachments, conversation and the model's own output all share that budget, and every token you send is billed at the input rate.
Related
- API cost calculator — your monthly cost for Gemini 3.1 Flash Lite and every other model.
- Model comparison — the full sortable table.
- Price history — how token prices move over time.
- LLM pricing explained — input, output, cached and reasoning tokens.