IBM Granite · model
Granite 4.0 Micro
$0.017/1M in · $0.11/1M out · 131K context. That's cheaper than 96% of the 298 models we track, by output price. Here's what it costs, how its price has moved, and where it fits.
- Context: 131K
- Input / 1M: $0.017
- Output / 1M: $0.11
- Modality: Text only
- Provider: IBM Granite
- Tokenizer: Other
Snapshot · source: OpenRouter
Estimated monthly cost: about $16.10 at the listed rates, for 1,500 input + 500 output tokens per request across 200,000 requests/month. That's 2.7× the cheapest tracked option (Ling-2.6-flash, $6.00/mo).
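The monthly estimate for this workload (1,500 input + 500 output tokens per request, 200,000 requests/month) can be worked out from the listed rates. The sketch below assumes the rounded per-million prices shown on this page and a fixed token profile per request; the function name `monthly_cost` is illustrative, not part of any API.

```python
# Rates copied from this page (USD per 1M tokens). They are rounded display
# values, so a bill computed from unrounded rates may differ by a few cents.
INPUT_USD_PER_M = 0.017
OUTPUT_USD_PER_M = 0.11


def monthly_cost(input_tokens: int, output_tokens: int, requests: int) -> float:
    """Estimated monthly spend in USD for a fixed per-request token profile."""
    input_cost = input_tokens * requests / 1_000_000 * INPUT_USD_PER_M
    output_cost = output_tokens * requests / 1_000_000 * OUTPUT_USD_PER_M
    return input_cost + output_cost


cost = monthly_cost(input_tokens=1_500, output_tokens=500, requests=200_000)
print(f"${cost:.2f}/mo")  # prints $16.10/mo
```

Of that total, about $5.10 comes from input and $11.00 from output, so output accounts for roughly two thirds of the bill even though it is only a quarter of the tokens.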
Output price has been flat at $0.11/1M since tracking began on 2026-06-14.
Where it fits
- High-volume, cost-sensitive work — at $0.11/1M output it sits in the cheapest quarter of tracked models.
- Input-heavy jobs like summarization and RAG — input is cheap at $0.017/1M, and these read far more than they write.
Watch for
- Priced from OpenRouter's routed market, not a first-party list price — availability and rate can shift without notice.
- Text-only — no image, audio, or file input.
These notes are derived from price, context and modality — structural facts, not measured quality. Measured cost-to-finish-a-task lives on the real-cost index.
How to read Granite 4.0 Micro's pricing
Two numbers decide most of the bill: $0.017/1M for input (everything you send, including the prompt and any context) and $0.11/1M for output (everything it generates, including any hidden reasoning tokens). At the listed figures, output is priced at roughly 6.5× the input rate, so the shape of your workload (how much it reads versus writes) matters as much as the headline figure. This row is OpenRouter's routed market price; it can move as providers and routing change.
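To see how the read/write mix changes the bill, the sketch below computes what fraction of a request's cost comes from output tokens. The three workload shapes are hypothetical examples chosen for illustration, not measurements, and the rates are the rounded values listed on this page.

```python
# Rates copied from this page (USD per 1M tokens).
INPUT_USD_PER_M = 0.017
OUTPUT_USD_PER_M = 0.11


def output_share(input_tokens: int, output_tokens: int) -> float:
    """Fraction of a request's cost that comes from output tokens."""
    input_cost = input_tokens * INPUT_USD_PER_M
    output_cost = output_tokens * OUTPUT_USD_PER_M
    return output_cost / (input_cost + output_cost)


# Hypothetical workload shapes: (input tokens, output tokens) per request.
workloads = {
    "summarization": (8_000, 400),
    "chat": (1_500, 500),
    "generation": (300, 2_000),
}

for name, (inp, out) in workloads.items():
    print(f"{name}: {output_share(inp, out):.0%} of cost is output")
```

Under these assumed shapes, an input-heavy summarization job pays mostly for input, while a generation-heavy job is billed almost entirely at the output rate.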
Frequently asked questions
How much does Granite 4.0 Micro cost per million tokens?
Granite 4.0 Micro costs $0.017 per 1M input tokens and $0.11 per 1M output tokens, based on OpenRouter's routed market price as of the 2026-06-15 snapshot. Output is the figure that usually drives the bill. Enter your own token volumes in the cost calculator for a monthly estimate.
What is Granite 4.0 Micro's context window?
Granite 4.0 Micro has a 131K-token context window, roughly 197 pages of text. The prompt, conversation history and the model's own output all share that budget, and every token you send is billed at the input rate.
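Because input and output share one window, a request only fits if the prompt plus the reserved output stays within the limit. The sketch below is a minimal budget check; it assumes the rounded 131K figure from this page (the exact limit may differ slightly), and both helper names are illustrative.

```python
# Rounded context size from this page; treat it as an approximation.
CONTEXT_WINDOW = 131_000


def fits_in_context(prompt_tokens: int, max_output_tokens: int,
                    window: int = CONTEXT_WINDOW) -> bool:
    """True if the prompt plus the reserved output fits in the window."""
    return prompt_tokens + max_output_tokens <= window


def max_output_budget(prompt_tokens: int, window: int = CONTEXT_WINDOW) -> int:
    """Tokens left for the model's output after the prompt is counted."""
    return max(0, window - prompt_tokens)


print(fits_in_context(120_000, 8_000))   # prints True
print(fits_in_context(130_000, 2_000))   # prints False
print(max_output_budget(120_000))        # prints 11000
```

Token counts here would need to come from the model's own tokenizer; counts from a different tokenizer are only an estimate.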
Related
- API cost calculator — your monthly cost for Granite 4.0 Micro and every other model.
- Model comparison — the full sortable table.
- Price history — how token prices move over time.
- LLM pricing explained — input, output, cached and reasoning tokens.