Z.AI · model

GLM 4.6

$0.50/1M in · $2.00/1M out · 205K context. That's cheaper than 48% of the 341 models we track, by output price. Here's what it costs, how its price has moved, and where it fits.

output price · per 1M tokens

$2.00

input $0.50/1M · routed

Context: 205K
Input / 1M: $0.50
Output / 1M: $2.00
Modality: Text only
Provider: Z.AI
Tokenizer: Other

Snapshot 2026-07-29 · source: OpenRouter ↗ · how we measure →

cost at a typical workload

$350 / mo

For 1,500 input + 500 output tokens across 200,000 requests/month. That's 58× the cheapest tracked option (Ling-2.6-flash, $6.00/mo).

Tune the workload in the calculator →

output price history

output / 1M

2026-06-29 → 2026-07-29. Full history →

Where it fits

A mid-range option — $0.50 in / $2.00 out per 1M with a 205K-token context.

Watch for

Priced from OpenRouter's routed market, not a first-party list price — availability and rate can shift without notice.
Text-only — no image, audio, or file input.

These notes are derived from price, context and modality — structural facts, not measured quality. Measured cost-to-finish-a-task lives on the real-cost index.

Compare GLM 4.6

GLM 4.6 vs GPT-5.4 mini GLM 4.6 vs GPT-5.4 nano GLM 4.6 vs Claude Haiku 4.5 GLM 4.6 vs Gemini 2.5 Pro GLM 4.6 vs Gemini 2.5 Flash GLM 4.6 vs Gemini 2.5 Flash-Lite GLM 4.6 vs Grok 4.3 GLM 4.6 vs Grok 4.20 GLM 4.6 vs DeepSeek V4 Pro GLM 4.6 vs Mistral Medium 3.5 GLM 4.6 vs Mistral Large 3 GLM 4.6 vs Llama 4 Maverick GLM 4.6 vs Qwen3 Max GLM 4.6 vs Kimi K2 Thinking GLM 4.6 vs MiniMax M2.5

How to read GLM 4.6's pricing

Two numbers decide most of the bill: $0.50/1M for input (everything you send — prompt, context, attachments) and $2.00/1M for output (everything it generates, including hidden reasoning tokens). Output is priced 4.0× the input rate here, so the shape of your workload — how much it reads versus writes — matters as much as the headline figure. This row is OpenRouter's routed market price; it can move as providers and routing change.

Frequently asked questions

How much does GLM 4.6 cost per million tokens?

GLM 4.6 is priced at $0.50 per 1M input tokens and $2.00 per 1M output tokens, from OpenRouter's routed market price as of the 2026-07-29 snapshot. Output is the figure that usually drives the bill. Enter your own token volumes in the cost calculator for a monthly estimate.

What is GLM 4.6's context window?

GLM 4.6 has a 205K-token context window — roughly 307 pages of text. Prompt, attachments, conversation and the model's own output all share that budget, and every token you send is billed at the input rate.

API cost calculator — your monthly cost for GLM 4.6 and every other model.
Model comparison — the full sortable table.
Price history — how token prices move over time.
LLM pricing explained — input, output, cached and reasoning tokens.