Question 1

Is GLM 4.6 or Gemini 2.5 Flash-Lite cheaper?

Accepted Answer

For a typical workload (1,500 input + 500 output tokens × 200,000 requests/month), Gemini 2.5 Flash-Lite costs $70.00/mo versus $350.00/mo for GLM 4.6, about 5.0× less. Output is priced higher than input for both models, but Gemini 2.5 Flash-Lite's input and output rates are each one-fifth of GLM 4.6's, so at these prices the ranking does not flip with the input/output mix. The dollar gap still scales with your volume; check your own numbers in the calculator.
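The arithmetic behind these monthly figures can be sketched as follows. This is a minimal illustration, not the calculator's actual code: the `PRICES` table and the `monthly_cost` helper are hypothetical names, and the rates are the per-1M-token prices quoted in this comparison.

```python
# Prices in USD per 1M tokens, as quoted in this comparison.
PRICES = {
    "GLM 4.6": {"input": 0.50, "output": 2.00},
    "Gemini 2.5 Flash-Lite": {"input": 0.10, "output": 0.40},
}


def monthly_cost(model, input_tokens, output_tokens, requests):
    """Return the monthly cost in USD for a fixed per-request token mix."""
    rate = PRICES[model]
    per_request = (
        input_tokens * rate["input"] + output_tokens * rate["output"]
    ) / 1_000_000
    return per_request * requests


# Typical workload from the answer: 1,500 in + 500 out, 200,000 requests/month.
for name in PRICES:
    print(f"{name}: ${monthly_cost(name, 1_500, 500, 200_000):.2f}/mo")
```

Swapping in your own token counts and request volume shows how the totals move for a different mix.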

Question 2

What's the main difference between GLM 4.6 and Gemini 2.5 Flash-Lite?

Accepted Answer

On price, GLM 4.6 is $0.50/$2.00 per 1M tokens (input/output) and Gemini 2.5 Flash-Lite is $0.10/$0.40. By technical class (size & context) it's Mid (GLM 4.6) versus Mini (Gemini 2.5 Flash-Lite). Gemini 2.5 Flash-Lite has the larger context window at 1.0M tokens, versus 205K for GLM 4.6. Only Gemini 2.5 Flash-Lite accepts image input (along with audio and video); GLM 4.6 is text only.

Question 3

Why is Gemini 2.5 Flash-Lite so much cheaper than GLM 4.6?

Accepted Answer

Gemini 2.5 Flash-Lite has a much lower per-token rate ($0.40 versus $2.00 per 1M output tokens), and it's in a smaller technical class (Mini vs Mid). The headline rate isn't the whole story, though: a verbose model can cost more to finish a task than its rate implies, which is what measured cost-per-task captures.
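To make the verbosity point concrete, here is a rough sketch of a cost-per-task calculation. It assumes GLM 4.6 finishes a task with the 1,500 input and 500 output tokens of the typical workload, then asks how many output tokens Gemini 2.5 Flash-Lite would have to write on the same prompt before it cost the same. The function names are hypothetical, and real cost-per-task would come from measured token counts rather than this assumption.

```python
def cost_per_task(input_tokens, output_tokens, in_rate, out_rate):
    """Cost in USD of one task, with rates given in USD per 1M tokens."""
    return (input_tokens * in_rate + output_tokens * out_rate) / 1_000_000


def break_even_output_tokens(input_tokens, reference_cost, in_rate, out_rate):
    """Output tokens at which a model's cost per task equals reference_cost."""
    return (reference_cost * 1_000_000 - input_tokens * in_rate) / out_rate


# Assumed GLM 4.6 task: 1,500 input + 500 output tokens at $0.50/$2.00.
glm_cost = cost_per_task(1_500, 500, 0.50, 2.00)

# Output tokens Gemini 2.5 Flash-Lite ($0.10/$0.40) could emit for the same cost.
flash_lite_break_even = break_even_output_tokens(1_500, glm_cost, 0.10, 0.40)

print(f"GLM 4.6 cost per task: ${glm_cost:.5f}")
print(f"Flash-Lite break-even output: {flash_lite_break_even:.0f} tokens")
```

Under these assumptions the cheaper model would need to be about eight times as verbose before the per-task costs met, which is why the listed rate alone can mislead in either direction.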

Question 4

Which handles longer prompts, GLM 4.6 or Gemini 2.5 Flash-Lite?

Accepted Answer

Gemini 2.5 Flash-Lite. Its 1.0M-token context window (~1,573 pages of text) is roughly 5× larger than GLM 4.6's 205K.
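The page does not say how it converts tokens to pages. As a hedged sketch, the constants below are assumptions chosen because they reproduce the ~1,573 figure: about 0.75 words per token, 500 words per page, and "1.0M" read as 1,048,576 tokens. Treat all three as illustrative.

```python
WORDS_PER_TOKEN = 0.75  # assumption: rough average for English text
WORDS_PER_PAGE = 500    # assumption: one dense page of prose


def tokens_to_pages(tokens):
    """Estimate how many pages of text fit in a given number of tokens."""
    return tokens * WORDS_PER_TOKEN / WORDS_PER_PAGE


flash_lite_pages = tokens_to_pages(1_048_576)  # assumes "1.0M" means 2**20 tokens
glm_pages = tokens_to_pages(205_000)           # "205K" taken at face value

print(f"Gemini 2.5 Flash-Lite: ~{flash_lite_pages:,.0f} pages")
print(f"GLM 4.6: ~{glm_pages:,.0f} pages")
print(f"Ratio: ~{flash_lite_pages / glm_pages:.1f}x")
```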

Metric	GLM 4.6	Gemini 2.5 Flash-Lite
Input / 1M	$0.50	$0.10
Output / 1M	$2.00	$0.40
Context	205K	1.0M
Technical class	Mid	Mini
Cost @ typical workload	$350.00/mo	$70.00/mo
Modality	Text only	Text + image + audio + video
Price source	routed	list
Provider	Z.AI	Google

GLM 4.6 vs Gemini 2.5 Flash-Lite
