Question 1

What is a token?

Accepted Answer

A token is the unit a model reads and bills in — a sub-word chunk averaging about four characters of English. Common words are one token; rare words, code, and non-English text split into more. Billing, context limits, and rate limits are all measured in tokens, not words or characters.

Question 2

Are these counts exact for every model?

Accepted Answer

The o200k and cl100k counts are exact — those are OpenAI's real tokenizers, run in your browser. Anthropic and Google use their own tokenizers that aren't published as client-side libraries, so their token counts differ slightly (usually within ~10%). For exact Claude counts, use Anthropic's count-tokens API. The cost column uses the o200k count as a close cross-model proxy.

Question 3

Why do o200k and cl100k give different numbers?

Accepted Answer

They're different vocabularies. cl100k_base is the tokenizer for GPT-4 and GPT-3.5; o200k_base is the newer, larger vocabulary used by GPT-4o and later. The newer tokenizer is generally more efficient on code and non-English text, so the same text often costs fewer tokens under o200k.

Question 4

Does this send my text anywhere?

Accepted Answer

No. Tokenization runs entirely in your browser — the tokenizer is downloaded on first use and your text never leaves the page.

model	provider	input /1M	cost / 1K sends
Gemini 2.5 Flash list	Google	$0.30	—
DeepSeek V4 Pro list	DeepSeek	$0.43	—
Mistral Large 3 list	Mistral	$0.50	—
GPT-5.4 mini list	OpenAI	$0.75	—
Claude Haiku 4.5 list	Anthropic	$1.00	—
Grok 4.3 list	xAI	$1.25	—
Gemini 3.1 Pro list	Google	$2.00	—
Claude Sonnet 4.6 list	Anthropic	$3.00	—
GPT-5.5 list	OpenAI	$5.00	—
Claude Opus 4.8 list	Anthropic	$5.00	—

LLM token counter

Cost to send this text

Tokens are the unit that matters

Different models, different tokenizers

From tokens to cost

Frequently asked questions

What is a token?

Are these counts exact for every model?

Why do o200k and cl100k give different numbers?

Does this send my text anywhere?

Related