
Compare LLMs head-to-head

156 side-by-side comparisons across 24 leading models — token pricing, context window, modality and the real monthly cost for a typical workload. Build a custom comparison below, pick a ready-made matchup, or start from a model.

Build your own comparison

Plot 2–5 models by cost against technical class at a typical workload. Snapshot 2026-07-29.

Positioned by published specs — size, context and modality — not measured performance; a smaller model can sometimes outperform a larger one on your task.

Chart axes: technical class (size and context) on the vertical axis, cost on the horizontal axis.

Selected models compared by price, context, technical class and monthly cost.
model	in / 1M	out / 1M	context	modality	class	cost / mo

Models

Flagship matchups

Frontier models — quality-first, output ≥ $15/1M.

Mid-range matchups

The workhorse tier: output from $2 to under $15/1M.

Budget matchups

High-volume, cost-sensitive — output under $2/1M.
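
The three tiers are defined purely by output price per 1M tokens, so the bucketing can be sketched in a few lines. This is a minimal illustration, not the site's actual code: the function name `price_tier` and the tier labels it returns are hypothetical, and the boundaries follow the thresholds stated above (an output price of exactly $15 counts as flagship, exactly $2 as mid-range).

```python
def price_tier(output_usd_per_1m: float) -> str:
    """Bucket a model by its output price in USD per 1M tokens.

    Hypothetical helper: thresholds mirror the tier descriptions on this page.
    """
    if output_usd_per_1m < 0:
        raise ValueError("price must be non-negative")
    if output_usd_per_1m >= 15:
        return "flagship"   # output >= $15/1M
    if output_usd_per_1m >= 2:
        return "mid-range"  # output from $2 to under $15/1M
    return "budget"         # output under $2/1M
```

Because only the output price is used, a model with cheap input but expensive output can still land in a higher tier than its blended cost would suggest.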

How these comparisons work

Every comparison is generated from the same sourced catalog the rest of the site uses, with no hand-picked numbers. Each page shows input and output token prices, the context window, accepted modalities, the price source, and the modelled monthly cost at a typical workload, with the cheaper option marked. Prices are either list prices or routed market prices, each recorded with its capture date. The ranking can shift with your own token mix, which is why every page links to the cost calculator.
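
To see why the cheaper option depends on token mix, here is a rough sketch of a monthly cost calculation. It assumes cost is simply tokens times the per-1M price for input and output, summed; the site's own model may include other factors. The `Model` class, the function names, and both example models with their prices are invented for illustration and do not come from the catalog.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    name: str
    input_usd_per_1m: float   # USD per 1M input tokens
    output_usd_per_1m: float  # USD per 1M output tokens


def monthly_cost(model: Model, input_tokens: int, output_tokens: int) -> float:
    """Assumed formula: tokens x price per 1M, summed over input and output."""
    total = (input_tokens * model.input_usd_per_1m
             + output_tokens * model.output_usd_per_1m)
    return total / 1_000_000


def cheaper(a: Model, b: Model, input_tokens: int, output_tokens: int) -> Model:
    """Return whichever model costs less for the given monthly token mix."""
    return min((a, b), key=lambda m: monthly_cost(m, input_tokens, output_tokens))


# Hypothetical models with made-up prices, not real catalog entries.
model_a = Model("model-a", input_usd_per_1m=1.00, output_usd_per_1m=10.00)
model_b = Model("model-b", input_usd_per_1m=3.00, output_usd_per_1m=5.00)

# Input-heavy month: 100M input tokens, 10M output tokens.
input_heavy = cheaper(model_a, model_b, 100_000_000, 10_000_000)

# Output-heavy month: 10M input tokens, 100M output tokens.
output_heavy = cheaper(model_a, model_b, 10_000_000, 100_000_000)

print(input_heavy.name, output_heavy.name)
```

With these made-up prices, the model with cheaper input wins the input-heavy month and the model with cheaper output wins the output-heavy month, so the same pair can rank either way depending on the workload.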

Model comparison table — all tracked models, sortable and filterable.
Real cost-per-task — measured cost to finish a job, not just list price.
Price history — how these prices move over time.