LLM Releases

Cost per million tokens

All reports

LLM API pricing comparison

Models with published API pricing, sorted cheapest-first by output cost. Input and output rates are shown separately because they bill differently.

Headline “price” for an LLM is really two numbers: the cost per million input tokens and per million outputtokens. Output is typically the pricier of the two, so the table is sorted by output cost — the figure that tends to dominate generation-heavy and agentic workloads.

Only models with a published price appear below. Open-weight models you run yourself are excluded unless their lab also lists a hosted API rate.

Prices are list rates per 1M tokens in USD and can change without notice; always confirm against the provider before budgeting.

ModelLabAccessInput $/MtokOutput $/MtokContextReleasedSource
Grok 4.3xAIProprietary$1.25$2.51MMay 6, 2026source
ERNIE 5.1BaiduProprietary$0.59$2.65128KMay 8, 2026source
Kimi K2.7 CodeMoonshotModified MIT$0.95$4262KJun 18, 2026source
Qwen3.7-PlusQwenProprietary$2.5$7.51MJun 2, 2026source
Qwen3.7-MaxQwenProprietary$2.5$7.51MMay 19, 2026source
Claude Opus 4.8AnthropicProprietary$15$75500KMay 28, 2026source
Claude Opus 4.6AnthropicProprietary$15$75200KFeb 5, 2026source

Frequently asked questions

Why are input and output prices listed separately?

Providers bill input (prompt) and output (generated) tokens at different rates — output is usually several times more expensive. Your real cost depends on the ratio of the two in your workload, so a model that looks cheap on input can be expensive for long generations, and vice-versa.

How do I estimate my actual cost?

Multiply your expected input tokens by the input rate and your expected output tokens by the output rate, both per million, then add them. For chat and retrieval workloads input usually dominates; for drafting and agentic generation, output dominates.

Are open-weight models free?

Self-hosting open-weight models has no per-token licence fee, but you still pay for the hardware or a hosting provider's API. The prices here are list API prices for the model's primary hosted endpoint, where one is published.

Related