Is LLM Pricing Converging?

The LMC Price Convergence Index

Last updated: just now

A robust dispersion metric measuring how tightly priced the AI model market is around its own median, in log10 dollar space, across four quality tiers. Computed weekly using biweight midvariance with 80% BCa bootstrap confidence intervals.

Frontier (rolling top 10)

The median output price of the world's top 10 coding models rose from $10.3 to $20.8 per million tokens over 13 weeks - a 101.0% increase.

That is a compound weekly rate of +5.5%. Over the same window, the robust PCI compressed from 0.969 to 0.576 (40.6% tighter).

Commodity (< $50/M)

The commodity tier median rose from $0.78 to $1.12 (44.9% increase).

The commodity tier includes every paid coding model with output price below $50 per million - the surface that cloud buyers actually feel. Current sample size n=284.

Weeks observed

Frontier PCI

0.576

CI80 0.552-0.792

Commodity PCI

0.689

n=284

Frontier slope

-0.0315

per week log(PCI)

Rolling top-10 frontier

PCI biweight midvariance

LMMarketCap.com

Composition-fixed top-10

same slugs throughout

LMMarketCap.com

Commodity tier

paid coding, < $50/M out

LMMarketCap.com

Weekly summary across all four tiers

Robust PCI = biweight midvariance of log10 blended price

Week	Frontier PCI	CI80	Frontier $/M	Comp-fixed PCI	Mid-tier PCI	Commodity PCI	Commodity $/M
02-16	0.969	0.863-1.159	$10.3	0.071	0.727	0.739	$0.78
02-23	0.504	0.063-1.049	$9.66	0.071	0.741	0.739	$0.77
03-02	0.957	0.843-1.135	$9.66	0.071	0.671	0.738	$0.79
03-09	0.824	0.609-1.041	$7.56	0.071	0.829	0.738	$0.80
03-16	0.773	0.592-1.028	$4.80	0.071	0.727	0.723	$0.89
03-23	0.773	0.590-1.034	$4.80	0.071	0.826	0.714	$0.93
03-30	0.773	0.578-1.005	$4.80	0.071	0.695	0.696	$0.93
04-06	0.644	0.612-0.784	$15.1	0.071	0.231	0.688	$0.93
04-13	0.644	0.610-0.783	$15.1	0.143	0.231	0.683	$0.95
04-20	0.587	0.499-0.757	$20.8	0.548	0.313	0.685	$0.99
04-27	0.665	0.590-0.855	$15.1	0.548	0.231	0.683	$1.12
05-04	0.562	0.533-0.741	$20.8	0.548	0.397	0.689	$1.12
05-11	0.543	0.383-0.741	$68.3	0.576	0.434	0.687	$1.11
05-18	0.576	0.552-0.792	$20.8	0.576	0.538	0.689	$1.12

Who is moving the frontier variance?

Leave-one-provider-out (LOPO): the share of variance that disappears when this provider's frontier models are excluded. A negative share means the provider currently lowers variance (their prices are central).

OpenAI53.4%

Google22.2%

Anthropic-27.2%

Where the PCI cannot help you

·We will not predict a long-run price floor from 14 observations. The data simply does not identify it. We will publish a data-driven asymptote when we have at least 20 weekly snapshots.
·PCI is a dispersion measure, not a price level. Two quarters can both report PCI of 0.3 with median prices 20x apart. Read the median price column and PCI together, never separately.
·PCI does not measure whether providers are racing each other to the bottom. It measures dispersion around the moving median. Everyone can drop their price by 30 percent in lockstep and the PCI will not budge.
·The prices feeding this index are public list prices for blended input/output token pairs. Enterprise contracts, reserved capacity, and prompt-cache pricing all live below this surface and cannot be measured externally.
·Image and video generation models price per asset (per image, per second of video) and are excluded from every tier on this page. Their dispersion lives in a separate metric we do not publish yet.

Full methodology

Formal definition, tier specification, why log-space dispersion, estimator choice, a worked example with last week's actual numbers, what PCI is not, data lineage, changelog, and references.

Read

Machine-readable JSON

The complete time series for all four tiers, including robust PCI, raw stdev, IQR(log10), median price, BCa 80% CIs, OLS slope, and per-provider LOPO contributions.

/api/pci/series

Pricing history Best-value AI models Cost trends Price changes Weekly reports Scoring methodology

Frequently Asked Questions

PCI measures how tightly priced a quality tier of the AI model market is around its own median, in log10 dollar space. The headline value is the biweight midvariance of log blended prices for the 10 models in the rolling frontier tier. A PCI near 0 means prices are clustered into a narrow band; a PCI near 1 means prices span a full order of magnitude. We work in log space because token pricing varies across roughly four orders of magnitude (sub-cent per million to several hundred dollars per million), which makes any linear measure useless.

Sample standard deviation has 100 percent efficiency at the Gaussian but 0 percent breakdown - one extreme outlier completely controls the value. With only 10 models in the frontier tier, that is a fatal property for a metric we want to publish. Biweight midvariance retains 87 percent Gaussian efficiency while tolerating up to 50 percent contamination, so a single mispriced or experimentally-priced flagship cannot move the headline. We still publish the sample stdev alongside as the raw measure for readers who prefer the textbook formula.

The trend slope is estimated by OLS regression of log(PCI) on the week index, with an 80 percent CI from a moving block bootstrap. With only 14 weekly observations the CI on the slope is necessarily wide, and that is the correct, honest answer. We refuse to publish a tighter CI than the data supports, and we will not publish any long-run floor estimate until we have at least 20 weekly observations. The current slope is -0.0315 per week with an 80 percent CI of (-0.0179, +0.0158).

Rolling top-10 uses whichever 10 models hold the top spots on the live leaderboard each week, so the membership changes whenever a new flagship launches or an old one is overtaken. Composition-fixed uses the same 10 slugs across every week (the latest week's top 10), with no backfill - if a slug did not exist in an early week we simply drop it from that week's sample, and we report the smaller n. The two series tell you different things. Rolling answers "what does the current frontier cost". Composition-fixed answers "how have these specific 10 models repriced over time". The gap between them is the composition effect.

Two reasons. First, the long tail of premium endpoints (image generation video, reasoning specialist tiers above $500/M) is so far above the main commodity market in price that including them turns the dispersion into a measure of "is the long tail still long" rather than "is the commodity market converging". Second, those endpoints are typically priced as launch experiments and reprice on quarterly or annual cycles, contributing pure noise to a weekly metric. The commodity tier on this page additionally caps at $50/M to track the cloud-buyer surface specifically.

Mostly no, by construction. Biweight midvariance downweights any single observation more than three or four MADs away from the median by zero, so a single anomalous price has bounded influence on the headline. The per-provider variance contribution table on this page shows leave-one-provider-out shares so you can see directly which provider currently moves the metric the most. In the latest week, the largest single contributor is OpenAI at 53 percent of variance.

Week

Frontier PCI

CI80

Frontier $/M

02-16

0.969

0.863-1.159

$10.3

02-23

0.504

0.063-1.049

$9.66

03-02

0.957

0.843-1.135

$9.66

03-09

0.824

0.609-1.041

$7.56

03-16

0.773

0.592-1.028

$4.80

03-23

0.773

0.590-1.034

$4.80

03-30

0.773

0.578-1.005

$4.80

04-06

0.644

0.612-0.784

$15.1

04-13

0.644

0.610-0.783

$15.1

04-20

0.587

0.499-0.757

$20.8

04-27

0.665

0.590-0.855

$15.1

05-04

0.562

0.533-0.741

$20.8

05-11

0.543

0.383-0.741

$68.3

05-18

0.576

0.552-0.792

$20.8

Is LLM Pricing Converging?

Rolling top-10 frontier

Composition-fixed top-10

Commodity tier

Weekly summary across all four tiers

Who is moving the frontier variance?

Where the PCI cannot help you

Full methodology

Machine-readable JSON

Related pages

Is LLM Pricing Converging?

Rolling top-10 frontier

Composition-fixed top-10

Commodity tier

Weekly summary across all four tiers

Who is moving the frontier variance?

Where the PCI cannot help you

Full methodology

Machine-readable JSON

Related pages