Degradation Tracker

Detect when AI models may be declining. This tracker monitors rank movements (position changes on the leaderboard) and flags models that have dropped significantly over 24 hours or 7 days. A higher “degradation points” number means more warning signs.

Models at Risk

Declining (7d)

Fragile

Sustained Decline

Top Models by Degradation Risk Score

LMMarketCap.com

Models at Risk

27 models showing signs of degradation, ranked by risk score. Higher risk scores indicate more concerning performance trends.

Deg. Pts	Model	Provider	Quality	Rank 24h	Rank 7d	Severity	State
146	GLM 5V TurboZhipu AI	Zhipu AI	40.0	-146	+115	high	preliminary
38	Mistral NemoMistral AI	Mistral AI	39.9	-11	-11	high	fragile
8	Trinity Large Thinkingarcee-ai	arcee-ai	62.7	+1	-4	medium	stable
3	Lyria 3 Pro PreviewGoogle	Google	40.0	-1	-1	low	stable
3	Lyria 3 Clip PreviewGoogle	Google	40.0	-1	-1	low	stable
3	KAT-Coder-Pro V2Kuaishou	Kuaishou	40.0	-1	-1	low	stable
3	Reka Edgerekaai	rekaai	40.0	-1	-1	low	stable
3	Mistral Small 4Mistral AI	Mistral AI	40.0	-1	-1	low	stable
3	Nemotron 3 Super (free)NVIDIA	NVIDIA	40.0	-1	-1	low	stable
3	Nemotron 3 SuperNVIDIA	NVIDIA	40.0	-1	-1	low	stable
3	Seed-2.0-LiteByteDance	ByteDance	40.0	-1	-1	low	stable
3	Seed-2.0-MiniByteDance	ByteDance	40.0	-1	-1	low	stable
3	LFM2-24B-A2BLiquid AI	Liquid AI	40.0	-1	-1	low	stable
3	Aion-2.0aion-labs	aion-labs	40.0	-1	-1	low	stable
3	Qwen3.5 Plus 2026-02-15Alibaba	Alibaba	40.0	-1	-1	low	stable
3	Qwen3 Coder NextAlibaba	Alibaba	40.0	-1	-1	low	stable
3	Solar Pro 3Upstage	Upstage	40.0	-1	-1	low	stable
3	Palmyra X5Writer	Writer	40.0	-1	-1	low	stable
3	LFM2.5-1.2B-Thinking (free)Liquid AI	Liquid AI	40.0	-1	-1	low	stable
3	LFM2.5-1.2B-Instruct (free)Liquid AI	Liquid AI	40.0	-1	-1	low	stable
3	GPT AudioOpenAI	OpenAI	40.0	-1	-1	low	stable
3	GPT Audio MiniOpenAI	OpenAI	40.0	-1	-1	low	stable
3	Seed 1.6 FlashByteDance	ByteDance	40.0	-1	-1	low	stable
3	Seed 1.6ByteDance	ByteDance	40.0	-1	-1	low	stable
3	Nemotron 3 Nano 30B A3B (free)NVIDIA	NVIDIA	40.0	-1	-1	low	stable
3	Nemotron 3 Nano 30B A3BNVIDIA	NVIDIA	40.0	-1	-1	low	stable
3	Coder Largearcee-ai	arcee-ai	39.3	-1	-1	low	stable

Stable Models

272 models with no decline and a stable ranking state. These models are performing consistently.

#	Model	Provider	Score	State
1	Claude Fable 5Anthropic	Anthropic	96.6	stable
2	Claude Opus 4.7 (Fast)Anthropic	Anthropic	94.7	stable
3	Claude Opus 4.7Anthropic	Anthropic	94.7	stable
4	Claude Opus 4.8 (Fast)Anthropic	Anthropic	94.2	stable
5	Claude Opus 4.8Anthropic	Anthropic	94.2	stable
6	GPT-5.5OpenAI	OpenAI	92.2	stable
7	Gemini 3.1 Pro Preview Custom ToolsGoogle	Google	91.7	stable
8	Gemini 3.1 Pro PreviewGoogle	Google	91.7	stable
9	GPT-5.4 ProOpenAI	OpenAI	91.5	stable
10	GPT-5.4OpenAI	OpenAI	91.5	stable
11	GPT-5.5 ProOpenAI	OpenAI	90.3	stable
12	GPT-5.2-CodexOpenAI	OpenAI	90.1	stable
13	GPT-5.2 ProOpenAI	OpenAI	90.1	stable
14	GPT-5.2OpenAI	OpenAI	90.1	stable
15	Claude Opus 4.6 (Fast)Anthropic	Anthropic	90.0	stable
16	Claude Opus 4.6Anthropic	Anthropic	90.0	stable
17	Grok 4.20xAI	xAI	88.3	stable
18	GPT-5.3-CodexOpenAI	OpenAI	88.2	stable
19	GPT-5 ProOpenAI	OpenAI	88.2	stable
20	GPT-5 CodexOpenAI	OpenAI	88.2	stable

Showing top 20 of 272 stable models.

How Degradation Is Detected

Our degradation detection system uses multiple signals to identify models that may be declining in quality or reliability.

Declining (7d)

Models whose 7-day rank change is worse than -2 positions. A sustained drop of more than two ranks over a week suggests the model may be losing ground to competitors or experiencing performance issues.

Fragile State

Models classified as "fragile" by our scoring system. These models have inconsistent performance metrics or borderline scores that could shift significantly with small changes in evaluation data.

Sustained Decline

Models declining on both the 24-hour and 7-day timeframes. When a model is losing rank on both short and medium-term windows, it indicates a persistent downward trend rather than temporary fluctuation.

Risk Score

The degradation risk score combines multiple signals: 7-day rank decline weighted 2x, 24-hour rank decline weighted 1x, plus 5 bonus points for fragile state. Higher scores indicate greater risk of meaningful performance degradation.

All Trackers

Coding, image, and video model trackers

Coding Tracker

Daily coding model performance and rankings

Leaderboard

Full model leaderboard with composite scores

Frequently Asked Questions

The tracker uses a multi-signal approach: it monitors 7-day rank decline (weighted 2x), 24-hour rank drops (weighted 1x), and fragile state classification (+5 points). Models are scored on a degradation risk scale where higher values indicate more warning signs of performance decline.

A fragile state indicates that a model has inconsistent performance metrics or borderline scores that could shift significantly with small changes in evaluation data. Fragile models are at higher risk of further ranking drops and warrant closer monitoring.

Yes, models can recover. Degradation may be temporary due to API issues, benchmark fluctuations, or scoring recalibrations. Models that show sustained decline over multiple weeks are more concerning than those with short-term dips. The tracker monitors both 24-hour and 7-day windows to help distinguish temporary noise from real trends.

Deg. Pts

Model

Quality

Rank 24h

Rank 7d

Severity

146

GLM 5V TurboZhipu AI

40.0

-146

+115

high

Mistral NemoMistral AI

39.9

-11

high

Trinity Large Thinkingarcee-ai

62.7

-4

medium

Lyria 3 Pro PreviewGoogle

40.0

-1

low

Lyria 3 Clip PreviewGoogle

40.0

-1

low

KAT-Coder-Pro V2Kuaishou

40.0

-1

low

Reka Edgerekaai

40.0

-1

low

Mistral Small 4Mistral AI

40.0

-1

low

Nemotron 3 Super (free)NVIDIA

40.0

-1

low

Nemotron 3 SuperNVIDIA

40.0

-1

low

Seed-2.0-LiteByteDance

40.0

-1

low

Seed-2.0-MiniByteDance

40.0

-1

low

LFM2-24B-A2BLiquid AI

40.0

-1

low

Aion-2.0aion-labs

40.0

-1

low

Qwen3.5 Plus 2026-02-15Alibaba

40.0

-1

low

Qwen3 Coder NextAlibaba

40.0

-1

low

Solar Pro 3Upstage

40.0

-1

low

Palmyra X5Writer

40.0

-1

low

LFM2.5-1.2B-Thinking (free)Liquid AI

40.0

-1

low

LFM2.5-1.2B-Instruct (free)Liquid AI

40.0

-1

low

GPT AudioOpenAI

40.0

-1

low

GPT Audio MiniOpenAI

40.0

-1

low

Seed 1.6 FlashByteDance

40.0

-1

low

Seed 1.6ByteDance

40.0

-1

low

Nemotron 3 Nano 30B A3B (free)NVIDIA

40.0

-1

low

Nemotron 3 Nano 30B A3BNVIDIA

40.0

-1

low

Coder Largearcee-ai

39.3

-1

low

Model

Score

24h

State

Claude Fable 5Anthropic

96.6

stable

Claude Opus 4.7 (Fast)Anthropic

94.7

stable

Claude Opus 4.7Anthropic

94.7

stable

Claude Opus 4.8 (Fast)Anthropic

94.2

stable

Claude Opus 4.8Anthropic

94.2

stable

GPT-5.5OpenAI

92.2

stable

Gemini 3.1 Pro Preview Custom ToolsGoogle

91.7

stable

Gemini 3.1 Pro PreviewGoogle

91.7

stable

GPT-5.4 ProOpenAI

91.5

stable

GPT-5.4OpenAI

91.5

stable

GPT-5.5 ProOpenAI

90.3

stable

GPT-5.2-CodexOpenAI

90.1

stable

GPT-5.2 ProOpenAI

90.1

stable

GPT-5.2OpenAI

90.1

stable

Claude Opus 4.6 (Fast)Anthropic

90.0

stable

Claude Opus 4.6Anthropic

90.0

stable

Grok 4.20xAI

88.3

stable

GPT-5.3-CodexOpenAI

88.2

stable

GPT-5 ProOpenAI

88.2

stable

GPT-5 CodexOpenAI

88.2

stable

How Degradation Is Detected

Our degradation detection system uses multiple signals to identify models that may be declining in quality or reliability.

Declining (7d)

Fragile State

Models classified as "fragile" by our scoring system. These models have inconsistent performance metrics or borderline scores that could shift significantly with small changes in evaluation data.

Degradation Tracker

Top Models by Degradation Risk Score

Models at Risk

Stable Models

How Degradation Is Detected

Declining (7d)

Fragile State

Sustained Decline

Risk Score

Related

Degradation Tracker

Top Models by Degradation Risk Score

Models at Risk

Stable Models

How Degradation Is Detected

Declining (7d)

Fragile State

Sustained Decline

Risk Score

Related