性能衰退追踪器

检测AI模型可能出现的性能下降。此追踪器监控排名变动(排行榜上的位置变化),并标记在24小时或7天内显著下降的模型。"衰退分"越高,意味着越多的警告信号。

存在风险的模型

下降中 (7天)

不稳定

持续下降

LMMarketCap.com

存在风险的模型

27 个模型显示性能衰退迹象,按风险评分排名。更高的风险评分表示更令人担忧的性能趋势。

衰退分	模型	提供商	质量	24小时排名	7天排名	严重程度	状态
146	GLM 5V TurboZhipu AI	Zhipu AI	40.0	-146	+115	high	preliminary
38	Mistral NemoMistral AI	Mistral AI	39.9	-11	-11	high	fragile
8	Trinity Large Thinkingarcee-ai	arcee-ai	62.7	+1	-4	medium	stable
3	Lyria 3 Pro PreviewGoogle	Google	40.0	-1	-1	low	stable
3	Lyria 3 Clip PreviewGoogle	Google	40.0	-1	-1	low	stable
3	KAT-Coder-Pro V2Kuaishou	Kuaishou	40.0	-1	-1	low	stable
3	Reka Edgerekaai	rekaai	40.0	-1	-1	low	stable
3	Mistral Small 4Mistral AI	Mistral AI	40.0	-1	-1	low	stable
3	Nemotron 3 Super (free)NVIDIA	NVIDIA	40.0	-1	-1	low	stable
3	Nemotron 3 SuperNVIDIA	NVIDIA	40.0	-1	-1	low	stable
3	Seed-2.0-LiteByteDance	ByteDance	40.0	-1	-1	low	stable
3	Seed-2.0-MiniByteDance	ByteDance	40.0	-1	-1	low	stable
3	LFM2-24B-A2BLiquid AI	Liquid AI	40.0	-1	-1	low	stable
3	Aion-2.0aion-labs	aion-labs	40.0	-1	-1	low	stable
3	Qwen3.5 Plus 2026-02-15Alibaba	Alibaba	40.0	-1	-1	low	stable
3	Qwen3 Coder NextAlibaba	Alibaba	40.0	-1	-1	low	stable
3	Solar Pro 3Upstage	Upstage	40.0	-1	-1	low	stable
3	Palmyra X5Writer	Writer	40.0	-1	-1	low	stable
3	LFM2.5-1.2B-Thinking (free)Liquid AI	Liquid AI	40.0	-1	-1	low	stable
3	LFM2.5-1.2B-Instruct (free)Liquid AI	Liquid AI	40.0	-1	-1	low	stable
3	GPT AudioOpenAI	OpenAI	40.0	-1	-1	low	stable
3	GPT Audio MiniOpenAI	OpenAI	40.0	-1	-1	low	stable
3	Seed 1.6 FlashByteDance	ByteDance	40.0	-1	-1	low	stable
3	Seed 1.6ByteDance	ByteDance	40.0	-1	-1	low	stable
3	Nemotron 3 Nano 30B A3B (free)NVIDIA	NVIDIA	40.0	-1	-1	low	stable
3	Nemotron 3 Nano 30B A3BNVIDIA	NVIDIA	40.0	-1	-1	low	stable
3	Coder Largearcee-ai	arcee-ai	39.3	-1	-1	low	stable

272 个模型无下降且排名状态稳定。这些模型表现一致。

#	模型	提供商	评分	状态
1	Claude Fable 5Anthropic	Anthropic	96.6	stable
2	Claude Opus 4.7 (Fast)Anthropic	Anthropic	94.7	stable
3	Claude Opus 4.7Anthropic	Anthropic	94.7	stable
4	Claude Opus 4.8 (Fast)Anthropic	Anthropic	94.2	stable
5	Claude Opus 4.8Anthropic	Anthropic	94.2	stable
6	GPT-5.5OpenAI	OpenAI	92.2	stable
7	Gemini 3.1 Pro Preview Custom ToolsGoogle	Google	91.7	stable
8	Gemini 3.1 Pro PreviewGoogle	Google	91.7	stable
9	GPT-5.4 ProOpenAI	OpenAI	91.5	stable
10	GPT-5.4OpenAI	OpenAI	91.5	stable
11	GPT-5.5 ProOpenAI	OpenAI	90.3	stable
12	GPT-5.2-CodexOpenAI	OpenAI	90.1	stable
13	GPT-5.2 ProOpenAI	OpenAI	90.1	stable
14	GPT-5.2OpenAI	OpenAI	90.1	stable
15	Claude Opus 4.6 (Fast)Anthropic	Anthropic	90.0	stable
16	Claude Opus 4.6Anthropic	Anthropic	90.0	stable
17	Grok 4.20xAI	xAI	88.3	stable
18	GPT-5.3-CodexOpenAI	OpenAI	88.2	stable
19	GPT-5 ProOpenAI	OpenAI	88.2	stable
20	GPT-5 CodexOpenAI	OpenAI	88.2	stable

显示 272 个稳定模型中的前 20 个。

我们的性能衰退检测系统使用多种信号来识别可能正在下降的模型。

7天排名变化超过-2位的模型。一周内持续下降超过两个名次,表明该模型可能正在被竞争对手超越或出现性能问题。

被评分系统标记为"脆弱"的模型。这些模型的性能指标不一致或评分处于边界,评估数据的微小变化可能导致显著波动。

在24小时和7天两个时间维度上均下降的模型。当模型在短期和中期窗口都在失去排名时,表明这是持续的下降趋势而非暂时波动。

性能衰退风险评分综合了多个信号:7天排名下降权重2倍,24小时排名下降权重1倍,脆弱状态额外加5分。更高的评分表示有更大的性能衰退风险。