信号分析探索器

了解综合评分如何由各个信号组成。每个信号衡量一个独特的质量维度，经过加权和组合产生每个模型的最终评分。

平均信号贡献

LMMarketCap.com

信号概览

所有评分模型信号数据的高级汇总。

追踪的信号

个独特质量维度

有信号数据的模型

300

共300个模型

每模型平均信号数

5.6

个信号/模型 (平均)

信号重要性排名

按综合评分计算中的平均权重排名的信号。权重越高意味着对最终评分的影响越大。

信号	平均权重	平均评分	平均贡献	最大贡献	顶级模型
Benchmarks	30.0%	72.5	21.74	28.90	Claude Fable 5(28.9)
Capabilities	24.3%	68.2	16.20	30.00	Fugu Ultra(30.0)
Pricing	19.3%	92.4	18.07	25.00	North Mini Code (free)(25.0)
Recency	15.0%	76.2	11.43	15.00	Claude Fable 5(15.0)
Context Window	12.2%	76.6	9.29	12.90	Fugu Ultra(12.9)
Output Capacity	12.2%	68.7	8.28	15.00	MiniMax-01(15.0)

信号贡献分析

按综合评分排列的前10个模型及其堆叠信号贡献。每个彩色段与该信号对总分的贡献成正比。

Benchmarks

Capabilities

Pricing

Recency

Context Window

Output Capacity

Claude Fable 596.6

Claude Opus 4.7 (Fast)94.7

Claude Opus 4.794.7

Claude Opus 4.8 (Fast)94.2

Claude Opus 4.894.2

GPT-5.592.2

Gemini 3.1 Pro Preview Custom Tools91.7

Gemini 3.1 Pro Preview91.7

GPT-5.4 Pro91.5

GPT-5.491.5

信号领先者

每个信号中，按该信号对综合评分的贡献排名的前5个模型。

Benchmarks

Capabilities

Pricing

1.North Mini Code (free)25.0
2.Nemotron 3.5 Content Safety (free)25.0
3.Nemotron 3 Ultra (free)25.0
4.Nemotron 3 Nano Omni (free)25.0
5.Laguna XS.2 (free)25.0

Recency

Context Window

Output Capacity

信号相关性

哪些信号倾向于同步变化？所有模型中信号评分之间的皮尔逊相关系数。接近+1的值表示信号同升同降；接近-1的值表示反向关系。

最相关的配对

Capabilities ↔ Benchmarks+0.616

Capabilities ↔ Context Window+0.471

Capabilities ↔ Recency+0.373

Context Window ↔ Recency+0.359

Benchmarks ↔ Recency+0.358

最不相关的配对

Capabilities ↔ Pricing-0.398

Benchmarks ↔ Pricing-0.396

Pricing ↔ Output Capacity-0.187

Pricing ↔ Context Window-0.161

Pricing ↔ Recency-0.026

方法论

信号如何工作并贡献于综合评分。

信号代表什么

信号是捕捉模型价值不同方面的独立质量维度。每个信号衡量一个特定属性，如基准性能、定价效率、上下文容量或功能广度。它们共同提供了模型质量的多维视角。

权重如何应用

每个信号被赋予一个权重，反映其在整体评估中的重要性。权重以分数形式表示，总和为1.0（100%）。权重为0.25的信号最多可贡献综合评分的25%。权重根据信号与实际模型质量的相关性进行校准。

归一化评分（0-100）

每个信号的原始值被归一化到0-100的范围，使信号可以比较，无论其原始单位如何。100分表示该模型在该信号中排名最高，而0分表示最低性能。先计算Z分数，然后映射到0-100范围。

贡献如何计算

信号的贡献等于其权重乘以归一化评分。例如，权重为0.25、归一化评分为80的信号贡献20分到综合评分。所有贡献的总和即为最终综合评分。这使得很容易看出哪些信号驱动了每个模型的排名。

探索更多

通过基准测试、完整排行榜和其他探索器视图继续探索AI模型数据。

全部探索器基准测试排行榜

Frequently Asked Questions

SignalScore breaks down into six components: Capability (breadth of supported features), Pricing (cost competitiveness), Context (input window size), Recency (how new the model is), Output (generation capacity), and Versatility (range of supported tasks and modalities).

Capability and Pricing each carry 25% weight, making them the two most impactful signals. A model that supports many capabilities (vision, function calling, streaming, reasoning) and has competitive pricing will score significantly higher than one that excels in only one dimension.

Some signals are positively correlated - models with large context windows tend to also have broad capabilities. Others show negative correlation - the most capable premium models often score low on pricing. Understanding these correlations helps explain why some models rank differently than expected.