Llama 3.2 11B Vision Instruct

by MetaRank #283Score 40.0

Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and textual data. It excels in tasks such as image captioning and...

性能概览

评分

40.0

排名

#283

24小时变化

7天变化

状态

stable

置信度

high

信号评分

信号	标准化	权重	贡献	新鲜度
Capabilities capability	50.0	30%	15.0	2026-06-24T11:17:27.686Z
Pricing pricing_tier	99.7	25%	24.9	2026-06-24T11:17:27.686Z
Context Window context_window	73.1	15%	11.0	2026-06-24T11:17:27.686Z
Recency recency	16.8	15%	2.5	2026-06-24T11:17:27.686Z
Output Capacity output_capacity	70.2	15%	10.5	2026-06-24T11:17:27.686Z

主要驱动因素

positive

Pricing

$0.34/M output tokens

$0.34

neutral

Capabilities

Supports vision, JSON mode, streaming

3/7

positive

Context Window

131K token context window

131K

positive

Output Capacity

Up to 16K output tokens per request

16K

功能

功能	支持
视觉	是
推理	否
JSON模式	是
流式输出	是
函数调用	否
网页搜索	否

价格

输入/百万令牌

$0.34

输出/百万令牌

$0.34

上下文窗口

131K

最大输出

16K

Llama 3.2 11B Vision Instruct

性能概览

信号评分

主要驱动因素

功能

价格

相关模型

相关

Llama 3.2 11B Vision Instruct

性能概览

信号评分

主要驱动因素

功能

价格

相关模型

相关