by MetaRank #304Score 40.0
Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and textual data. It excels in tasks such as image captioning and...
| 信号 | 标准化 | 权重 | 贡献 | 新鲜度 |
|---|---|---|---|---|
Capabilities capability | 50.0 | 30% | 15.0 | 2026-05-10T09:07:14.522Z |
Pricing pricing_tier | 99.8 | 25% | 24.9 | 2026-05-10T09:07:14.522Z |
Context Window context_window | 81.2 | 15% | 12.2 | 2026-05-10T09:07:14.522Z |
Recency recency | 25.0 | 15% | 3.8 | 2026-05-10T09:07:14.522Z |
Output Capacity output_capacity | 70.2 | 15% | 10.5 | 2026-05-10T09:07:14.522Z |
| 功能 | 支持 |
|---|---|
| 视觉 | 是 |
| 推理 | 否 |
| JSON模式 | 是 |
| 流式输出 | 是 |
| 函数调用 | 否 |
| 网页搜索 | 否 |