by Meta · Rank #304 · Score 40.0
Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed for tasks that combine visual and textual data. It excels at tasks such as image captioning and...
| Signal | Normalized | Weight | Contribution | Freshness |
|---|---|---|---|---|
| Capabilities (`capability`) | 50.0 | 30% | 15.0 | 2026-05-10T07:45:27.873Z |
| Pricing (`pricing_tier`) | 99.8 | 25% | 24.9 | 2026-05-10T07:45:27.873Z |
| Context Window (`context_window`) | 81.2 | 15% | 12.2 | 2026-05-10T07:45:27.873Z |
| Recency (`recency`) | 25.0 | 15% | 3.8 | 2026-05-10T07:45:27.873Z |
| Output Capacity (`output_capacity`) | 70.2 | 15% | 10.5 | 2026-05-10T07:45:27.873Z |

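
The Contribution column appears to be each signal's normalized score multiplied by its weight. The sketch below checks that reading against the listed rows. The formula is an assumption inferred from the numbers, and the names `SIGNALS`, `contribution`, `mismatched_rows`, and `total_contribution` are hypothetical. The source does not state how the headline score is derived from these contributions, so the total computed here is only the sum of the column.

```python
# Rows copied from the signal table:
# (signal key, normalized score, weight as a fraction, listed contribution).
SIGNALS = [
    ("capability", 50.0, 0.30, 15.0),
    ("pricing_tier", 99.8, 0.25, 24.9),
    ("context_window", 81.2, 0.15, 12.2),
    ("recency", 25.0, 0.15, 3.8),
    ("output_capacity", 70.2, 0.15, 10.5),
]


def contribution(normalized: float, weight: float) -> float:
    """Assumed formula: contribution = normalized score * weight."""
    return normalized * weight


def mismatched_rows(rows, tolerance: float = 0.06):
    """Return rows whose listed contribution differs from the computed one.

    The tolerance allows for the table rounding values to one decimal place.
    """
    bad = []
    for key, normalized, weight, listed in rows:
        computed = contribution(normalized, weight)
        if abs(computed - listed) > tolerance:
            bad.append((key, computed, listed))
    return bad


def total_contribution(rows) -> float:
    """Sum the unrounded contributions of all signals."""
    return sum(contribution(n, w) for _, n, w, _ in rows)


if __name__ == "__main__":
    print("mismatches:", mismatched_rows(SIGNALS))
    print("sum of contributions:", round(total_contribution(SIGNALS), 2))
```

An empty mismatch list means every listed contribution agrees with the assumed formula to within rounding.
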
| Capability | Supported |
|---|---|
| Vision | Yes |
| Reasoning | No |
| JSON Mode | Yes |
| Streaming | Yes |
| Function Calling | No |
| Web Search | No |
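
A capability table like this one can be used to screen a request before sending it to the model. The following is a minimal sketch, assuming the table is mirrored in a plain dictionary. `CAPABILITIES` and `missing_capabilities` are hypothetical names for illustration and are not part of any provider's API.

```python
# Capability flags copied from the table above (keys are illustrative).
CAPABILITIES = {
    "vision": True,
    "reasoning": False,
    "json_mode": True,
    "streaming": True,
    "function_calling": False,
    "web_search": False,
}


def missing_capabilities(required, supported=CAPABILITIES):
    """Return the required features the model does not support, sorted by name.

    Features that are absent from the table are treated as unsupported.
    """
    return sorted(name for name in required if not supported.get(name, False))


if __name__ == "__main__":
    # An image-captioning request that streams JSON fits the listed capabilities.
    print(missing_capabilities(["vision", "json_mode", "streaming"]))
    # A tool-using request does not.
    print(missing_capabilities(["vision", "function_calling"]))
```

Treating unknown features as unsupported is a conservative choice: a request is only passed through when every feature it needs is explicitly listed as available.
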