Google (29 models) vs NVIDIA (9 models) - compared across composite scores, pricing, capabilities, and context windows.
| Capability | NVIDIA | Leader | |
|---|---|---|---|
Vision | 25/29 | 2/9 | |
Reasoning | 17/29 | 9/9 | |
Function Calling | 19/29 | 9/9 | |
JSON Mode | 26/29 | 6/9 | |
Web Search | 16/29 | 0/9 | |
Streaming | 27/29 | 9/9 | |
Image Output | 4/29 | 0/9 |
| Metric | NVIDIA | |
|---|---|---|
| Cheapest Input (per 1M tokens) | $0.040 Gemma 3 4B | $0.040 Nemotron Nano 9B V2 |
| Cheapest Output (per 1M tokens) | $0.080 | $0.160 |
| Most Expensive Input (per 1M tokens) | $2.00 Gemini 3.1 Pro Preview Custom Tools | $0.100 Nemotron 3 Super |
| Most Expensive Output (per 1M tokens) | $12.00 | $0.450 |
| Free Models | 4 | 5 |
| Max Context Window | 1.0M | 262K |
| Model | Score | Input $/M | Output $/M |
|---|---|---|---|
| Gemini 3 Flash Preview | 88 | $0.500 | $3.00 |
| Gemini 2.5 Pro | 84 | $1.25 | $10.00 |
| Gemini 2.5 Pro Preview 06-05 | 84 | $1.25 | $10.00 |
| Gemini 2.5 Pro Preview 05-06 | 84 | $1.25 | $10.00 |
| Gemini 3.1 Pro Preview Custom Tools | 81 | $2.00 | $12.00 |
| Gemini 3.1 Pro Preview | 81 | $2.00 | $12.00 |
| Gemma 4 31B (free) | 81 | Free | Free |
| Gemma 4 31B | 81 | $0.130 | $0.380 |
| Gemini 3.1 Flash Lite Preview | 80 | $0.250 | $1.50 |
| Gemini 2.5 Flash Lite Preview 09-2025 | 79 | $0.100 | $0.400 |
| Gemini 2.5 Flash Lite | 79 | $0.100 | $0.400 |
| Gemini 2.5 Flash | 79 | $0.300 | $2.50 |
| Gemma 2 27B | 77 | $0.650 | $0.650 |
| Gemma 4 26B A4B (free) | 73 | Free | Free |
| Gemma 4 26B A4B | 73 | $0.060 | $0.330 |
| Gemini 2.0 Flash | 72 | $0.100 | $0.400 |
| Gemini 2.0 Flash Lite | 59 | $0.075 | $0.300 |
| Lyria 3 Pro Preview | 40 | Free | Free |
| Lyria 3 Clip Preview | 40 | Free | Free |
| Gemma 3n 4B | 40 | $0.060 | $0.120 |
| Model | Score | Input $/M | Output $/M |
|---|---|---|---|
| Llama 3.3 Nemotron Super 49B V1.5 | 61 | $0.100 | $0.400 |
| Nemotron 3 Nano Omni (free) | 40 | Free | Free |
| Nemotron 3 Super (free) | 40 | Free | Free |
| Nemotron 3 Super | 40 | $0.090 | $0.450 |
| Nemotron 3 Nano 30B A3B (free) | 40 | Free | Free |
| Nemotron 3 Nano 30B A3B | 40 | $0.050 | $0.200 |
| Nemotron Nano 12B 2 VL (free) | 40 | Free | Free |
| Nemotron Nano 9B V2 (free) | 40 | Free | Free |
| Nemotron Nano 9B V2 | 40 | $0.040 | $0.160 |
Compare any two AI providers side-by-side.
Google's portfolio strategy prioritizes multimodal AI across price points, with 79% of models supporting vision (27/34) while NVIDIA concentrates on specialized reasoning tasks at 91% coverage (10/11). This reflects Google's consumer-facing product integration needs versus NVIDIA's focus on enterprise inference workloads, though Google's top-performing Gemini 2.5 Flash Lite Preview still outscores NVIDIA's best Nemotron 3 Nano by 15 points (60 vs 45).
Google leverages 15 open-source models (44% of portfolio) and 9 free-tier options to create aggressive pricing anchors, while NVIDIA's all-open-source approach (11/11 models) paradoxically results in higher minimum pricing. Google's vertical integration from TPUs to cloud infrastructure enables this pricing strategy, whereas NVIDIA focuses on premium inference optimization that commands $1.80/M at the high end versus Google's $12.00/M ceiling.
NVIDIA's concentrated portfolio delivers more consistent function calling support at 82% coverage despite having 23 fewer models than Google, making it more predictable for enterprise deployments. However, Google's 1M token context window dwarfs NVIDIA's 262K maximum, and Google's 16 function-capable models still outnumber NVIDIA's total portfolio of 11, offering more options for specific use case optimization.
Both providers view real-time web access as outside their core LLM infrastructure play - Google despite having the world's leading search engine, and NVIDIA despite their enterprise focus where RAG patterns often require web data. This gap suggests both are leaving room for specialized providers while Google focuses on multimodal breadth (27 vision models) and NVIDIA on reasoning depth (91% coverage), with average scores clustered at 45 and 40 respectively.
NVIDIA's curation strategy delivers 91% reasoning coverage (10/11) and 82% function calling (9/11) with clear enterprise optimization, while Google's broader portfolio averages just 47% on both capabilities despite 3x more models. For production workloads requiring consistent capabilities across model selections, NVIDIA's focused approach reduces evaluation overhead, though you sacrifice Google's 15-point performance advantage (60 vs 45 top scores) and pay a 4x premium on entry pricing ($0.160 vs $0.040).