How LLM capabilities spread through the catalog, quarter by quarter. Each curve shows the percentage of models in a given release-quarter cohort that shipped with a given capability. 349 models tracked across 10 release quarters.
Catalog last refreshed 2026-04-14 (auto-updated hourly from live provider APIs).
Share of each release-quarter cohort that ships each capability, expressed as a percentage.
| Cohort | N | Reasoning | Vision | Function Calling | JSON Mode | Web Search | Image Output |
|---|---|---|---|---|---|---|---|
| 2024 Q1 | 4 | 0% | 25% | 100% | 75% | 0% | 0% |
| 2024 Q2 | 13 | 0% | 31% | 46% | 46% | 0% | 15% |
| 2024 Q3 | 20 | 0% | 20% | 45% | 55% | 0% | 20% |
| 2024 Q4 | 27 | 11% | 22% | 59% | 37% | 4% | 7% |
| 2025 Q1 | 45 | 33% | 44% | 31% | 58% | 18% | 0% |
| 2025 Q2 | 43 | 51% | 47% | 74% | 74% | 28% | 0% |
| 2025 Q3 | 61 | 57% | 28% | 90% | 79% | 11% | 0% |
| 2025 Q4 | 59 | 71% | 61% | 83% | 83% | 20% | 7% |
| 2026 Q1 | 58 | 76% | 52% | 83% | 84% | 17% | 2% |
| 2026 Q2 | 9 | 100% | 78% | 100% | 100% | 11% | 0% |
First model in the catalog to ship each capability and how far the capability has spread since.
| Capability | First model | Latest adoption |
|---|---|---|
| Reasoning | Falcon3 10B Instruct | 100% |
| Vision | Claude 3 Haiku | 78% |
| Function Calling | GPT-4 (older v0314) | 100% |
| JSON Mode | GPT-4 (older v0314) | 100% |
| Web Search | Claude 3.5 Haiku | 11% |
| Image Output | DALL-E 3 | 0% |
The fastest-spreading capability over the last year is reasoning. Here is what recently shipped with it.
| Model | Provider | Released |
|---|---|---|
| Claude Opus 4.6 (Fast) | Anthropic | 2026-04-07 |
| GLM 5.1 | Zhipu AI | 2026-04-07 |
| Gemma 4 26B A4B (free) | | 2026-04-03 |
| Gemma 4 26B A4B | | 2026-04-03 |
| Gemma 4 31B (free) | | 2026-04-02 |
| Gemma 4 31B | | 2026-04-02 |
| Qwen3.6 Plus | Alibaba | 2026-04-02 |
| GLM 5V Turbo | Zhipu AI | 2026-04-01 |
| Trinity Large Thinking | arcee-ai | 2026-04-01 |
| Grok 4.20 Multi-Agent | xAI | 2026-03-31 |
| Grok 4.20 | xAI | 2026-03-31 |
| MiMo-V2-Omni | Xiaomi | 2026-03-18 |
Every model we index has a release timestamp and a binary capability manifest. We bucket models into release-quarter cohorts using that timestamp, then compute the share of each cohort that shipped with a given capability. The result is an adoption curve per capability per quarter.
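The bucketing step described above can be sketched in a few lines of Python. This is a minimal illustration, not the catalog's actual pipeline: the record layout (`released`, `capabilities`), the function names, and the sample models are all hypothetical, and rounding to whole percentages is an assumption about how the figures are presented.

```python
from collections import defaultdict
from datetime import date


def quarter_label(released: date) -> str:
    """Map a release date to a cohort label such as '2025 Q3'."""
    return f"{released.year} Q{(released.month - 1) // 3 + 1}"


def adoption_by_cohort(models, capability):
    """Return {cohort label: percent of that cohort shipping `capability`}."""
    totals = defaultdict(int)  # models released per cohort (the denominator)
    hits = defaultdict(int)    # models in the cohort that have the capability
    for model in models:
        cohort = quarter_label(model["released"])
        totals[cohort] += 1
        if capability in model["capabilities"]:
            hits[cohort] += 1
    # Assumption: percentages are rounded to whole numbers for display.
    return {c: round(100 * hits[c] / totals[c]) for c in sorted(totals)}


# Hypothetical records for illustration only, not real catalog entries.
sample_models = [
    {"name": "model-a", "released": date(2025, 1, 15), "capabilities": {"vision"}},
    {"name": "model-b", "released": date(2025, 2, 3), "capabilities": {"reasoning", "vision"}},
    {"name": "model-c", "released": date(2025, 3, 28), "capabilities": set()},
    {"name": "model-d", "released": date(2025, 4, 1), "capabilities": {"reasoning"}},
]

print(adoption_by_cohort(sample_models, "reasoning"))
# {'2025 Q1': 33, '2025 Q2': 100}
```

Running the same function once per capability yields one adoption curve per capability, with one point per release quarter.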
We exclude cohorts before 2024 Q1 from the visual curves because those quarters carry fewer than a dozen tracked releases each, so a single model can swing the percentage by tens of points. The cohort table below the charts shows all included quarters and their sample sizes.
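To make the sample-size concern concrete, the short sketch below computes how many percentage points a single model contributes in each cohort, using the N column from the cohort table. The helper name `single_model_swing` is ours, introduced only for illustration.

```python
# Cohort sizes copied from the N column of the cohort table.
COHORT_SIZES = {
    "2024 Q1": 4, "2024 Q2": 13, "2024 Q3": 20, "2024 Q4": 27,
    "2025 Q1": 45, "2025 Q2": 43, "2025 Q3": 61, "2025 Q4": 59,
    "2026 Q1": 58, "2026 Q2": 9,
}


def single_model_swing(n: int) -> float:
    """Percentage points that one model adds or removes in a cohort of size n."""
    return 100 / n


for cohort, n in COHORT_SIZES.items():
    print(f"{cohort}: N={n}, one model = {single_model_swing(n):.1f} points")
```

By this measure, one model is worth 25 points in 2024 Q1 (N=4) and about 11 points in 2026 Q2 (N=9), but under 2 points in 2025 Q3 (N=61), so the smallest included cohorts are best read with the same caution.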
Capability flags follow provider-declared manifests reconciled against documented API surfaces. A model that can accept images is counted under vision. A model that exposes a tool-calling endpoint is counted under function calling. A model that supports an explicit reasoning mode (visible or hidden thinking tokens) is counted under reasoning, even if the base mode is plain chat.
This page is a quarter-by-quarter view of how the LLM catalog has adopted each of six core capabilities: reasoning, vision, function calling, JSON mode, web search, and image output. Each cohort is defined by release quarter, and every model in that cohort contributes to the capability percentages for that quarter. This lets us see which capabilities are becoming table stakes and which are still rare. Across the latest cohort (2026 Q2, N=9), 100% of new releases shipped with reasoning support and 78% shipped with vision input.
Grouping by provider would hide the temporal story. A provider that shipped a reasoning model in 2024 Q4 and a non-reasoning model in 2025 Q2 would average out to 50% if we grouped by provider. Release-cohort grouping lets us see the calendar quarter in which each capability crossed 50% adoption, which is a much more actionable question for buyers planning a rebuild.
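As a rough sketch of the 50% question, the snippet below scans three columns of the cohort table for the first quarter at or above a threshold. Treating "crossed" as "first cohort at or above 50%" is our assumption (a stricter reading would require the share to stay above 50% afterward), and the function name is hypothetical.

```python
QUARTERS = [
    "2024 Q1", "2024 Q2", "2024 Q3", "2024 Q4",
    "2025 Q1", "2025 Q2", "2025 Q3", "2025 Q4",
    "2026 Q1", "2026 Q2",
]

# Percentages copied from the cohort table, one value per quarter above.
ADOPTION = {
    "Reasoning": [0, 0, 0, 11, 33, 51, 57, 71, 76, 100],
    "Vision": [25, 31, 20, 22, 44, 47, 28, 61, 52, 78],
    "Web Search": [0, 0, 0, 4, 18, 28, 11, 20, 17, 11],
}


def first_quarter_at_or_above(series, threshold=50):
    """Return the first quarter whose adoption is >= threshold, or None."""
    for quarter, pct in zip(QUARTERS, series):
        if pct >= threshold:
            return quarter
    return None


for capability, series in ADOPTION.items():
    print(capability, "->", first_quarter_at_or_above(series))
# Reasoning -> 2025 Q2
# Vision -> 2025 Q4
# Web Search -> None
```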
Reasoning is the fastest-spreading capability over the last year. It was at 51% one year ago (2025 Q2) and now sits at 100% of new releases (2026 Q2, N=9), a gain of 49 percentage points. Here, reasoning means an extended chain-of-thought or deliberate reasoning mode with visible or hidden thinking tokens.
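The year-over-year comparison can be reproduced from the cohort table by subtracting the 2025 Q2 row from the 2026 Q2 row. The sketch below does that for all six capabilities. The variable names are ours, and using these two rows as "one year ago" and "now" is how we read the comparison.

```python
# Rows copied from the cohort table.
ONE_YEAR_AGO = {  # 2025 Q2
    "Reasoning": 51, "Vision": 47, "Function Calling": 74,
    "JSON Mode": 74, "Web Search": 28, "Image Output": 0,
}
LATEST = {  # 2026 Q2
    "Reasoning": 100, "Vision": 78, "Function Calling": 100,
    "JSON Mode": 100, "Web Search": 11, "Image Output": 0,
}

# Gain in percentage points (not percent change) for each capability.
gains = {cap: LATEST[cap] - ONE_YEAR_AGO[cap] for cap in LATEST}
fastest = max(gains, key=gains.get)

print(gains)
print(fastest, gains[fastest])
# Reasoning 49
```

The gain is measured in percentage points rather than relative change. On that basis reasoning leads at 49 points, vision follows at 31, and web search falls by 17 points between the two cohorts.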
Yes, open-source and free models are included. The denominator for each cohort is every tracked model released in that quarter, including open source, free, and paid-only. We do not filter by license or pricing because capability adoption is a platform-wide question, and excluding open source would understate how quickly the open ecosystem is catching up on things like reasoning and vision.