300 models ranked for music and songwriting. Scores include bonuses for large output (full lyrics), streaming, web search (references), large context (song collections), and reasoning (music theory).
| # | Model | Provider | Score |
|---|---|---|---|
| 1 | GPT-5.4 Pro | OpenAI | 94 |
| 2 | GPT-5.4 | OpenAI | 94 |
| 3 | GPT-5.4 Mini | OpenAI | 93 |
| 4 | GPT-5.2 Pro | OpenAI | 93 |
| 5 | GPT-5.2 | OpenAI | 93 |
| 6 | Claude Opus 4.6 | Anthropic | 92 |
| 7 | GPT-5 Pro | OpenAI | 92 |
| 8 | o3 Deep Research | OpenAI | 92 |
| 9 | Claude Opus 4.5 | Anthropic | 90 |
| 10 | GPT-5 | OpenAI | 90 |
| 11 | Claude Sonnet 4.6 | Anthropic | 89 |
| 12 | Claude Sonnet 4.5 | Anthropic | 89 |
| 13 | o3 Pro | OpenAI | 88 |
| 14 | Grok 4.1 Fast | xAI | 87 |
| 15 | Gemini 3 Flash Preview | Google | 89 |
| 16 | o3 | OpenAI | 86 |
| 17 | GPT-5.1 | OpenAI | 85 |
| 18 | GPT-5.4 Nano | OpenAI | 85 |
| 19 | GPT-5.3-Codex | OpenAI | 85 |
| 20 | GPT-5.2-Codex | OpenAI | 85 |
| 21 | GPT-5.1-Codex-Max | OpenAI | 85 |
| 22 | o4 Mini Deep Research | OpenAI | 85 |
| 23 | o4 Mini High | OpenAI | 85 |
| 24 | o4 Mini | OpenAI | 84 |
| 25 | Grok 4 Fast | xAI | 83 |
| 26 | GPT-5.3 Chat | OpenAI | 85 |
| 27 | GPT-5.1 Chat | OpenAI | 85 |
| 28 | Claude Haiku 4.5 | Anthropic | 83 |
| 29 | Gemini 3.1 Pro Preview | Google | 86 |
| 30 | MiMo-V2-Omni | Xiaomi | 85 |
Large-output models generate complete lyrics with verse, chorus, and bridge structure. Streaming shows lyrics as they are generated, which suits collaborative songwriting sessions.
Reasoning models analyze chord progressions, scales, and harmonic structures. Get suggestions for melody development, key changes, and arrangement decisions with chain-of-thought explanations.
Models with web search find reference tracks, sample libraries, and production techniques. Large context windows let you work with full project notes and lyrics at the same time.
Generate press releases, social media content, EPK descriptions, and playlist pitch emails. Models help craft compelling narratives around your music and brand.
The top-ranked models appear at the top of this page, ordered by our composite score, which is updated hourly. Rankings consider benchmarks, pricing, capabilities, and community adoption.
Several models listed on this page offer free tiers or are fully open-source. Look for models marked as Free in the pricing column above.
We use a composite scoring system combining benchmark performance, capability matching, pricing, context window size, and community adoption. Scores are updated hourly.
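The page names the components of the composite score but does not publish the formula. As a rough illustration only, the sketch below combines the five components as a weighted sum and treats capability matching as the fraction of music-relevant capabilities a model supports. The weights, the field names, the 0 to 1 normalization, and the `ModelStats` type are all assumptions made for this example, not the site's actual method.

```python
from dataclasses import dataclass

# Hypothetical weights: the page lists these components but not their weights.
WEIGHTS = {
    "benchmark": 0.40,
    "capability": 0.25,
    "pricing": 0.15,
    "context": 0.10,
    "adoption": 0.10,
}

# Capabilities the page says earn a bonus for music and songwriting.
MUSIC_CAPABILITIES = (
    "large_output",   # full lyrics
    "streaming",      # live collaborative sessions
    "web_search",     # references
    "large_context",  # song collections
    "reasoning",      # music theory
)


@dataclass(frozen=True)
class ModelStats:
    """Illustrative per-model inputs, each assumed to be normalized to 0..1."""
    name: str
    benchmark: float   # benchmark performance
    pricing: float     # higher means cheaper
    context: float     # context window size
    adoption: float    # community adoption
    capabilities: frozenset


def capability_match(capabilities) -> float:
    """Fraction of the music-relevant capabilities that a model supports."""
    hits = sum(cap in capabilities for cap in MUSIC_CAPABILITIES)
    return hits / len(MUSIC_CAPABILITIES)


def composite_score(model: ModelStats) -> int:
    """Weighted sum of the normalized components, scaled to 0..100."""
    components = {
        "benchmark": model.benchmark,
        "capability": capability_match(model.capabilities),
        "pricing": model.pricing,
        "context": model.context,
        "adoption": model.adoption,
    }
    total = sum(WEIGHTS[key] * value for key, value in components.items())
    return round(100 * total)


def rank(models):
    """Return models ordered from highest to lowest composite score."""
    return sorted(models, key=composite_score, reverse=True)
```

Calling `rank(models)` on a list of `ModelStats` returns it in leaderboard order. Under these assumed weights, a model with strong benchmarks but none of the five capabilities tops out at 75, which is one way a capability bonus could reorder otherwise similar models.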
Rankings refresh every hour using data from benchmarks, API testing, and community metrics, so the data shown reflects the most recent hourly update.