Text-to-Speech by Mistral AI: Repository signal via Hugging Face trending
This page stays evidence-first until LM Market Cap can verify real availability, pricing, and benchmark coverage.
We introduce Voxtral TTS, an expressive multilingual text-to-speech model that generates natural speech from as little as 3 seconds of reference audio.
This page tracks a pre-release signal for Voxtral-4B-TTS-2603. Once availability is confirmed, it will appear on our LLM Leaderboard with full scoring, benchmarks, and pricing. Until then, release date, pricing, and benchmark claims remain unconfirmed unless they are directly cited above. Check our New AI Models page for the latest releases and the source links above for the current evidence.
Voxtral-4B-TTS-2603 was first detected on March 26, 2026 via Hugging Face trending. That is a detection signal, not a confirmed launch date, so we keep tracking the source evidence until official availability is verified.
Voxtral-4B-TTS-2603 has not yet been confirmed as officially released. The current evidence comes from repository, research, or other detection signals rather than a confirmed provider launch post. Treat this page as a watch page until an official announcement appears.
You can verify the source links, the detection dates, and the evidence type shown on this page. We do not treat release date, pricing, or benchmark availability as confirmed unless those details are cited directly in the source evidence above.