skip to content
gigarouter gigarouter
tasks / text-to-speech

Hosted text-to-speech models

37 models · 0 live as APIs · benchmarked & compared

Text-to-speech (TTS) models convert written text into natural-sounding speech, solving problems such as generating voiceovers for videos, enabling screen readers for accessibility, powering interactive voice response systems, and providing real-time narration in navigation or e-learning applications. In production, TTS is typically integrated via API calls from applications that need to stream audio on demand—common architectures use queuing for batch jobs or low-latency streaming for conversational agents.

Choosing among the 37 models being onboarded (including coqui/XTTS-v2, multiple Qwen/Qwen3-TTS-12Hz variants, OpenMOSS-Team/MOSS-TTS, k2-fsa/OmniVoice, SWivid/F5-TTS, ai4bharat/indic-parler-tts, and Qwen/Qwen3-TTS-12Hz-1.7B-VoiceDesign) involves a trade-off between quality, speed, and model size. Larger models (e.g., 1.7B parameters) generally produce richer, more expressive speech at the cost of higher latency and compute; smaller models (e.g., 0.6B) offer faster inference and lower cost, suitable for high-throughput or latency-sensitive applications.

Using a hosted API eliminates the overhead of provisioning GPU infrastructure, managing model updates, and scaling for variable demand—making it more economical than self-hosting for most call volumes below tens of thousands of requests per minute.

compare

modelparamsdownloads/mopricestatus
coqui/XTTS-v2-9.3Mat launchcoming soon
Qwen/Qwen3-TTS-12Hz-1.7B-CustomVoice1916.7M2Mat launchcoming soon
Qwen/Qwen3-TTS-12Hz-0.6B-CustomVoice905.8M1.2Mat launchcoming soon
OpenMOSS-Team/MOSS-TTS8489.8M911.8Kat launchcoming soon
k2-fsa/OmniVoice612.6M902.4Kat launchcoming soon
SWivid/F5-TTS-799.1Kat launchcoming soon
ai4bharat/indic-parler-tts937.8M764.6Kat launchcoming soon
Qwen/Qwen3-TTS-12Hz-1.7B-VoiceDesign1916.7M657.8Kat launchcoming soon
openbmb/VoxCPM22290M640.8Kat launchcoming soon
microsoft/VibeVoice-Realtime-0.5B1017.6M638.1Kat launchcoming soon
onnx-community/Kokoro-82M-v1.0-ONNX-576.6Kat launchcoming soon
Qwen/Qwen3-TTS-12Hz-0.6B-Base914.6M571.2Kat launchcoming soon
fishaudio/s2-pro4561.9M434.2Kat launchcoming soon
sesame/csm-1b1552.8M308.2Kat launchcoming soon
microsoft/VibeVoice-1.5B2704M235.5Kat launchcoming soon
OpenMOSS-Team/MOSS-TTS-v1.58489.8M205.8Kat launchcoming soon
bosonai/higgs-tts-2-3b-base5771.3M150.2Kat launchcoming soon
facebook/mms-tts-eng36.3M137Kat launchcoming soon
myshell-ai/MeloTTS-English-135.1Kat launchcoming soon
pnnbao-ump/VieNeu-TTS-v3-Turbo130.9M135Kat launchcoming soon
facebook/hf-seamless-m4t-medium-113.8Kat launchcoming soon
neuphonic/neutts-nano228.7M113.3Kat launchcoming soon
SWivid/E2-TTS-108.8Kat launchcoming soon
bosonai/higgs-tts-3-4b4654.9M108.6Kat launchcoming soon
Misha24-10/F5-TTS_RUSSIAN-89.9Kat launchcoming soon
canopylabs/3b-de-ft-research_release3300.9M86.4Kat launchcoming soon
OpenMOSS-Team/MOSS-TTS-Nano-100M-83.5Kat launchcoming soon
microsoft/speecht5_tts-80.8Kat launchcoming soon
moonshotai/Kimi-Audio-7B-Instruct9766.3M79Kat launchcoming soon
pnnbao-ump/VieNeu-TTS-v2293.7M78.5Kat launchcoming soon
mistralai/Voxtral-4B-TTS-2603-74.5Kat launchcoming soon
kenpath/svara-tts-v1-73.1Kat launchcoming soon
myshell-ai/MeloTTS-Spanish-71.3Kat launchcoming soon
myshell-ai/MeloTTS-Korean-68.4Kat launchcoming soon
Supertone/supertonic-3-65.8Kat launchcoming soon
multimodalart/higgs-audio-v3-tts-4b-transformers4654.9M62.9Kat launchcoming soon
sbintuitions/sarashina2.2-tts809.9M59.6Kat launchcoming soon