Model Gallery

Discover and install AI models from our curated collection

4 models available
1 repositories
Documentation

Find Your Perfect Model

Filter by Model Type

Browse by Tags

vits-piper-en_GB-aru-medium-sherpa
English (en_GB) multi-speaker (12 voices) Piper VITS voice "aru" (medium quality, 22.05 kHz), served through the sherpa-onnx backend with native streaming TTS. Ships espeak-ng phonemization data. Pick a speaker with the numeric voice/speaker id.

Repository: localaiLicense: cc-by-4.0

vits-piper-en_GB-semaine-medium-sherpa
English (en_GB) multi-speaker (4 voices) Piper VITS voice "semaine" (medium quality, 22.05 kHz), served through the sherpa-onnx backend with native streaming TTS. Ships espeak-ng phonemization data. Pick a speaker with the numeric voice/speaker id. Non-commercial use only (CC BY-NC-SA 4.0).

Repository: localaiLicense: cc-by-nc-sa-4.0

vits-piper-en_GB-vctk-medium-sherpa
English (en_GB) multi-speaker (109 voices) Piper VITS voice "vctk" (medium quality, 22.05 kHz), served through the sherpa-onnx backend with native streaming TTS. Ships espeak-ng phonemization data. Pick a speaker with the numeric voice/speaker id.

Repository: localaiLicense: cc-by-4.0

supertonic-3
Supertonic multilingual text-to-speech (Supertone/supertonic-3), served through the native supertonic backend via ONNX Runtime. Lightning-fast on-device flow-matching TTS with 44.1 kHz output, 31 languages, and 10 preset voice styles (F1-F5, M1-M5). No espeak-ng dependency. Defaults to voice F1; override per request with the OpenAI `voice` field, and optionally pass `language=` (e.g. en, ko, ja, it; "na" for language-agnostic).

Repository: localaiLicense: mit