Upload a 30-second audio clip and SpiritBox trains a custom voice model using Piper TTS and Whisper. The entire process runs locally on your GPU, so your voice samples never leave your machine.

| Voice | Language | Quality | Type | Preview |
|---|---|---|---|---|
| en_US-lessac-high | English (US) | High | Built-in | |
| en_GB-alba-medium | English (UK) | Medium | Built-in | |
| de_DE-thorsten-high | German | High | Built-in | |
| fr_FR-siwis-medium | French | Medium | Built-in | |
| es_ES-sharvard-medium | Spanish | Medium | Built-in | |
| ryan-custom-v1 | English (CA) | High | Cloned | |
| brand-narrator-v2 | English (US) | High | Cloned | |

Piper TTS supports more than 50 languages. To add more voices, download their models from the Voices page in the full app.
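
The built-in voice names in the table appear to follow a `<language>_<REGION>-<speaker>-<quality>` pattern, while cloned voices use free-form names. As a minimal sketch of how a script might split a built-in name into the Language and Quality columns, the snippet below uses a hypothetical `parse_voice_id` helper and `VoiceId` type. Neither is part of SpiritBox or Piper, and the accepted quality levels (`x_low`, `low`, `medium`, `high`) are an assumption beyond the two levels shown in the table.

```python
import re
from typing import NamedTuple, Optional


class VoiceId(NamedTuple):
    """Parts of a built-in voice name (hypothetical type, for illustration only)."""
    language: str  # e.g. "en"
    region: str    # e.g. "US"
    speaker: str   # e.g. "lessac"
    quality: str   # e.g. "high"


# Assumed naming pattern: <language>_<REGION>-<speaker>-<quality>.
# The list of quality levels is an assumption; the table only shows medium and high.
_VOICE_PATTERN = re.compile(
    r"^([a-z]{2,3})_([A-Z]{2})-([A-Za-z0-9_]+)-(x_low|low|medium|high)$"
)


def parse_voice_id(voice_id: str) -> Optional[VoiceId]:
    """Split a built-in voice name into its parts.

    Returns None for names that do not match the assumed pattern,
    such as cloned voices with free-form names.
    """
    match = _VOICE_PATTERN.match(voice_id)
    if match is None:
        return None
    return VoiceId(*match.groups())


if __name__ == "__main__":
    for name in ("en_US-lessac-high", "de_DE-thorsten-high", "ryan-custom-v1"):
        print(name, "->", parse_voice_id(name))
```

Returning `None` for names that do not match keeps cloned voices such as `ryan-custom-v1` from being mislabeled; a caller would take their language and quality from the stored metadata instead.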