This demo runs an XTTS model fine-tuned on Estonian data. XTTS is a multilingual text-to-speech and voice-cloning model. The demo performs zero-shot voice cloning: it imitates the voice in a reference recording without any further training.


Language

Select an output language for the synthesised speech

Cleanup Reference Voice

Enabling this option can improve output if your microphone or reference voice is noisy

I agree to the terms of the CPML: https://coqui.ai/cpml
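
The form inputs above can be summarised as a single record. The following is a minimal sketch, not the demo's actual code: the class name, field names, the `problems` helper, and the two language codes are all assumptions chosen for illustration. It only shows how the five inputs fit together and which of them plausibly have to be filled in before synthesis can start.

```python
from dataclasses import dataclass
from typing import List

# Hypothetical language codes for illustration only;
# the demo's real language list is not shown on this page.
SUPPORTED_LANGUAGES = {"et", "en"}


@dataclass
class SynthesisRequest:
    """One submission of the demo form (all names here are assumptions)."""

    text_prompt: str                       # text to be spoken
    language: str                          # output language code
    reference_audio: str                   # path to the reference recording
    cleanup_reference_voice: bool = False  # optional denoising of the reference
    agree: bool = False                    # CPML terms accepted

    def problems(self) -> List[str]:
        """Return the reasons this request should not be submitted."""
        issues = []
        if not self.text_prompt.strip():
            issues.append("text prompt is empty")
        if self.language not in SUPPORTED_LANGUAGES:
            issues.append(f"unsupported language: {self.language}")
        if not self.reference_audio:
            issues.append("reference audio is missing")
        if not self.agree:
            issues.append("CPML terms not accepted")
        return issues
```

A request with every field filled in and the terms accepted yields an empty list from `problems()`. Each missing or invalid input adds one message, in the order of the checks.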

Examples
Text Prompt | Language | Reference Audio | Cleanup Reference Voice | Agree