| | --- |
| | language: |
| | - en |
| | base_model: |
| | - coqui/XTTS-v2 |
| | pipeline_tag: text-to-speech |
| | tags: |
| | - XTTS |
| | --- |
| | Instructions: Just extract the zipped files into your XTTS-WebUI main directory. |
| | Also I recommend that you disable DeepSpeed. While it does cut output times in half, it greatly reduces the output quality. |
| |
|
| | Version: 0.1.1 Pre-release. |
| | About this version: This is the first test build of a model that was built on a manually curated dataset. |
| | The dataset was initially created with whisper in step one of XTTS-Finetune. |
| | The clips were then manually edited to fix the issue of the clips being cut to short. |
| | Also the dataset's metadata was corrected for spelling errors. |
| | Dataset length: 3:49 |