Update XTTS docs

Edresson Casanova 2023-10-23 11:03:57 -03:00
parent 8853e1c3ec
commit 6fefc36e5a
1 changed files with 11 additions and 0 deletions


@ -134,6 +134,17 @@ torchaudio.save("xtts_streaming.wav", wav.squeeze().unsqueeze(0).cpu(), 24000)
```
### Training
A recipe for `XTTS_v1.1` GPT encoder training using the `LJSpeech` dataset is shown below. Let's be creative and call it `train_gpt_xtts.py`.
```{literalinclude} ../../recipes/ljspeech/xtts_v1/train_gpt_xtts.py
```
You need to change the fields of the `BaseDatasetConfig` to match your dataset and then update the `GPTArgs` and `GPTTrainerConfig` fields as needed. By default, the recipe uses the same parameters that the XTTS v1.1 model was trained with. To speed up convergence, it also downloads the XTTS v1.1 checkpoint and loads it by default.
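
As a rough illustration of adapting the dataset fields, the sketch below builds the values for a dataset laid out like LJSpeech (a root folder containing a `metadata.csv`). The helper name `ljspeech_style_dataset_fields`, the `/data/my_voice` path, and the assumption that the relevant fields are `formatter`, `dataset_name`, `path`, `meta_file_train`, and `language` are illustrative and not taken from the recipe itself, so check them against the recipe file before relying on them.

```python
import os


def ljspeech_style_dataset_fields(dataset_root, language="en"):
    """Return keyword arguments for a dataset laid out like LJSpeech.

    Hypothetical helper: the key names are assumed to match the
    dataset config fields, they are not copied from the recipe.
    """
    return {
        # Assumed: the LJSpeech formatter reads "metadata.csv"-style files.
        "formatter": "ljspeech",
        # Use the last folder name of the root as the dataset name.
        "dataset_name": os.path.basename(os.path.normpath(dataset_root)),
        "path": dataset_root,
        "meta_file_train": os.path.join(dataset_root, "metadata.csv"),
        "language": language,
    }


# Placeholder path; replace it with the location of your own dataset.
fields = ljspeech_style_dataset_fields("/data/my_voice")
for name, value in fields.items():
    print(f"{name} = {value}")
```

If the field names match your installed version, the resulting dictionary could be unpacked into the config, for example `BaseDatasetConfig(**fields)`, and the rest of the recipe left unchanged.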
## Important resources & papers
- VallE: https://arxiv.org/abs/2301.02111
- Tortoise Repo: https://github.com/neonbjb/tortoise-tts