coqui-tts/recipes/ljspeech
Shivam Mehta 3b8b105b0d
Adding OverFlow (#2183)
* Adding encoder

* currently modifying hmm

* Adding hmm

* Adding overflow

* Adding overflow setting up flat start

* Removing runs

* adding normalization parameters

* Fixing models on same device

* Training overflow and plotting evaluations

* Adding inference

* At the end of epoch the test sentences are coming on cpu instead of gpu

* Adding figures from model during training to monitor

* reverting tacotron2 training recipe

* fixing inference on gpu for test sentences on config

* moving helpers and texts within overflows source code

* renaming to overflow

* moving loss to the model file

* Fixing the rename

* Model training but not plotting the test config sentences' audio

* Formatting logs

* Changing model name to camelcase

* Fixing test log

* Fixing plotting bug

* Adding some tests

* Adding more tests to overflow

* Adding all tests for overflow

* making changes to camel case in config

* Adding information about parameters and docstring

* removing compute_mel_statistics moved statistic computation to the model instead

* Added overflow in readme

* Adding more test cases, now it doesn't save transition_p as a tensor and can be dumped as json
2022-12-12 12:44:15 +01:00
align_tts d-vector handling (#1945) 2022-09-13 14:10:33 +02:00
fast_pitch d-vector handling (#1945) 2022-09-13 14:10:33 +02:00
fast_speech d-vector handling (#1945) 2022-09-13 14:10:33 +02:00
glow_tts d-vector handling (#1945) 2022-09-13 14:10:33 +02:00
hifigan Make style (#1405) 2022-03-16 12:13:55 +01:00
multiband_melgan Make style (#1405) 2022-03-16 12:13:55 +01:00
overflow Adding OverFlow (#2183) 2022-12-12 12:44:15 +01:00
speedy_speech d-vector handling (#1945) 2022-09-13 14:10:33 +02:00
tacotron2-Capacitron d-vector handling (#1945) 2022-09-13 14:10:33 +02:00
tacotron2-DCA d-vector handling (#1945) 2022-09-13 14:10:33 +02:00
tacotron2-DDC d-vector handling (#1945) 2022-09-13 14:10:33 +02:00
univnet Make style (#1405) 2022-03-16 12:13:55 +01:00
vits_tts d-vector handling (#1945) 2022-09-13 14:10:33 +02:00
wavegrad Make style 2022-02-25 11:26:59 +01:00
wavernn Make style 2022-02-25 11:26:59 +01:00
README.md Create LJSpeech recipes for all the models 2021-06-22 16:21:11 +02:00
download_ljspeech.sh Update ljspeech download 2022-02-25 11:12:44 +01:00

README.md

🐸💬 TTS LJSpeech Recipes

To run the recipes:

  1. Download the LJSpeech dataset, either manually from its official website or using download_ljspeech.sh.

  2. Go to your desired model folder and run the training.

    Running Python files. (Set CUDA_VISIBLE_DEVICES to the ID of the GPU you want to use for the run.)

    CUDA_VISIBLE_DEVICES="0" python train_modelX.py
    

    Running bash scripts.

    bash run.sh
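
The two launch styles above can be wrapped in one small helper. This is only a sketch: `run_recipe` and the `DRY_RUN` variable are hypothetical names introduced for illustration and are not part of the repository. It assumes you are already inside the model folder and that the script name ends in `.py` or `.sh`.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of the repository): launch a recipe script
# on one GPU by setting CUDA_VISIBLE_DEVICES for that process only.
run_recipe() {
    local gpu_id="$1"   # GPU index, e.g. 0
    local script="$2"   # e.g. train_modelX.py or run.sh in the current folder
    local launcher

    # Pick the interpreter from the file extension.
    case "$script" in
        *.py) launcher="python" ;;
        *.sh) launcher="bash" ;;
        *) echo "unsupported script: $script" >&2; return 1 ;;
    esac

    if [ "${DRY_RUN:-0}" = "1" ]; then
        # Print the command instead of running it.
        echo "CUDA_VISIBLE_DEVICES=$gpu_id $launcher $script"
    else
        CUDA_VISIBLE_DEVICES="$gpu_id" "$launcher" "$script"
    fi
}
```

For example, `DRY_RUN=1 run_recipe 0 train_modelX.py` prints the command it would run, and `run_recipe 0 train_modelX.py` runs it. Setting the variable per command, rather than exporting it, keeps the GPU choice from leaking into later commands in the same shell.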
    

💡 Note that these recipes are just templates to help you start training your first model. They are not tuned for the best results. Double-check the configurations, and feel free to share your experiments so we can find better parameters together 💪.