coqui-tts/TTS/tts
Edresson Casanova 8d228ab22a
Trick to Upsampling to High sampling rates using VITS model (#1456)
* Add upsample VITS support

* Fix the bug in inference

* Fix lint checks

* Add RMS based norm in save_wav method

* Style fix

* Add the period for VITS multi-period discriminator in model_args

* Bug fix in speaker encoder load in inference time

* Add unit tests

* Remove useless detach_z_vocoder parameter

* Add docs for VITS upsampling

* Fix the docs

* Rename TTS_part_sample_rate to encoder_sample_rate

* Add upsampling_init and upsampling_z methods

* Add asserts for encoder_sample_rate part

* Move upsampling tests to test_vits.py
2022-04-26 11:47:46 +02:00
configs REBASED: Transform Speaker Encoder in a Generic Encoder and Implement Emotion Encoder training support (#1349) 2022-03-11 14:43:40 +01:00
datasets Fix #1380 (#1409) 2022-03-16 12:38:27 +01:00
layers Trick to Upsampling to High sampling rates using VITS model (#1456) 2022-04-26 11:47:46 +02:00
models Trick to Upsampling to High sampling rates using VITS model (#1456) 2022-04-26 11:47:46 +02:00
utils Add EmbeddingManager and BaseIDManager (#1374) 2022-03-31 13:41:16 +02:00
__init__.py rename the project to old TTS 2020-09-09 12:27:23 +02:00