coqui-tts/TTS/tts/layers/vits
Edresson Casanova 8d228ab22a
Trick to Upsampling to High sampling rates using VITS model (#1456)
* Add upsample VITS support

* Fix the bug in inference

* Fix lint checks

* Add RMS based norm in save_wav method

* Style fix

* Add the period for VITS multi-period discriminator in model_args

* Bug fix in speaker encoder load in inference time

* Add unit tests

* Remove useless detach_z_vocoder parameter

* Add docs for VITS upsampling

* Fix the docs

* Rename TTS_part_sample_rate to encoder_sample_rate

* Add upsampling_init and upsampling_z methods

* Add asserts for encoder_sample_rate part

* Move upsampling tests to test_vits.py
2022-04-26 11:47:46 +02:00
..
discriminator.py Trick to Upsampling to High sampling rates using VITS model (#1456) 2022-04-26 11:47:46 +02:00
networks.py Extend unittests 2022-02-25 11:25:00 +01:00
stochastic_duration_predictor.py Update TTS.tts formatters (#1228) 2022-02-11 23:03:43 +01:00
transforms.py Implement VITS model 🚀 2021-08-09 18:02:36 +00:00