Edresson Casanova
569decba64
Add Pitch Predictor conditioned on enc
2022-06-17 17:39:17 -03:00
Edresson Casanova
92e7391a5d
Add speaker embedding on prosody encoder
2022-06-16 19:06:48 +00:00
Edresson Casanova
856e185641
Add Resnet prosody encoder support
2022-06-13 13:47:22 +00:00
Edresson Casanova
4b59f07946
Support the use of speaker embedding as emotion embedding
2022-06-08 09:52:39 -03:00
Edresson Casanova
ae55bdae6c
Fix Lint checks
2022-06-08 09:52:38 -03:00
Edresson Casanova
fd1036f4ba
Add Noise scale predictor
2022-06-08 09:52:38 -03:00
Edresson Casanova
e07fcc7a8c
Add text encoder adversarial loss on the VITS
2022-06-08 09:52:38 -03:00
Edresson Casanova
2cac18c7b7
Add VAE prosody encoder
2022-06-08 09:52:37 -03:00
Edresson Casanova
44ec2ab387
Add prosody encoder training support
2022-06-08 09:52:36 -03:00
Edresson Casanova
6126e5e588
Add emotion embedding in the encoder
2022-06-08 09:52:36 -03:00
Edresson Casanova
8d228ab22a
Trick to Upsampling to High sampling rates using VITS model ( #1456 )
...
* Add upsample VITS support
* Fix the bug in inference
* Fix lint checks
* Add RMS based norm in save_wav method
* Style fix
* Add the period for VITS multi-period discriminator in model_args
* Bug fix in speaker encoder load in inference time
* Add unit tests
* Remove useless detach_z_vocoder parameter
* Add docs for VITS upsampling
* Fix the docs
* Rename TTS_part_sample_rate to encoder_sample_rate
* Add upsampling_init and upsampling_z methods
* Add asserts for encoder_sample_rate part
* Move upsampling tests to test_vits.py
2022-04-26 11:47:46 +02:00
Eren Gölge
146fbfd7c9
Extend unittests
2022-02-25 11:25:00 +01:00
Eren Gölge
127118c637
Update TTS.tts formatters ( #1228 )
...
* Return Dict from tts formatters
* Make style
2022-02-11 23:03:43 +01:00
Edresson
12968532fe
Add the language embedding dim in the duration predictor class
2021-12-20 11:54:10 +00:00
WeberJulian
1472b6df49
make style
2021-12-20 11:54:10 +00:00
Edresson
dcb2374bc9
Add multilingual training support to the VITS model
2021-12-20 11:54:09 +00:00
Eren Gölge
b6b14a76af
Fix VITS stochastic duration predictor
2021-11-08 09:20:11 +01:00
Eren Gölge
4761853c5c
Fix imports
2021-09-08 13:34:40 +00:00
Eren Gölge
49e1181ea4
Fixes for the vits model
2021-08-26 17:15:09 +00:00
Eren Gölge
3ab8cef99e
Fix VITS model SPD
2021-08-18 14:55:46 +00:00
Eren Gölge
c312acac7d
Implement VITS model 🚀
...
VITS model implementation built on Glow TTS and HiFiGAN
layers.
2021-08-09 18:02:36 +00:00