coqui-tts

Commit Graph

Author	SHA1	Message	Date
Edresson Casanova	569decba64	Add Pitch Predictor conditioned on enc	2022-06-17 17:39:17 -03:00
Edresson Casanova	92e7391a5d	Add speaker embedding on prosody encoder	2022-06-16 19:06:48 +00:00
Edresson Casanova	856e185641	Add Resnet prosody encoder support	2022-06-13 13:47:22 +00:00
Edresson Casanova	4b59f07946	Support the use of speaker embedding as emotion embedding	2022-06-08 09:52:39 -03:00
Edresson Casanova	ae55bdae6c	Fix Lint checks	2022-06-08 09:52:38 -03:00
Edresson Casanova	fd1036f4ba	Add Noise scale predictor	2022-06-08 09:52:38 -03:00
Edresson Casanova	e07fcc7a8c	Add text encoder adversarial loss on the VITS	2022-06-08 09:52:38 -03:00
Edresson Casanova	2cac18c7b7	Add VAE prosody encoder	2022-06-08 09:52:37 -03:00
Edresson Casanova	44ec2ab387	Add prosody encoder training support	2022-06-08 09:52:36 -03:00
Edresson Casanova	6126e5e588	Add emotion embedding in the encoder	2022-06-08 09:52:36 -03:00
Edresson Casanova	8d228ab22a	Trick to Upsampling to High sampling rates using VITS model (#1456) * Add upsample VITS support * Fix the bug in inference * Fix lint checks * Add RMS based norm in save_wav method * Style fix * Add the period for VITS multi-period discriminator in model_args * Bug fix in speaker encoder load in inference time * Add unit tests * Remove useless detach_z_vocoder parameter * Add docs for VITS upsampling * Fix the docs * Rename TTS_part_sample_rate to encoder_sample_rate * Add upsampling_init and upsampling_z methods * Add asserts for encoder_sample_rate part * Move upsampling tests to test_vits.py	2022-04-26 11:47:46 +02:00
Eren Gölge	146fbfd7c9	Extend unittests	2022-02-25 11:25:00 +01:00
Eren Gölge	127118c637	Update TTS.tts formatters (#1228) * Return Dict from tts formatters * Make style	2022-02-11 23:03:43 +01:00
Edresson	12968532fe	Add the language embedding dim in the duration predictor class	2021-12-20 11:54:10 +00:00
WeberJulian	1472b6df49	make style	2021-12-20 11:54:10 +00:00
Edresson	dcb2374bc9	Add multilingual training support to the VITS model	2021-12-20 11:54:09 +00:00
Eren Gölge	b6b14a76af	Fix VITS stochastic duration predictor	2021-11-08 09:20:11 +01:00
Eren Gölge	4761853c5c	Fix imports	2021-09-08 13:34:40 +00:00
Eren Gölge	49e1181ea4	Fixes for the vits model	2021-08-26 17:15:09 +00:00
Eren Gölge	3ab8cef99e	Fix VITS model SPD	2021-08-18 14:55:46 +00:00
Eren Gölge	c312acac7d	Implement VITS model 🚀 VITS model implementation built on Glow TTS and HiFiGAN layers.	2021-08-09 18:02:36 +00:00

21 Commits