Commit Graph

3763 Commits

Author SHA1 Message Date
Edresson 9de4539422 Update the VITS model docs 2021-12-20 11:54:10 +00:00
Edresson eeb8ac07d9 Add voice conversion fine tuning mode 2021-12-20 11:54:10 +00:00
WeberJulian 6ac31e4152 Added a notebook for d-vector multilingual VITS 2021-12-20 11:54:09 +00:00
Edresson 690b37d0ab Add support to use the speaker encoder as loss function in VITS model 2021-12-20 11:54:09 +00:00
Edresson 9b011b1cb3 Add H/ASP original checkpoint support 2021-12-20 11:54:09 +00:00
Edresson 0bdfd3cb50 Add the ValueError in the restore checkpoint exception to avoid problems with the optimizer restoration when new keys are added 2021-12-20 11:54:09 +00:00
Edresson bbdb5c38e6 Add VITS multispeaker train unit test 2021-12-20 11:54:09 +00:00
Edresson 92f7f4f400 Activate the multispeaker mode in multilingual training 2021-12-20 11:54:09 +00:00
Edresson de78556655 Fix the optimizer parameters bug in multilingual and multispeaker training 2021-12-20 11:54:09 +00:00
Edresson 9be5b75da3 Fix bug after merge 2021-12-20 11:54:09 +00:00
Edresson 76251b619a Fix d-vector multispeaker training bug 2021-12-20 11:54:09 +00:00
Edresson e68b042493 Add VITS d-vector unit test 2021-12-20 11:54:09 +00:00
Edresson 959cc8f03c Add VITS multilingual unit test 2021-12-20 11:54:09 +00:00
Edresson 7ef3ddc6ff Fix unit tests 2021-12-20 11:54:09 +00:00
Edresson 36dcd11453 Fix pylint issues 2021-12-20 11:54:09 +00:00
Edresson c53693c155 Implement vocoder Fine Tuning like SC-GlowTTS paper 2021-12-20 11:54:09 +00:00
Edresson f1f016314e Fix the bug in M-AILABS formatter 2021-12-20 11:54:09 +00:00
Edresson c334d39acc Add voice conversion support for the model VITS trained with external speaker embedding 2021-12-20 11:54:09 +00:00
Edresson e997889ba8 Fix bug in VITS multilingual inference 2021-12-20 11:54:09 +00:00
Edresson 7c0b8ec572 Fix bugs in the non-multilingual VITS inference 2021-12-20 11:54:09 +00:00
Edresson 3fbbebd74d Fix pylint issues 2021-12-20 11:54:09 +00:00
Edresson ac9416fb86 Add multilingual inference support 2021-12-20 11:54:09 +00:00
Edresson dcb2374bc9 Add multilingual training support to the VITS model 2021-12-20 11:54:09 +00:00
Edresson f996afedb0 Implement multilingual dataloader support 2021-12-20 11:54:09 +00:00
Edresson 5f1c18187f Fix pylint issues 2021-12-20 11:54:09 +00:00
Edresson d91c595c5a Implement training support with d_vecs in the VITS model 2021-12-20 11:54:09 +00:00
Edresson 6a7db67a91 Allow ignoring speakers for all multispeaker datasets 2021-12-20 11:54:09 +00:00
Edresson e0ad838066 Randomly select a speaker from the speaker manager for the test sentences 2021-12-20 11:54:09 +00:00
Edresson eb3e8affe1 Save speakers embeddings/ids before starting training 2021-12-20 11:54:09 +00:00
Eren Gölge 37803467aa Merge pull request #1021 from loganhart420/dataset_downloaders: Add additional datasets 2021-12-20 10:42:20 +01:00
Eren Gölge 6b8ed18dcc Merge pull request #1022 from coqui-ai/papercuts: Fix some setup papercuts 2021-12-20 10:40:57 +01:00
Reuben Morais 0635e8b931 Try to speed up CI install times 2021-12-17 14:01:25 +01:00
Reuben Morais 0057494a2a Remove broken link 2021-12-17 11:37:19 +01:00
Reuben Morais 859ac1a54c Include usage instructions in README 2021-12-17 11:37:19 +01:00
Reuben Morais 78100851f2 Upload sdist and wheel to PyPI 2021-12-17 11:37:19 +01:00
Eren Gölge 223ffe2954 Add torchaudio to requirements.txt 2021-12-16 14:57:54 +00:00
Eren Gölge 95ca2ef773 Implement init_speaker_encoder and change arg names 2021-12-16 14:57:24 +00:00
Eren Gölge 1ddf245b08 Use speaker_encoder from speaker manager in Vits 2021-12-16 14:56:34 +00:00
Eren Gölge 08a1cf3dcb Change speaker_idx to speaker_name 2021-12-16 14:55:43 +00:00
Eren Gölge abedfd586d Add functions to get and check an argument in config and config.model_args 2021-12-16 14:53:57 +00:00
loganhart420 103c010eca Add additional datasets 2021-12-16 07:21:27 -05:00
Eren Gölge 9ec6238f4a Fixup 2021-12-13 16:30:15 +00:00
Eren Gölge 6274d5e438 Fixup 2021-12-13 16:29:19 +00:00
Eren Gölge 6d7199d559 Rename setup_model to setup_speaker_encoder_model 2021-12-13 16:28:54 +00:00
Eren Gölge f73573c215 Fix synthesizer reading `use_language_embedding` 2021-12-13 16:27:36 +00:00
Eren Gölge 4b06e3e232 Use torchaudio for ResNet speaker encoder 2021-12-13 16:23:57 +00:00
Eren Gölge 84bbe02839 Add 👑YourTTS docs 2021-12-10 09:12:03 +00:00
Eren Gölge bbea9b3f9f Remove redundant code 2021-12-10 07:53:19 +00:00
Eren Gölge 66b6e9bc99 Make style 2021-12-10 07:53:10 +00:00
WeberJulian b4bb0ace70 Fix zoo tests 2021-12-09 13:43:08 +00:00