Commit Graph

3750 Commits

Author SHA1 Message Date
Edresson 7ef3ddc6ff Fix unit tests 2021-12-20 11:54:09 +00:00
Edresson 36dcd11453 Fix pylint issues 2021-12-20 11:54:09 +00:00
Edresson c53693c155 Implement vocoder Fine Tuning like SC-GlowTTS paper 2021-12-20 11:54:09 +00:00
Edresson f1f016314e Fix the bug in M-AILABS formatter 2021-12-20 11:54:09 +00:00
Edresson c334d39acc Add voice conversion support for the model VITS trained with external speaker embedding 2021-12-20 11:54:09 +00:00
Edresson e997889ba8 Fix bug in VITS multilingual inference 2021-12-20 11:54:09 +00:00
Edresson 7c0b8ec572 Fix bugs in the non-multilingual VITS inference 2021-12-20 11:54:09 +00:00
Edresson 3fbbebd74d Fix pylint issues 2021-12-20 11:54:09 +00:00
Edresson ac9416fb86 Add multilingual inference support 2021-12-20 11:54:09 +00:00
Edresson dcb2374bc9 Add multilingual training support to the VITS model 2021-12-20 11:54:09 +00:00
Edresson f996afedb0 Implement multilingual dataloader support 2021-12-20 11:54:09 +00:00
Edresson 5f1c18187f Fix pylint issues 2021-12-20 11:54:09 +00:00
Edresson d91c595c5a Implement training support with d_vecs in the VITS model 2021-12-20 11:54:09 +00:00
Edresson 6a7db67a91 Allow ignore speakers for all multispeaker datasets 2021-12-20 11:54:09 +00:00
Edresson e0ad838066 Select randomly a speaker from the speaker manager for the test setences 2021-12-20 11:54:09 +00:00
Edresson eb3e8affe1 Save speakers embeddings/ids before starting training 2021-12-20 11:54:09 +00:00
Eren Gölge 37803467aa Merge pull request #1021 from loganhart420/dataset_downloaders: Add addtional datasets 2021-12-20 10:42:20 +01:00
Eren Gölge 6b8ed18dcc Merge pull request #1022 from coqui-ai/papercuts: Fix some setup papercuts 2021-12-20 10:40:57 +01:00
Reuben Morais 0635e8b931 Try to speed up CI install times 2021-12-17 14:01:25 +01:00
Reuben Morais 0057494a2a Remove broken link 2021-12-17 11:37:19 +01:00
Reuben Morais 859ac1a54c Include usage instructions in README 2021-12-17 11:37:19 +01:00
Reuben Morais 78100851f2 Upload sdist and wheel to PyPI 2021-12-17 11:37:19 +01:00
Eren Gölge 223ffe2954 Add torchaudio to requirements.txt 2021-12-16 14:57:54 +00:00
Eren Gölge 95ca2ef773 Implement init_speaker_encoder and change arg names 2021-12-16 14:57:24 +00:00
Eren Gölge 1ddf245b08 Use speaker_encoder from speaker manager in Vits 2021-12-16 14:56:34 +00:00
Eren Gölge 08a1cf3dcb Change speaker_idx to speaker_name 2021-12-16 14:55:43 +00:00
Eren Gölge abedfd586d Add functions to get and check and argument in config and config.model_args 2021-12-16 14:53:57 +00:00
loganhart420 103c010eca Add addtional datasets 2021-12-16 07:21:27 -05:00
Eren Gölge 9ec6238f4a Fixup 2021-12-13 16:30:15 +00:00
Eren Gölge 6274d5e438 Fixup 2021-12-13 16:29:19 +00:00
Eren Gölge 6d7199d559 Rename setup_model to setup_speaker_encoder_model 2021-12-13 16:28:54 +00:00
Eren Gölge f73573c215 Fix synthesizer reading `use_language_embedding` 2021-12-13 16:27:36 +00:00
Eren Gölge 4b06e3e232 Use torchaudio for ResNet speaker encoder 2021-12-13 16:23:57 +00:00
Eren Gölge 84bbe02839 Add 👑YourTTS docs 2021-12-10 09:12:03 +00:00
Eren Gölge bbea9b3f9f Remove redundant code 2021-12-10 07:53:19 +00:00
Eren Gölge 66b6e9bc99 Make style 2021-12-10 07:53:10 +00:00
WeberJulian b4bb0ace70 Fix zoo tests 2021-12-09 13:43:08 +00:00
WeberJulian 846b4a14c6 Add recipe for multi-lingual VITS 2021-12-09 13:43:08 +00:00
WeberJulian 4706583452 Add support for multi-lingual models in CLI 2021-12-09 13:43:08 +00:00
WeberJulian 7b81c16434 Fix seed in test_samplers to avoid random fails 2021-12-09 13:42:42 +00:00
WeberJulian 22c7be5f44 Add test for language_weighted_sampler 2021-12-09 13:42:42 +00:00
WeberJulian 3f3505c1ca Prevent weighted sampler use when num_gpus > 1 2021-12-09 13:42:42 +00:00
WeberJulian 0f64d45e04 Revert init multispeaker change 2021-12-09 13:42:42 +00:00
WeberJulian 4001322e50 Fix trailing space 2021-12-09 13:42:42 +00:00
WeberJulian 352b4be104 Move multilingual logic out of the trainer 2021-12-09 13:42:42 +00:00
Edresson b909a3b63e Add Docstring for TorchSTFT 2021-12-09 13:41:56 +00:00
Edresson d79531c177 Add remove silence vad script Unit test 2021-12-09 13:41:56 +00:00
Edresson c382a3f6a9 Add find unique phonemes unit tests 2021-12-09 13:41:56 +00:00
Edresson be8f444636 Add the SCL resample TODO 2021-12-09 13:41:56 +00:00
WeberJulian 4a8c344fc8 Make a multilingual test use chars 2021-12-09 13:41:56 +00:00