Commit Graph

201 Commits

Author SHA1 Message Date
WeberJulian 868cf6424f Fix small issues 2021-12-09 13:36:48 +00:00
WeberJulian e04577575e Fix use_speaker_embedding logic 2021-12-09 13:36:48 +00:00
WeberJulian 61251bd86c Fix phonemes 2021-12-09 13:36:48 +00:00
WeberJulian 215a74b32e fix linter 2021-12-09 13:36:48 +00:00
WeberJulian 3e9ca4b95d make style 2021-12-09 13:36:48 +00:00
WeberJulian 6ed55ba57e fix test vits 2021-12-09 13:35:12 +00:00
Julian WEBER 3440c54bbe get_aux_input 2021-12-09 13:34:33 +00:00
Edresson 3ac428340d Add audio resample in the speaker consistency loss 2021-12-09 13:32:25 +00:00
Edresson 39aff6685e Add freeze vocoder generator and flow-based decoder option 2021-12-09 13:31:04 +00:00
WeberJulian de41165af4 freeze vits parts 2021-12-09 13:31:04 +00:00
WeberJulian 9d2c445e3d get_speaker_weighted_sampler 2021-12-09 13:31:04 +00:00
Edresson 56480360cf Update the VITS model docs 2021-12-09 13:29:58 +00:00
Edresson cd7639ca70 Add voice conversion fine tuning mode 2021-12-09 13:29:58 +00:00
Edresson 3cd889a9d4 Add support to use the speaker encoder as loss function in VITS model 2021-12-09 13:29:58 +00:00
Edresson 256197b6aa Fix the optimizer parameters bug in multilingual and multispeaker training 2021-12-09 13:27:21 +00:00
Edresson f4abb19515 Fix bug after merge 2021-12-09 13:26:33 +00:00
Edresson d7042ecfd8 Fix d-vector multispeaker training bug 2021-12-09 13:26:33 +00:00
Edresson 82611cfcd3 Fix unit tests 2021-12-09 13:18:36 +00:00
Edresson cfa9910f9d Fix pylint issues 2021-12-09 13:16:32 +00:00
Edresson 9071bf326f Implement vocoder Fine Tuning like SC-GlowTTS paper 2021-12-09 13:16:32 +00:00
Edresson d653227e59 Add voice conversion support for the model VITS trained with external speaker embedding 2021-12-09 13:11:06 +00:00
Edresson 56b548835d Fix bug in VITS multilingual inference 2021-12-09 13:11:06 +00:00
Edresson 240356cd53 Fix bugs in the non-multilingual VITS inference 2021-12-09 13:11:06 +00:00
Edresson 32ece5d5ad Fix pylint issues 2021-12-09 13:11:06 +00:00
Edresson 8e83a212fa Add multilingual inference support 2021-12-09 13:10:09 +00:00
Edresson d0e3647db6 Add multilingual training support to the VITS model 2021-12-09 13:07:00 +00:00
Edresson 829ee55b04 Implement multilingual dataloader support 2021-12-09 12:50:03 +00:00
Edresson c9f5838bb4 Fix pylint issues 2021-12-09 12:38:58 +00:00
Edresson 1efcccd5c9 Implement training support with d_vecs in the VITS model 2021-12-09 12:37:37 +00:00
Edresson 234a4aacb3 Select randomly a speaker from the speaker manager for the test sentences 2021-12-09 12:32:14 +00:00
Edresson 8310d19da8 Save speakers embeddings/ids before starting training 2021-12-09 12:23:02 +00:00
Eren Gölge 2ed9e3c241 Fix constant use of noise augment 2021-11-08 09:20:34 +01:00
Eren Gölge 2df0752e73 Model zoo tests (#900): Fix VITS model multi-speaker init; Remove gdrive support in model manager; Add model zoo tests 2021-10-29 17:54:16 +02:00
Eren Gölge 00becf2671 Fix import statements 2021-10-25 19:29:16 +02:00
Eren Gölge 2b7d159383 Update BaseTTS for multi-speaker training 2021-10-21 16:29:06 +00:00
Eren Gölge 82fed4add2 Make style 2021-10-21 16:05:51 +00:00
Eren Gölge cea8e1739b Update AlignTTS to use SpeakerManager 2021-10-20 18:22:41 +00:00
Eren Gölge 0e768dd4c5 Update comments 2021-10-20 18:21:26 +00:00
Eren Gölge 7c2cb7cc30 Update BaseTTS 2021-10-20 18:18:22 +00:00
Eren Gölge 330ee7d208 Comment BaseTacotron and remove unused funcs 2021-10-20 18:17:25 +00:00
Eren Gölge aa25f70b95 Update ForwardTTS for multi-speaker 2021-10-20 18:16:41 +00:00
Eren Gölge 0ebc2a400e Implement `_set_speaker_embedding` in GlowTTS 2021-10-20 18:15:20 +00:00
Eren Gölge 3da79a4de4 Comment Tacotron2 model 2021-10-20 18:14:04 +00:00
Eren Gölge c514351c0e Refactor multi-speaker init in BaseTTS-Tacotron1-2 2021-10-18 08:55:45 +00:00
Eren Gölge 127571423c Update multi-speaker init in BaseTTS 2021-10-18 08:54:41 +00:00
Eren Gölge a0a5d580e9 Approximate audio length from file size 2021-10-18 08:54:02 +00:00
Eren Gölge fcbfc53cb7 Fix linter 2021-10-15 10:24:19 +00:00
Eren Gölge 073a2d2eb0 Refactor VITS multi-speaker initialization 2021-10-15 10:20:00 +00:00
Eren Gölge 0565457faa Fix #846 2021-10-14 14:46:14 +00:00
Eren Gölge 4dbe7ed0de Fix all-zero duration case for GlowTTS 2021-10-01 09:24:26 +00:00