Commit Graph

41 Commits

Author SHA1 Message Date
WeberJulian de41165af4 freeze vits parts 2021-12-09 13:31:04 +00:00
Edresson 56480360cf Update the VITS model docs 2021-12-09 13:29:58 +00:00
Edresson cd7639ca70 Add voice conversion fine tuning mode 2021-12-09 13:29:58 +00:00
Edresson 3cd889a9d4 Add support to use the speaker encoder as loss function in VITS model 2021-12-09 13:29:58 +00:00
Edresson 256197b6aa Fix the optimizer parameters bug in multilingual and multispeaker training 2021-12-09 13:27:21 +00:00
Edresson f4abb19515 Fix bug after merge 2021-12-09 13:26:33 +00:00
Edresson 82611cfcd3 Fix unit tests 2021-12-09 13:18:36 +00:00
Edresson cfa9910f9d Fix pylint issues 2021-12-09 13:16:32 +00:00
Edresson 9071bf326f Implement vocoder Fine Tuning like SC-GlowTTS paper 2021-12-09 13:16:32 +00:00
Edresson d653227e59 Add voice conversion support for the model VITS trained with external speaker embedding 2021-12-09 13:11:06 +00:00
Edresson 56b548835d Fix bug in VITS multilingual inference 2021-12-09 13:11:06 +00:00
Edresson 240356cd53 Fix bugs in the non-multilingual VITS inference 2021-12-09 13:11:06 +00:00
Edresson 32ece5d5ad Fix pylint issues 2021-12-09 13:11:06 +00:00
Edresson 8e83a212fa Add multilingual inference support 2021-12-09 13:10:09 +00:00
Edresson d0e3647db6 Add multilingual training support to the VITS model 2021-12-09 13:07:00 +00:00
Edresson c9f5838bb4 Fix pylint issues 2021-12-09 12:38:58 +00:00
Edresson 1efcccd5c9 Implement training support with d_vecs in the VITS model 2021-12-09 12:37:37 +00:00
Edresson 234a4aacb3 Select randomly a speaker from the speaker manager for the test setences 2021-12-09 12:32:14 +00:00
Edresson 8310d19da8 Save speakers embeddings/ids before starting training 2021-12-09 12:23:02 +00:00
Eren Gölge 2df0752e73 Model zoo tests (#900) 2021-10-29 17:54:16 +02:00
    * Fix VITS model multi-speaker init
    * Remove gdrive support in model manager
    * Add model zoo tests
Eren Gölge 00becf2671 Fix import statements 2021-10-25 19:29:16 +02:00
Eren Gölge 82fed4add2 Make style 2021-10-21 16:05:51 +00:00
Eren Gölge 3da79a4de4 Comment Tacotron2 model 2021-10-20 18:14:04 +00:00
Eren Gölge c514351c0e Refactor multi-speaker init in BaseTTS-Tacotron1-2 2021-10-18 08:55:45 +00:00
Eren Gölge fcbfc53cb7 Fix linter 2021-10-15 10:24:19 +00:00
Eren Gölge 073a2d2eb0 Refactor VITS multi-speaker initialization 2021-10-15 10:20:00 +00:00
Eren Gölge 0565457faa Fix #846 2021-10-14 14:46:14 +00:00
Eren Gölge 37959ad0c7 Make linter 2021-09-30 23:02:16 +00:00
Eren Gölge 45889804c2 Update VITS 2021-09-30 14:47:56 +00:00
Eren Gölge 3c16013199 Fix Vits imports 2021-09-10 08:26:34 +00:00
Eren Gölge bfc6ceac29 Move MAS to `TTS.tts.utils.helpers` 2021-09-09 10:57:19 +00:00
Eren Gölge 4761853c5c Fix imports 2021-09-08 13:34:40 +00:00
Eren Gölge c1513ec4cd Plot pitch over spectrogram 2021-09-06 15:16:58 +00:00
Eren Gölge 2b7e55f01f Fix vits args types 2021-08-30 23:24:20 +00:00
Eren Gölge 18da8f5dbd Update pylint 2.10.2 and fix lint issues 2021-08-30 08:10:35 +00:00
Eren Gölge 2620f62ea8 Move duration_loss inside VitsGeneratorLoss 2021-08-27 07:07:07 +00:00
Eren Gölge 49e1181ea4 Fixes for the vits model 2021-08-26 17:15:09 +00:00
Eren Gölge 3ab8cef99e Fix VITS model SPD 2021-08-18 14:55:46 +00:00
Eren Gölge 06018251e6 Add VITS and GlowTTS class docs 🗒️ 2021-08-09 18:02:36 +00:00
Eren Gölge f7a72552f1 Make duration predictor dropout configurable 2021-08-09 18:02:36 +00:00
Eren Gölge c312acac7d Implement VITS model 🚀 2021-08-09 18:02:36 +00:00
    VITS model implementation built on Glow TTS and HiFiGAN layers.