Commit Graph

596 Commits

Author SHA1 Message Date
Eren Gölge 36cef5966b Fix resnet speaker encoder 2021-12-30 15:36:35 +00:00
Eren Gölge 348b5c96a2 Fix speaker encoder test 2021-12-30 15:36:35 +00:00
Eren Gölge 7129b04d46 Update VITS model 2021-12-30 14:08:17 +00:00
Eren Gölge 5c5ddd2ba7 Init speaker manager for speaker encoder 2021-12-22 15:51:53 +00:00
Eren Gölge a25269d897 Remove commented code 2021-12-20 11:54:10 +00:00
Eren Gölge d29c3780d1 Use speaker_encoder from speaker manager in Vits 2021-12-20 11:54:10 +00:00
Eren Gölge 79de38ca76 Rename setup_model to setup_speaker_encoder_model 2021-12-20 11:54:10 +00:00
Eren Gölge 649dc9e9da Remove redundant code 2021-12-20 11:54:10 +00:00
Eren Gölge 704dddcffa Make style 2021-12-20 11:54:10 +00:00
WeberJulian a564eb9f54 Add support for multi-lingual models in CLI 2021-12-20 11:54:10 +00:00
WeberJulian 2bbcb558dc Prevent weighted sampler use when num_gpus > 1 2021-12-20 11:54:10 +00:00
WeberJulian 74cedfac38 Revert init multispeaker change 2021-12-20 11:54:10 +00:00
WeberJulian 9cfbacc622 Fix trailing space 2021-12-20 11:54:10 +00:00
WeberJulian 6b03943526 Move multilingual logic out of the trainer 2021-12-20 11:54:10 +00:00
Edresson 67dda0abe1 Add the SCL resample TODO 2021-12-20 11:54:10 +00:00
WeberJulian 8b52fb89d1 Fix merge bug 2021-12-20 11:54:10 +00:00
WeberJulian 09eda31a3f Fix tests 2021-12-20 11:54:10 +00:00
Edresson 78a23e19df Fix pylint checks 2021-12-20 11:54:10 +00:00
WeberJulian 4cd0e4eb0d Remove self.audio_config from VITS 2021-12-20 11:54:10 +00:00
Edresson d39200e69b Remove torchaudio requirement 2021-12-20 11:54:10 +00:00
WeberJulian 2e516869a1 Fix trailing whitespace 2021-12-20 11:54:10 +00:00
WeberJulian ffc269eaf4 Update docstring 2021-12-20 11:54:10 +00:00
Edresson 12968532fe Add the language embedding dim in the duration predictor class 2021-12-20 11:54:10 +00:00
Edresson 90eac13bb2 Rename ununsed_speakers to ignored_speakers 2021-12-20 11:54:10 +00:00
Edresson f34596d957 Fix function name 2021-12-20 11:54:10 +00:00
Edresson 45d0b04179 Lint fixes 2021-12-20 11:54:10 +00:00
Edresson b769b49e34 Remove the data from the set_d_vectors_from_file function 2021-12-20 11:54:10 +00:00
Edresson 9daa33d1fd Remove unusable speaker manager function 2021-12-20 11:54:10 +00:00
Edresson 8c22d5ac49 Make the VITS loss function clearer 2021-12-20 11:54:10 +00:00
Edresson 6fc3b9e679 Remove the unusable fine-tuning model 2021-12-20 11:54:10 +00:00
WeberJulian 631addf33b fix d-vector 2021-12-20 11:54:10 +00:00
WeberJulian da6c1e858c Fix small issues 2021-12-20 11:54:10 +00:00
WeberJulian e8af6a9f08 Fix use_speaker_embedding logic 2021-12-20 11:54:10 +00:00
WeberJulian 120332d53f Fix phonemes 2021-12-20 11:54:10 +00:00
WeberJulian 1340938159 fix phonemes per language 2021-12-20 11:54:10 +00:00
WeberJulian e995a63bd6 fix linter 2021-12-20 11:54:10 +00:00
WeberJulian 1472b6df49 make style 2021-12-20 11:54:10 +00:00
WeberJulian 4d721bcabd fix test sentence synthesis 2021-12-20 11:54:10 +00:00
WeberJulian 0804806727 fix f0_cache_path in dataset 2021-12-20 11:54:10 +00:00
WeberJulian 3b5592abcf fix test vits 2021-12-20 11:54:10 +00:00
WeberJulian 2a2b5767c2 fix collate_fn 2021-12-20 11:54:10 +00:00
Julian WEBER 78c2d12a91 PitchExtractor 2021-12-20 11:54:10 +00:00
Julian WEBER 9a2f91327c get_aux_input 2021-12-20 11:54:10 +00:00
Julian WEBER b3abd01793 Merge dataset 2021-12-20 11:54:10 +00:00
Edresson 1bd1a0546b Add audio resample in the speaker consistency loss 2021-12-20 11:54:10 +00:00
Edresson 1c6bcda950 Add freeze vocoder generator and flow-based decoder option 2021-12-20 11:54:10 +00:00
WeberJulian 2b952d8b97 freeze vits parts 2021-12-20 11:54:10 +00:00
WeberJulian 005bba60b0 get_speaker_weighted_sampler 2021-12-20 11:54:10 +00:00
Edresson 9de4539422 Update the VITS model docs 2021-12-20 11:54:10 +00:00
Edresson eeb8ac07d9 Add voice conversion fine tuning mode 2021-12-20 11:54:10 +00:00