Commit Graph

3753 Commits

Author SHA1 Message Date
Edresson c382a3f6a9 Add find unique phonemes unit tests 2021-12-09 13:41:56 +00:00
Edresson be8f444636 Add the SCL resample TODO 2021-12-09 13:41:56 +00:00
WeberJulian 4a8c344fc8 Make a multilingual test use chars 2021-12-09 13:41:56 +00:00
WeberJulian eff0a5ca10 Fix merge bug 2021-12-09 13:41:56 +00:00
WeberJulian 9c1bec86a4 Fix tests 2021-12-09 13:41:28 +00:00
Edresson 79f75924de Fix pylint checks 2021-12-09 13:41:28 +00:00
WeberJulian 93dbb67c52 Remove self.audio_config from VITS 2021-12-09 13:41:28 +00:00
Edresson 0359cab4fa Remove torchaudio requeriment 2021-12-09 13:39:13 +00:00
WeberJulian 36ddf32972 Fix trailing whitespace 2021-12-09 13:38:18 +00:00
WeberJulian 9d8d4e6fb3 Update docstring 2021-12-09 13:38:18 +00:00
Edresson 65b5ed4427 Add VITS multilingual d-vectors unit test 2021-12-09 13:38:18 +00:00
Edresson 87059e3bbb Add the language embedding dim in the duration predictor class 2021-12-09 13:38:18 +00:00
Edresson 5782df8ffe Get the number speaker from the Speaker Manager property 2021-12-09 13:38:18 +00:00
Edresson 88f4369c63 Fix the bug in multispeaker vits 2021-12-09 13:38:18 +00:00
Edresson aa1a070d58 Rename ununsed_speakers to ignored_speakers 2021-12-09 13:38:18 +00:00
WeberJulian 346bd931b2 remove inference notebook 2021-12-09 13:37:50 +00:00
Edresson 1251d04387 Fix function name 2021-12-09 13:37:50 +00:00
Edresson 9781e4d516 Lint fixs 2021-12-09 13:37:50 +00:00
Edresson 8ae22725e1 Fix the bug in extract tts spectrograms 2021-12-09 13:37:50 +00:00
Edresson d0cb700e7d Set the new_fields in copy_model_files as None by default 2021-12-09 13:37:50 +00:00
Edresson 5ba416e2e8 Remove the call to get_speaker_manager 2021-12-09 13:37:50 +00:00
Edresson cad82a9296 Remove the data from the set_d_vectors_from_file function 2021-12-09 13:37:50 +00:00
Edresson ec31dacbb7 Remove unusable speaker manager function 2021-12-09 13:37:50 +00:00
Edresson 86b2536491 Turn more clear the VITS loss function 2021-12-09 13:37:50 +00:00
Edresson 5fc127bb7a Remove the unusable fine-tuning model 2021-12-09 13:37:50 +00:00
Edresson 89019d49a2 Create a module for the VAD script 2021-12-09 13:36:48 +00:00
WeberJulian 390096fe0f fix d-vector 2021-12-09 13:36:48 +00:00
WeberJulian 868cf6424f Fix small issues 2021-12-09 13:36:48 +00:00
WeberJulian e04577575e Fix use_speaker_embedding logic 2021-12-09 13:36:48 +00:00
WeberJulian 5f40e96010 Fix continue path 2021-12-09 13:36:48 +00:00
WeberJulian 61251bd86c Fix phonemes 2021-12-09 13:36:48 +00:00
WeberJulian b1df118b81 fix imports for load_meta_data 2021-12-09 13:36:48 +00:00
WeberJulian 686c7381e2 fix phonemes per language 2021-12-09 13:36:48 +00:00
WeberJulian 215a74b32e fix linter 2021-12-09 13:36:48 +00:00
WeberJulian 3e9ca4b95d make style 2021-12-09 13:36:48 +00:00
WeberJulian e1bdeacd2e Add torchaudio in requirements.txt 2021-12-09 13:36:08 +00:00
WeberJulian 88d6399e12 fix test sentence synthesis 2021-12-09 13:35:43 +00:00
WeberJulian 20ac31dc71 fix f0_cache_path in dataset 2021-12-09 13:35:12 +00:00
WeberJulian 6ed55ba57e fix test vits 2021-12-09 13:35:12 +00:00
WeberJulian 21b49c3acd fix collate_fn 2021-12-09 13:34:33 +00:00
Julian WEBER ec83ffbd7a PitchExtractor 2021-12-09 13:34:33 +00:00
Julian WEBER 3440c54bbe get_aux_input 2021-12-09 13:34:33 +00:00
Julian WEBER 5c89803968 Merge dataset 2021-12-09 13:33:35 +00:00
Edresson c80cf67d3d Add remove silence VAD script 2021-12-09 13:33:05 +00:00
Edresson 3ac428340d Add audio resample in the speaker consistency loss 2021-12-09 13:32:25 +00:00
Edresson 39aff6685e Add freeze vocoder generator and flow-based decoder option 2021-12-09 13:31:04 +00:00
WeberJulian de41165af4 freeze vits parts 2021-12-09 13:31:04 +00:00
WeberJulian 9d2c445e3d get_speaker_weighted_sampler 2021-12-09 13:31:04 +00:00
Edresson 56480360cf Update the VITS model docs 2021-12-09 13:29:58 +00:00
Edresson cd7639ca70 Add voice conversion fine tuning mode 2021-12-09 13:29:58 +00:00