WeberJulian
868cf6424f
Fix small issues
2021-12-09 13:36:48 +00:00
WeberJulian
e04577575e
Fix use_speaker_embedding logic
2021-12-09 13:36:48 +00:00
WeberJulian
5f40e96010
Fix continue path
2021-12-09 13:36:48 +00:00
WeberJulian
61251bd86c
Fix phonemes
2021-12-09 13:36:48 +00:00
WeberJulian
b1df118b81
fix imports for load_meta_data
2021-12-09 13:36:48 +00:00
WeberJulian
686c7381e2
fix phonemes per language
2021-12-09 13:36:48 +00:00
WeberJulian
215a74b32e
fix linter
2021-12-09 13:36:48 +00:00
WeberJulian
3e9ca4b95d
make style
2021-12-09 13:36:48 +00:00
WeberJulian
88d6399e12
fix test sentence synthesis
2021-12-09 13:35:43 +00:00
WeberJulian
20ac31dc71
fix f0_cache_path in dataset
2021-12-09 13:35:12 +00:00
WeberJulian
6ed55ba57e
fix test vits
2021-12-09 13:35:12 +00:00
WeberJulian
21b49c3acd
fix collate_fn
2021-12-09 13:34:33 +00:00
Julian WEBER
ec83ffbd7a
PitchExtractor
2021-12-09 13:34:33 +00:00
Julian WEBER
3440c54bbe
get_aux_input
2021-12-09 13:34:33 +00:00
Julian WEBER
5c89803968
Merge dataset
2021-12-09 13:33:35 +00:00
Edresson
c80cf67d3d
Add remove silence VAD script
2021-12-09 13:33:05 +00:00
Edresson
3ac428340d
Add audio resample in the speaker consistency loss
2021-12-09 13:32:25 +00:00
Edresson
39aff6685e
Add option to freeze the vocoder generator and flow-based decoder
2021-12-09 13:31:04 +00:00
WeberJulian
de41165af4
freeze vits parts
2021-12-09 13:31:04 +00:00
WeberJulian
9d2c445e3d
get_speaker_weighted_sampler
2021-12-09 13:31:04 +00:00
Edresson
56480360cf
Update the VITS model docs
2021-12-09 13:29:58 +00:00
Edresson
cd7639ca70
Add voice conversion fine tuning mode
2021-12-09 13:29:58 +00:00
Edresson
3cd889a9d4
Add support for using the speaker encoder as a loss function in the VITS model
2021-12-09 13:29:58 +00:00
Edresson
a3901032f4
Add H/ASP original checkpoint support
2021-12-09 13:28:16 +00:00
Edresson
fee01daa09
Add the ValueError in the restore checkpoint exception to avoid problems with the optimizer restoration when new keys are added
2021-12-09 13:27:21 +00:00
Edresson
256197b6aa
Fix the optimizer parameters bug in multilingual and multispeaker training
2021-12-09 13:27:21 +00:00
Edresson
f4abb19515
Fix bug after merge
2021-12-09 13:26:33 +00:00
Edresson
d7042ecfd8
Fix d-vector multispeaker training bug
2021-12-09 13:26:33 +00:00
Edresson
82611cfcd3
Fix unit tests
2021-12-09 13:18:36 +00:00
Edresson
cfa9910f9d
Fix pylint issues
2021-12-09 13:16:32 +00:00
Edresson
9071bf326f
Implement vocoder fine-tuning as in the SC-GlowTTS paper
2021-12-09 13:16:32 +00:00
Edresson
3df5d9a619
Fix the bug in M-AILABS formatter
2021-12-09 13:11:06 +00:00
Edresson
d653227e59
Add voice conversion support for the VITS model trained with external speaker embeddings
2021-12-09 13:11:06 +00:00
Edresson
56b548835d
Fix bug in VITS multilingual inference
2021-12-09 13:11:06 +00:00
Edresson
240356cd53
Fix bugs in the non-multilingual VITS inference
2021-12-09 13:11:06 +00:00
Edresson
32ece5d5ad
Fix pylint issues
2021-12-09 13:11:06 +00:00
Edresson
8e83a212fa
Add multilingual inference support
2021-12-09 13:10:09 +00:00
Edresson
d0e3647db6
Add multilingual training support to the VITS model
2021-12-09 13:07:00 +00:00
Edresson
829ee55b04
Implement multilingual dataloader support
2021-12-09 12:50:03 +00:00
Edresson
c9f5838bb4
Fix pylint issues
2021-12-09 12:38:58 +00:00
Edresson
1efcccd5c9
Implement training support with d_vecs in the VITS model
2021-12-09 12:37:37 +00:00
Edresson
c9c1960040
Allow ignoring speakers for all multispeaker datasets
2021-12-09 12:35:12 +00:00
Edresson
234a4aacb3
Randomly select a speaker from the speaker manager for the test sentences
2021-12-09 12:32:14 +00:00
Edresson
8310d19da8
Save speakers embeddings/ids before starting training
2021-12-09 12:23:02 +00:00
Jörg Thalheim
bce143c738
server: fix compatibility with tts_models/en/ljspeech/fast_pitch (#893)
2021-12-07 14:36:29 +01:00
Eren Gölge
babdd84f91
Fix GST inference
...
commit d3e477875a7e46a101fcf95a1794442823750fe2
Author: George Rousssos <25833833+george-roussos@users.noreply.github.com>
Date: Wed Nov 3 10:16:12 2021 +0000
Read .wav for GST conditioning from CL
commit 074e6d0874d3b34fb6a4991fc17d66dccd413fbb
Author: George Rousssos <25833833+george-roussos@users.noreply.github.com>
Date: Fri Oct 29 14:43:47 2021 +0100
Fix GST during inference in Tacotron2
commit fdece14585ab5a36eed1061a9a838d8e48aa6882
Author: George Rousssos <25833833+george-roussos@users.noreply.github.com>
Date: Wed Nov 3 10:16:12 2021 +0000
Read .wav for GST conditioning from CL
commit cd29e21b8d0a541ee298d2bf5f67223ad60be38f
Author: George Rousssos <25833833+george-roussos@users.noreply.github.com>
Date: Fri Oct 29 14:43:47 2021 +0100
Fix GST during inference in Tacotron2
commit 908ce39370eadcc9fa8510cdb26c9ead87305427
Author: George Rousssos <25833833+george-roussos@users.noreply.github.com>
Date: Fri Oct 29 12:49:37 2021 +0100
Make trim_db value negative
commit 1008a2e0f72fa7ca7f0307424f570386f2f16d42
Author: George Rousssos <25833833+george-roussos@users.noreply.github.com>
Date: Fri Oct 29 12:22:24 2021 +0100
Set find_endpoint db threshold in config.json
2021-12-07 13:28:49 +00:00
Eren Gölge
ce45d9e1af
Make style and lint
2021-12-01 10:42:52 +00:00
Eren Gölge
40cb8ac966
Fix #958
2021-12-01 10:33:34 +00:00
Eren Gölge
512ada7548
Fix callbacks against multi-gpu training
2021-12-01 10:32:14 +00:00
Eren Gölge
2ed9e3c241
Fix constant use of noise augment
2021-11-08 09:20:34 +01:00