Edresson
3cd889a9d4
Add support to use the speaker encoder as loss function in VITS model
2021-12-09 13:29:58 +00:00
Edresson
a3901032f4
Add H/ASP original checkpoint support
2021-12-09 13:28:16 +00:00
Edresson
256197b6aa
Fix the optimizer parameters bug in multilingual and multispeaker training
2021-12-09 13:27:21 +00:00
Edresson
f4abb19515
Fix bug after merge
2021-12-09 13:26:33 +00:00
Edresson
d7042ecfd8
Fix d-vector multispeaker training bug
2021-12-09 13:26:33 +00:00
Edresson
82611cfcd3
Fix unit tests
2021-12-09 13:18:36 +00:00
Edresson
cfa9910f9d
Fix pylint issues
2021-12-09 13:16:32 +00:00
Edresson
9071bf326f
Implement vocoder Fine Tuning like SC-GlowTTS paper
2021-12-09 13:16:32 +00:00
Edresson
3df5d9a619
Fix the bug in M-AILABS formatter
2021-12-09 13:11:06 +00:00
Edresson
d653227e59
Add voice conversion support for the model VITS trained with external speaker embedding
2021-12-09 13:11:06 +00:00
Edresson
56b548835d
Fix bug in VITS multilingual inference
2021-12-09 13:11:06 +00:00
Edresson
240356cd53
Fix bugs in the non-multilingual VITS inference
2021-12-09 13:11:06 +00:00
Edresson
32ece5d5ad
Fix pylint issues
2021-12-09 13:11:06 +00:00
Edresson
8e83a212fa
Add multilingual inference support
2021-12-09 13:10:09 +00:00
Edresson
d0e3647db6
Add multilingual training support to the VITS model
2021-12-09 13:07:00 +00:00
Edresson
829ee55b04
Implement multilingual dataloader support
2021-12-09 12:50:03 +00:00
Edresson
c9f5838bb4
Fix pylint issues
2021-12-09 12:38:58 +00:00
Edresson
1efcccd5c9
Implement training support with d_vecs in the VITS model
2021-12-09 12:37:37 +00:00
Edresson
c9c1960040
Allow ignore speakers for all multispeaker datasets
2021-12-09 12:35:12 +00:00
Edresson
234a4aacb3
Select randomly a speaker from the speaker manager for the test setences
2021-12-09 12:32:14 +00:00
Edresson
8310d19da8
Save speakers embeddings/ids before starting training
2021-12-09 12:23:02 +00:00
Eren Gölge
babdd84f91
Fix GST inference
...
commit d3e477875a7e46a101fcf95a1794442823750fe2
Author: George Rousssos <25833833+george-roussos@users.noreply.github.com>
Date: Wed Nov 3 10:16:12 2021 +0000
Read .wav for GST conditioning from CL
commit 074e6d0874d3b34fb6a4991fc17d66dccd413fbb
Author: George Rousssos <25833833+george-roussos@users.noreply.github.com>
Date: Fri Oct 29 14:43:47 2021 +0100
Fix GST during inference in Tacotron2
commit fdece14585ab5a36eed1061a9a838d8e48aa6882
Author: George Rousssos <25833833+george-roussos@users.noreply.github.com>
Date: Wed Nov 3 10:16:12 2021 +0000
Read .wav for GST conditioning from CL
commit cd29e21b8d0a541ee298d2bf5f67223ad60be38f
Author: George Rousssos <25833833+george-roussos@users.noreply.github.com>
Date: Fri Oct 29 14:43:47 2021 +0100
Fix GST during inference in Tacotron2
commit 908ce39370eadcc9fa8510cdb26c9ead87305427
Author: George Rousssos <25833833+george-roussos@users.noreply.github.com>
Date: Fri Oct 29 12:49:37 2021 +0100
Make trim_db value negative
commit 1008a2e0f72fa7ca7f0307424f570386f2f16d42
Author: George Rousssos <25833833+george-roussos@users.noreply.github.com>
Date: Fri Oct 29 12:22:24 2021 +0100
Set find_endpoint db threshold in config.json
2021-12-07 13:28:49 +00:00
Eren Gölge
2ed9e3c241
Fix constant use of noise augment
2021-11-08 09:20:34 +01:00
Eren Gölge
b6b14a76af
Fix VITS stochastic duration predictor
2021-11-08 09:20:11 +01:00
Eren Gölge
faafea4cf2
Fix style
2021-11-04 17:04:40 +01:00
Eren Gölge
c5077c6c3f
Merge branch 'dev' of https://github.com/coqui-ai/TTS into dev
2021-11-01 16:42:27 +01:00
Eren Gölge
20cebde1c9
Add docstring to MAI labs formatter
2021-11-01 16:41:55 +01:00
Eren Gölge
608f437545
Add a function to find unique chars
2021-11-01 16:41:33 +01:00
Eren Gölge
d6d780e758
Fix FastSpeech config
2021-11-01 16:41:15 +01:00
Michael Hansen
3bc043faeb
Upgrade to gruut 2.0 ( #882 )
2021-10-31 11:41:55 +01:00
Eren Gölge
2df0752e73
Model zoo tests ( #900 )
...
* Fix VITS model multi-speaker init
* Remove gdrive support in model manager
* Add model zoo tests
2021-10-29 17:54:16 +02:00
Eren Gölge
035ed432bc
Doc update ( #889 )
...
* Link source files from the docs
* Update glowTTS recipes for docs
* Add dataset downloaders
2021-10-26 17:41:33 +02:00
Eren Gölge
0cac3f330a
Enable custom formatter in load_tts_samples
2021-10-26 13:07:11 +02:00
Eren Gölge
00becf2671
Fix import statements
2021-10-25 19:29:16 +02:00
Eren Gölge
2b7d159383
Update BaseTTS for multi-speaker training
2021-10-21 16:29:06 +00:00
Eren Gölge
e62d3c5cf7
Use absolute imports for tts configs and models
2021-10-21 16:29:06 +00:00
Eren Gölge
82fed4add2
Make style
2021-10-21 16:05:51 +00:00
Eren Gölge
3cb07fb6b5
Fix SpeakerManager init with data items
2021-10-21 13:54:39 +00:00
Eren Gölge
aea90e2501
Comment synthesis.py
2021-10-21 13:53:45 +00:00
Eren Gölge
3ab009ca8d
Edit model configs for multi-speaker
2021-10-21 13:51:37 +00:00
Eren Gölge
cea8e1739b
Update AlignTTS to use SpeakerManager
2021-10-20 18:22:41 +00:00
Eren Gölge
0e768dd4c5
Update comments
2021-10-20 18:21:26 +00:00
Eren Gölge
7c2cb7cc30
Update BaseTTS
2021-10-20 18:18:22 +00:00
Eren Gölge
330ee7d208
Comment BaseTacotron and remove unused funcs
2021-10-20 18:17:25 +00:00
Eren Gölge
aa25f70b95
Update ForwardTTS for multi-speaker
2021-10-20 18:16:41 +00:00
Eren Gölge
0ebc2a400e
Implement `_set_speaker_embedding` in GlowTTS
2021-10-20 18:15:20 +00:00
Eren Gölge
3da79a4de4
Comment Tacotron2 model
2021-10-20 18:14:04 +00:00
Eren Gölge
c514351c0e
Refactor multi-speaker init in BaseTTS-Tacotron1-2
2021-10-18 08:55:45 +00:00
Eren Gölge
127571423c
Update multi-speaker init in BaseTTS
2021-10-18 08:54:41 +00:00
Eren Gölge
a0a5d580e9
Approximate audio length from file size
2021-10-18 08:54:02 +00:00