Commit Graph

722 Commits

Author SHA1 Message Date
Eren Gölge d42748082a update argument name external_speaker_embedding_dim -> speaker_embedding_dim
add inference_noise_scale argument to glow-tts
2021-04-23 18:04:37 +02:00
Eren Gölge 2da81f5bb6 add load_chekpoint to speaker encoder 2021-04-23 18:04:37 +02:00
Eren Gölge 1229ccbf07 update argument name in server.py 2021-04-23 18:04:37 +02:00
Eren Gölge af2d36faeb update synthesize.py for multi-speaker setting 2021-04-23 18:04:37 +02:00
Eren Gölge 99dc07a7dd add ```unique``` param to keep scglow models compatible (they are duplicate symbols ins the character set) 2021-04-23 18:04:37 +02:00
Eren Gölge c955a12428 set the default layer size compatible with scglow 2021-04-23 18:04:37 +02:00
Eren Gölge 3ace2440fa fix a mistake from rebase 2021-04-23 18:04:37 +02:00
Eren Gölge aadb2106ec code styling 2021-04-23 18:04:37 +02:00
Eren Gölge af7baa3387 refactoring to allow defining the speaker file externally 2021-04-23 18:04:37 +02:00
kirianguiller 7dccbfdcd5 handle multi speaker and gst in Synthetizer class 2021-04-23 18:04:37 +02:00
Edresson d2b6326b8b change optimizer initialization for compatibility with Hifi-GAN official implementation 2021-04-23 07:54:39 -03:00
WeberJulian 4205284f92
Change name of the functions 2021-04-23 10:09:55 +02:00
WeberJulian a26498181b Change back the default value 2021-04-22 16:10:17 +02:00
Julian Weber 355e1f47ab fix dumb mistake 2021-04-22 15:50:29 +02:00
Julian Weber c125b71f36 fix windows support 2021-04-22 15:14:24 +02:00
Jörg Thalheim f5fd7f78d4 server: also listen to ipv6
The [::] address will listen to both ipv4/ipv6 addresses.
2021-04-22 12:38:55 +02:00
Eren Gölge ef37633cb3 [ci skip] use prenet_dropout by default with Tacotron models 2021-04-22 12:38:55 +02:00
Eren Gölge e1d960da9e use SpeakerManager in Synthesizer 2021-04-21 13:13:27 +02:00
Eren Gölge 04b6881b66 add ```unique``` argument to make_symbols to fix the incompat. issue of the
SC-Glow models
2021-04-21 13:12:35 +02:00
Eren Gölge 790946faec formating speakers.py 2021-04-21 13:12:11 +02:00
Eren Gölge ab313814de initial SpeakerManager implementation 2021-04-21 13:11:46 +02:00
Eren Gölge 09890c7421 fix the glow-tts in setup_model 2021-04-21 13:10:40 +02:00
Eren Gölge 8764d02eb2 update argument name external_speaker_embedding_dim -> speaker_embedding_dim
add inference_noise_scale argument to glow-tts
2021-04-21 13:09:44 +02:00
Eren Gölge 8b40720977 add load_chekpoint to speaker encoder 2021-04-21 13:09:04 +02:00
Eren Gölge 37cad38c27 update argument name in server.py 2021-04-21 13:08:45 +02:00
Eren Gölge 9bccee9da8 update synthesize.py for multi-speaker setting 2021-04-21 13:08:25 +02:00
Eren Gölge d2fa8add1f add ```unique``` param to keep scglow models compatible (they are duplicate symbols ins the character set) 2021-04-16 19:40:13 +02:00
Eren Gölge d9612a4351 set the default layer size compatible with scglow 2021-04-16 19:40:13 +02:00
Eren Gölge 1038fd420d fix a mistake from rebase 2021-04-16 19:39:47 +02:00
Eren Gölge 47e356cb48 code styling 2021-04-16 16:01:40 +02:00
Eren Gölge 25328aad00 refactoring to allow defining the speaker file externally 2021-04-16 15:59:57 +02:00
kirianguiller 48ae52a9a3 handle multi speaker and gst in Synthetizer class 2021-04-16 15:54:49 +02:00
Eren Gölge a53958ae3a fix urls for the new models 2021-04-15 17:05:00 +02:00
Eren Gölge 9cc17be53a formatting and a small bug fix in Tacotron model 2021-04-15 16:36:51 +02:00
Eren Gölge 1ad838bc83 add newly released models under .model.json 2021-04-15 16:06:10 +02:00
Eren Gölge 7cada1a949 remove noise 2021-04-15 15:30:45 +02:00
Eren Gölge d60a8d7211 show the real waveform on TB too for GAN vocoder training. 2021-04-15 15:30:06 +02:00
Eren Gölge 5fbe926429 change the default TTS model to TacotronDDC 2021-04-15 15:29:44 +02:00
Eren Gölge 3de5a89154 optionally enable prenet dropout at inference time for tacotron models 2021-04-13 13:24:56 +02:00
Eren Gölge 28a2fed8a3 update hifigan in .model.json 2021-04-12 16:48:05 +02:00
Eren Gölge abaf36861a aligntts model .model.json placeholder 2021-04-12 16:43:52 +02:00
Eren Gölge 480e2f7888 docstring update and better handling make_symbols 2021-04-12 16:40:49 +02:00
Eren Gölge b735076bb4 linter fixes 2021-04-12 13:14:11 +02:00
Eren Gölge b11d1cb845 small fixes 2021-04-12 12:40:55 +02:00
Eren Gölge a7f6045644 Merge branch 'reformat' into hifigan-reformat 2021-04-12 12:00:17 +02:00
Eren Gölge f519012dea reformatting and styling 2021-04-12 11:47:39 +02:00
Eren Gölge 9011dddf77 tacotron DDC placeholder in models.json 2021-04-12 04:06:27 +02:00
Eren Gölge d295d5de97 remove torch.no_grad from TorchSTFT 2021-04-10 19:43:57 +02:00
Eren Gölge 5b70da2e3f restore schedulers only if training is continuing a previous training
inherit nn.Module for TorchSTFT
2021-04-09 19:31:28 +02:00
Eren Gölge 2c71c6d8cd [ci skip]update gan vocoder configs to reflect the recent changes 2021-04-09 17:15:32 +02:00
Eren Gölge 2b529f60c8 update default hifigan config 2021-04-09 11:40:06 +02:00
Eren Gölge 105e0b4d62 vocoder gan training fixes 2021-04-09 11:38:04 +02:00
Eren Gölge 87ee6ceb57 style update #3 2021-04-09 01:17:15 +02:00
Eren Gölge 18d9ec8036 format with black 2021-04-09 00:54:59 +02:00
Eren Gölge e5b9607bc3 isort all imports 2021-04-09 00:45:20 +02:00
Eren Gölge 0e79fa86ad format with black and pylint 2.7.3 2021-04-09 00:38:08 +02:00
Eren Gölge cd69da4868 linter fixes #2 2021-04-08 16:57:46 +02:00
Eren Gölge 4d3e1e9d9a linter fix 2021-04-08 14:57:46 +02:00
Eren Gölge 53f54898bc small fixes 2021-04-08 14:22:47 +02:00
Eren Gölge 006b1d3aaa bug fix 2021-04-08 13:17:45 +02:00
Eren Gölge 3f0993aebe remove junk 2021-04-08 12:17:02 +02:00
Eren Gölge 0ee0458309 remove redundant imports 2021-04-08 11:29:15 +02:00
Eren Gölge 773f1db6fa refactor HifiGAN discriminator 2021-04-08 11:28:30 +02:00
Eren Gölge 15f362d5b1 formatting 2021-04-08 11:28:30 +02:00
Eren Gölge aee24b0704 set different seed in gan_dataset when it is multi-workers 2021-04-08 11:28:30 +02:00
Eren Gölge 6ee211c137 remove stft params causing warning 2021-04-08 11:28:30 +02:00
Eren Gölge 4998ece8d8 allow configuration of optimziers from the config file 2021-04-08 11:28:30 +02:00
Eren Gölge 8daf407652 cache empty 2021-04-08 11:28:30 +02:00
Eren Gölge 3fb78c004a move scheduler updates to the end of the epoch 2021-04-08 11:28:30 +02:00
Eren Gölge 2a872c98aa don't call os.exit as it leaves the process resources standing 2021-04-08 11:27:40 +02:00
Eren Gölge 7cecd2fb2e add hifigan D 2021-04-08 11:27:40 +02:00
Eren Gölge 13dca6e6b6 revert some of Hifigan generator updates 2021-04-08 11:27:40 +02:00
Eren Gölge 02bc776c35 prevenet grad in TorchSTFT 2021-04-08 11:27:40 +02:00
Eren Gölge cf44624df8 more docstring 2021-04-08 11:27:40 +02:00
Eren Gölge d95b1458e8 Linter fixes and docstrings for HiFiGAN 2021-04-08 11:27:40 +02:00
Eren Gölge bd7a1c177b fix #419 2021-04-08 11:26:41 +02:00
Eren Gölge 7726dfca99 change the upper bound in sound normalization 2021-04-08 11:26:01 +02:00
Eren Gölge 57f6bd1afa make using different samples for G and D networks optional 2021-04-08 11:26:01 +02:00
Eren Gölge 67f8248492 placeholder for finetuned sam hifigan model 2021-04-08 11:25:29 +02:00
Eren Gölge 241e968df1 load_checkpoint for hifigan and no_grad for inference 2021-04-08 11:25:29 +02:00
Eren Gölge de3a04f104 some commeting for Generator loss and check if the argument is defines in the config file 2021-04-08 11:25:29 +02:00
Eren Gölge ff07c5f5e3 update TorchSTFT to enable melspec 2021-04-08 11:25:29 +02:00
Eren Gölge 4a5b1d4ac2 update hifigan config 2021-04-08 11:24:21 +02:00
Eren Gölge e0e3b12b26 pass all parameters explicity to _istft 2021-04-08 11:23:20 +02:00
Eren Gölge f0e76ee135 initial models.json entry for universal hifigan 2021-04-08 11:23:20 +02:00
Eren Gölge d57f416957 small fixes 2021-04-08 11:22:30 +02:00
Eren Gölge 8c9e1c9e58 hifigan implementation update 2021-04-08 11:21:43 +02:00
Eren Gölge a14d7bc5db hifigan config update 2021-04-08 11:20:33 +02:00
Eren Gölge 8d4fd79cd7 update hifigan config 2021-04-08 11:20:33 +02:00
rishikksh20 e656e8b108 Remove select size bug 2021-04-08 11:20:33 +02:00
rishikksh20 b533474e3b Remove minor bugs and make code trainable 2021-04-08 11:20:33 +02:00
rishikksh20 ef6ff4e95c Add Exponential LR scheduler check 2021-04-08 11:20:33 +02:00
rishikksh20 1535777f64 1) Add ExponentialLR 2021-04-08 11:18:36 +02:00
rishikksh20 c20a6b1185 * Format the model definition
* Update code and integrate training code
2021-04-08 11:18:36 +02:00
rishikksh20 39b5845810 1) Add hifigan json files
2) Rename MPD disc
3) Re-format remove weight norm generator
2021-04-08 11:14:39 +02:00
rishikksh20 7b7c5d635f 1) Combine MSD with Multi-Period disc
2) Add remove weight norm layer on Generator
2021-04-08 11:14:39 +02:00
rishikksh20 4493feb95c Add HiFi-GAN v1 generator and discriminator classes 2021-04-08 11:14:39 +02:00
Eren Gölge c86c559349 docstring and optional padding in TorchSTFT 2021-04-07 12:36:15 +02:00
Eren Gölge f890454de3 linter fixes 2021-04-07 12:36:03 +02:00
Eren Gölge 9782d9ea5d [ci skip] implement #418 2021-04-06 16:24:50 +02:00
Eren Gölge f46a275b22 update docstring 2 2021-04-06 16:24:50 +02:00
Eren Gölge ec94ff3691 update docstring 2021-04-06 16:24:50 +02:00
Eren Gölge 2048095e9a audio.py fix 2021-04-06 16:24:50 +02:00
Eren Gölge e0b3008c31 allow choosing the log function used for amptodb conversion 2021-04-06 16:24:50 +02:00
Eren Gölge 44b4cb5ba5 DCA comment 2021-04-06 16:24:50 +02:00
Eren Gölge b86e7fb2e8 pad short samples when loading precomputed features in vocoder trainign 2021-04-06 16:24:50 +02:00
Eren Gölge 6ad4eba678 gan vocoder train fix in case of restoring models wiht no scheduler is defined 2021-04-06 16:24:50 +02:00
Eren Gölge e3ccfe37ea add DE more urls 2021-04-02 14:54:41 +02:00
Eren Gölge e84f120a04 sam-accenture model preprocessor 2021-04-01 03:41:41 +02:00
Eren Gölge e3c052382b fix loading always best_model when continue 2021-04-01 03:41:15 +02:00
Eren Gölge 48ea20e69f example aligntts config 2021-03-30 14:41:00 +02:00
Eren Gölge b4c2cf80f2 fix eval iter 2021-03-30 14:39:16 +02:00
Eren Gölge a3a840fd78 linter fixes 2021-03-30 14:39:16 +02:00
Eren Gölge 6b2e13bf62 compute normalized logp using torch primitives 2021-03-30 14:39:16 +02:00
Eren Gölge 7a382a5c2b stowed aligntts commit and small refactoring with feed_forward layers 2021-03-30 14:39:16 +02:00
Eren Gölge d542a50818 fix losses for alignTTS 2021-03-30 14:39:16 +02:00
Eren Gölge 18cc7b95ec update l1 and huber to mse loss 2021-03-30 14:39:16 +02:00
Eren Gölge 896d33ed49 update losses to hande alingtts phases 2021-03-30 14:39:16 +02:00
Eren Gölge aec0b78aff duration predictor fix 2 2021-03-30 14:39:16 +02:00
Eren Gölge 07269e639b fix duration predictor in AlignTTS 2021-03-30 14:39:16 +02:00
Eren Gölge c2d29e5cd4 FFTransformer encoder for aligntts 2021-03-30 14:39:16 +02:00
Eren Gölge 460a2d3e26 FFTransformer Decoder for AlignTTS 2021-03-30 14:39:16 +02:00
Eren Gölge 844e8e0ed4 adapt align_tts and model name handling 2021-03-30 14:39:16 +02:00
Eren Gölge aa29f5b199 aligntts loss 2021-03-30 14:39:16 +02:00
Eren Gölge a831468cab align tts MDN layer 2021-03-30 14:39:16 +02:00
Eren Gölge 4396f8e2da continue refactoring 2021-03-30 14:39:16 +02:00
Eren Gölge 892c3c3623 use torch for AngleProtoLoss 2021-03-30 14:39:16 +02:00
Eren Gölge 2b3e12ea49 correct imports after refactoring, add AlignTTS (old SSMAS) and some formatting 2021-03-30 14:39:16 +02:00
Eren Gölge ecb6b0d6ad rename GlowTtts as GlowTTS 2021-03-30 14:39:16 +02:00
Eren Gölge e8cf8cb00e restructure TF tacotron files 2021-03-30 14:39:16 +02:00
Eren Gölge 1ac99ce0d0 if git is not available set git has 'unknown' 2021-03-30 14:39:16 +02:00
Eren Gölge d9c405f0c3 create feedforward folder for SS layers 2021-03-30 14:39:16 +02:00
Eren Gölge a8cf1ae6b4 fix wavenet running with no input mask 2021-03-30 14:39:16 +02:00
Eren Gölge 1c1949d348 utf-8 encoding for certain preprocessors 2021-03-30 14:39:16 +02:00
Eren Gölge ca2f22cdd7 linter fix 2021-03-30 14:36:12 +02:00
Eren Gölge d0dcd7d1b8 let the user define outpu.wav file path fix #393 2021-03-30 14:24:31 +02:00
Eren Gölge 25654233d5 [ci skip]initial commit for the new DE models and stale ot update 2021-03-29 03:23:57 +02:00
Guy Elsmore-Paddock 15459627cc Fix `UnicodeEncodeError` on Windows Platforms
Prevents the following error from appearing when running training on Windows platforms:
```
UnicodeEncodeError: 'charmap' codec can't encode characters in position: character maps to <undefined>
```
2021-03-20 17:30:00 -04:00
Eren Gölge 3947750dd9 Merge branch 'dev' of https://github.com/coqui-ai/TTS into dev 2021-03-18 14:09:47 +01:00
WeberJulian 4a9d2e4309 fix french_cleaners 2021-03-18 13:35:29 +01:00
WeberJulian 596ea2c98a Add resample script 2021-03-18 13:33:37 +01:00
Eren Gölge 6e68637f48 bug fix 2021-03-18 13:33:23 +01:00
Eren Gölge f3e5ddfaaf bug fix in preprocessor 2021-03-18 13:33:23 +01:00
Eren Gölge aeb4f82233 bug fix 2021-03-18 13:33:23 +01:00
Eren Gölge 0514330869 fix mozilla/TTS#685 2021-03-18 13:33:23 +01:00
Eren Gölge f06603a0db force utf8 2021-03-18 13:33:23 +01:00
Eren Gölge 32e8b56c45 linter fix 2021-03-18 13:33:23 +01:00
Eren Gölge 65533f33e9 fix #374 2021-03-18 13:33:00 +01:00
Eren Gölge d790d2fccb linter fix 2021-03-18 13:33:00 +01:00
WeberJulian af96080e17 fix linter issues 2021-03-18 13:33:00 +01:00