Commit Graph

303 Commits

Author SHA1 Message Date
Eren Gölge fae10309e4
Merge pull request #624 from SanjaESC/patch-3
Update train_tacotron.py
2021-01-22 13:29:09 +01:00
Eren Gölge 5ee73c2bae Merge branch 'dev' of https://github.com/mozilla/TTS into dev 2021-01-22 13:26:27 +01:00
Eren Gölge 5fb611ef40 static image for server index.html 2021-01-22 03:01:53 +01:00
Eren Gölge ca647cf222 Model Manager to download released models 2021-01-22 02:35:43 +01:00
Eren Gölge ca8ad9c21e rename audio._normalize to audio.normalize 2021-01-22 02:33:19 +01:00
Eren Gölge c990b3a59c linter fixes and test fixes 2021-01-22 02:32:35 +01:00
Alexander Korolev f251dc8c0e
Update train_tacotron.py
When attempting to fine-tune a model with "prenet_type": "bn" that was originally trained with "prenet_type": "original", a RuntimeError is thrown that stops the training.

By catching the RuntimeError, the required layers can be partially restored and the training will continue without any problems.
2021-01-21 21:16:30 +01:00
Eren Gölge 0ab2eb2664 use synthesizer in both synthesize.py and server.py 2021-01-21 15:54:33 +01:00
Eren Gölge 9addfabc43 wavernn load_checkpoint function 2021-01-21 15:31:13 +01:00
Eren Gölge 50fee59a2c update synthesizer.py for better interfacing to different models 2021-01-21 15:30:49 +01:00
Eren Gölge 007a4d7139 remove 3rd party wavernn support from server.py and add ModelManager arguments 2021-01-21 15:30:16 +01:00
Eren Gölge 6b6e989fd2 update server readme 2021-01-21 15:29:46 +01:00
Thorsten Mueller e414582be6 Added option for server ui details page. 2021-01-20 21:56:40 +01:00
root 1bc8fbbd3c set eval mode when loading models 2021-01-20 02:14:18 +00:00
root 5bd7238153 interpolate spectrogram in vocoder generic utils for matching sample rates 2021-01-20 02:13:01 +00:00
root ca3743539a load_checkpoint func for vocoder models 2021-01-20 02:12:29 +00:00
root ea39715305 read_json_with_comments 2021-01-20 02:11:55 +00:00
root 563bc921d8 optional verbose for audio.py init 2021-01-20 02:11:24 +00:00
root 1faf565e3a add load_checkpoint func to tts models 2021-01-20 02:10:56 +00:00
root 5c87753e88 glow-tts fix for saving inverse weight 2021-01-20 02:09:42 +00:00
root 3d30dae8f3 .models.json and synthesize.py update for interfacing with model manager 2021-01-20 02:08:58 +00:00
gerazov b2b4828f17 set requires_grad=False 2021-01-16 19:46:04 +01:00
gerazov c96f7a2614 TorchSTFT to device fix 2021-01-16 12:21:16 +01:00
root 7beaacc55b update compute_attention_masks.py 2021-01-13 10:03:57 +00:00
erogol 428c224b88 comment update 2021-01-12 17:31:04 +01:00
erogol bbc8d665a1 move attention layers to a separate file 2021-01-11 17:27:30 +01:00
erogol 79c841ccd3 mass refactoring and update 2021-01-11 17:26:58 +01:00
erogol 1d961d6f8a class renaming 2021-01-11 17:26:11 +01:00
erogol c0a2aa68d3 formatting 2021-01-11 17:25:39 +01:00
erogol b206162d11 more docstrings 2021-01-11 17:25:04 +01:00
erogol 6e9043c5d2 rename convbnblocks and handle none mask 2021-01-11 17:22:34 +01:00
erogol 921fa5db92 remove attentions from common layers 2021-01-11 15:06:42 +01:00
erogol cc2b1e043d docstrings for common layers 2021-01-11 15:06:12 +01:00
erogol a6f40fef2e stage missing files 2021-01-08 16:02:56 +01:00
erogol d382d759b3 small fixes and test fixes 2021-01-08 15:48:40 +01:00
erogol a6259041d3 docstring for speedyspeech 2021-01-07 14:35:22 +01:00
erogol de2a542f83 glow-tts bug fix 2021-01-07 13:40:32 +01:00
erogol 14d33662ea input shapes for tacotron models 2021-01-06 13:19:40 +01:00
erogol f288e9a260 docstrings for tacotron models 2021-01-06 13:19:40 +01:00
erogol 5a45af48f1 fix 2021-01-06 13:19:40 +01:00
erogol e7fad928e7 docstrings for all the glow-tts layers 2021-01-06 13:19:40 +01:00
erogol d3b7284be4 glow-tts comments and refactoring 2021-01-06 13:19:40 +01:00
erogol 7586fbc4de SS refactoring 2021-01-06 13:19:40 +01:00
erogol e82d31b6ac glow-tts refactoring 2021-01-06 13:19:40 +01:00
erogol 29f4329d7f update glow-tts layers and add some comments 2021-01-06 13:19:40 +01:00
erogol 29cf933831 update SS config 2021-01-06 13:19:40 +01:00
erogol 228ada04b5 update glow-tts ljspeech config 2021-01-06 13:19:40 +01:00
erogol f352b3534c make noise augmentation optional 2021-01-06 13:19:40 +01:00
erogol 71c382be14 copy model scale stats file with config.json to the training folder, fixed for model inits 2021-01-06 13:19:40 +01:00
erogol aa40fe1aa0 SS model refactoring for multi speaker 2021-01-06 13:19:40 +01:00
erogol eb555855e4 small fixes 2021-01-06 13:19:40 +01:00
erogol 5901a00576 argument rename 2021-01-06 13:19:40 +01:00
erogol 4ef083f0f1 select decoder type for SS 2021-01-06 13:19:40 +01:00
erogol d5a0190c4b update copy_config_file to copy_model_files 2021-01-06 13:19:40 +01:00
erogol 8971c59b2d plot eval alignment score right 2021-01-06 13:19:40 +01:00
erogol 3fa408a5ea change order BN + ReLU to ReLU + BN for SS 2021-01-06 13:19:40 +01:00
erogol ac5c9217d1 positional encoding masking for SS 2021-01-06 13:19:40 +01:00
erogol fede46e96e pylint and test fixes 2021-01-06 13:19:40 +01:00
erogol 2abe3df153 compute_attention_masks.py 2021-01-06 13:19:40 +01:00
erogol cf869e8922 add SS files 2021-01-06 13:19:40 +01:00
erogol e4680e1b99 plot float16 alignments 2021-01-06 13:19:40 +01:00
erogol 13c6665c92 inference for SS 2021-01-06 13:19:40 +01:00
erogol 30788960a8 check SS model parameters 2021-01-06 13:19:40 +01:00
erogol 5cae2c5742 make optional position encoding for speedyspeech 2021-01-06 13:19:40 +01:00
erogol dc4a16d62e speedy speech losses 2021-01-06 13:19:40 +01:00
erogol d62cac7252 glow-tts prenet bug fix 2021-01-06 13:19:40 +01:00
erogol a1d5a9ddda config update to use noise for augmentation 2021-01-06 13:19:40 +01:00
erogol 022af74d74 update prompt msg 2021-01-06 13:19:40 +01:00
erogol 57ef53bef3 update argument check for non tacotron models 2021-01-06 13:19:40 +01:00
erogol 27a75de15f update processors for loading attention maps 2021-01-06 13:19:40 +01:00
erogol fa6907fa0e update glow-tts parameters and fix rel-attn-win size 2021-01-06 13:19:40 +01:00
erogol 7b20d8cbd3 implement residual BN convolution and add it as an alternative encoder for glow-tts. also generic layers to layers/generic 2021-01-06 13:19:40 +01:00
erogol 973754d893 fix for init glow-tts 2021-01-06 13:19:40 +01:00
erogol f81af4eb0d config update disable guided attention for dynamic conv attention 2021-01-06 13:19:40 +01:00
erogol 29b17c0808 bug fix for gradual training 2021-01-06 13:19:40 +01:00
erogol 5c50e104d6 config update 2021-01-06 13:19:40 +01:00
erogol 6478d552dc tacotron training bug fix 2021-01-06 13:19:40 +01:00
erogol 1dd086577a tacotron training bug fix 2021-01-06 13:18:41 +01:00
erogol fa20638083 config for ljspeech dynamic conv attention 2021-01-06 13:18:41 +01:00
erogol 070146e143 add monotonic dynamic convolution attention 2021-01-06 13:18:41 +01:00
erogol 18392bc13a Merge branch 'dev' of https://github.com/mozilla/TTS into dev 2021-01-06 13:18:08 +01:00
Thorsten Mueller f673f8f74d Added support for npy output from tune-wavegrad 2020-12-19 22:51:22 +01:00
Thorsten Mueller 2aa0354b44 Fix for 'NoneType' object has no attribute 'to' 2020-12-19 22:37:03 +01:00
Thorsten Mueller 28a64221ea Improve robustness on cpu / gpu model mix 2020-12-19 22:23:28 +01:00
erogol 8293751a38 remove mozilla from server page 2020-12-17 12:28:28 +01:00
erogol 639fa29261 update speaker id casting for glow-tts 2020-12-14 16:58:47 +01:00
erogol 999120ecdf Merge branch 'dev' of https://github.com/mozilla/TTS into dev 2020-12-12 18:50:14 +01:00
erogol f611e6ac01 Merge branch 'dev' of https://github.com/mozilla/TTS into dev 2020-12-12 18:47:59 +01:00
Jörg Thalheim 62fd4ca70d inflect negative numbers correctly 2020-12-10 16:47:51 +01:00
Jörg Thalheim 6646682650 cleaners: expand english time 2020-12-10 14:53:20 +01:00
Jörg Thalheim 76138687d3 expand more currencies 2020-12-10 14:53:20 +01:00
erogol a2859b7ddc update config args checks 2020-12-10 13:52:57 +01:00
erogol 788cd6f902 fix multi-speaker glow-tts inference 2020-12-10 02:05:48 +01:00
erogol 3d5066e2b8 Merge branch 'dev' of https://github.com/mozilla/TTS into dev 2020-12-10 00:31:03 +01:00
erogol 92cc9630d7 fix glow-tts synthesis for DPP 2020-12-10 00:30:34 +01:00
Eren Gölge 2473b2dc62
Merge pull request #559 from krzim/patch-1
Fix import to grab the encoder model save function
2020-12-10 00:19:32 +01:00
erogol 53679b706d glow-tts distributed fix 2020-12-09 23:39:09 +01:00
erogol 62bc171db5 Merge branch 'dev' of https://github.com/mozilla/TTS into dev 2020-12-09 15:46:57 +01:00
erogol df180148e9 use noise augmentation in TTSDataset 2020-12-09 15:46:25 +01:00
Thorsten Mueller e39628ce2f Limit filenames to 10 chars 2020-12-08 18:44:19 +01:00
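
Note on commit f251dc8c0e: its message describes catching a RuntimeError raised when a checkpoint does not match the model being fine-tuned, then partially restoring the layers that do match. The following is only a minimal sketch of that idea, not the code from the commit. It uses plain Python dictionaries and nested lists in place of real model weights, and every name in it (shape_of, strict_load, partial_restore, load_for_finetuning, and the layer keys) is hypothetical and chosen for illustration.

```python
def shape_of(value):
    """Return the shape of a nested list as a tuple (a scalar has shape ())."""
    shape = []
    while isinstance(value, list):
        shape.append(len(value))
        value = value[0] if value else None
    return tuple(shape)


def strict_load(model_state, checkpoint_state):
    """Hypothetical strict loader: rejects any key or shape mismatch."""
    if set(model_state) != set(checkpoint_state):
        raise RuntimeError("unexpected or missing keys in checkpoint")
    for key in model_state:
        if shape_of(model_state[key]) != shape_of(checkpoint_state[key]):
            raise RuntimeError(f"size mismatch for {key}")
    return dict(checkpoint_state)


def partial_restore(model_state, checkpoint_state):
    """Copy only the checkpoint entries whose key and shape match the model."""
    restored = dict(model_state)  # start from the model's own initial values
    skipped = []
    for key, value in checkpoint_state.items():
        if key in model_state and shape_of(model_state[key]) == shape_of(value):
            restored[key] = value
        else:
            skipped.append(key)
    return restored, skipped


def load_for_finetuning(model_state, checkpoint_state):
    """Try a strict load first; on RuntimeError fall back to partial restore."""
    try:
        return strict_load(model_state, checkpoint_state), []
    except RuntimeError:
        return partial_restore(model_state, checkpoint_state)


# Assumed example: the new model has an extra batch-norm entry and a
# differently shaped decoder, so the strict load fails.
model_state = {
    "prenet.linear.weight": [[0.0, 0.0]],
    "prenet.bn.weight": [1.0, 1.0],
    "decoder.weight": [[0.0]],
}
checkpoint_state = {
    "prenet.linear.weight": [[0.5, -0.5]],
    "decoder.weight": [[2.0, 3.0]],
}
restored, skipped = load_for_finetuning(model_state, checkpoint_state)
```

In this sketch the matching entry is taken from the checkpoint, while the entries that are missing from it or have a different shape keep the model's initial values and would be trained from scratch.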