Commit Graph

275 Commits

Author SHA1 Message Date
erogol c0a2aa68d3 formatting 2021-01-11 17:25:39 +01:00
erogol b206162d11 more docstrings 2021-01-11 17:25:04 +01:00
erogol 6e9043c5d2 rename convbnblocks and handle none mask 2021-01-11 17:22:34 +01:00
erogol 921fa5db92 remove attentions from common layers 2021-01-11 15:06:42 +01:00
erogol cc2b1e043d docstrings for common layers 2021-01-11 15:06:12 +01:00
erogol a6f40fef2e stage missing files 2021-01-08 16:02:56 +01:00
erogol d382d759b3 small fixes and test fixes 2021-01-08 15:48:40 +01:00
erogol a6259041d3 docstring for speedyspeech 2021-01-07 14:35:22 +01:00
erogol de2a542f83 glow-tts bug fix 2021-01-07 13:40:32 +01:00
erogol 14d33662ea input shapes for tacotron models 2021-01-06 13:19:40 +01:00
erogol f288e9a260 docstrings for taoctron models 2021-01-06 13:19:40 +01:00
erogol 5a45af48f1 fix 2021-01-06 13:19:40 +01:00
erogol e7fad928e7 doc strings for the all glow-tts layers 2021-01-06 13:19:40 +01:00
erogol d3b7284be4 glow-tts comments and refactoring 2021-01-06 13:19:40 +01:00
erogol 7586fbc4de SS refactoring 2021-01-06 13:19:40 +01:00
erogol e82d31b6ac glow ttss refactoring 2021-01-06 13:19:40 +01:00
erogol 29f4329d7f update glow-tts layers and add some comments 2021-01-06 13:19:40 +01:00
erogol 29cf933831 update SS condif 2021-01-06 13:19:40 +01:00
erogol 228ada04b5 update glow-tts ljspeech config 2021-01-06 13:19:40 +01:00
erogol f352b3534c make noise augmentation optional 2021-01-06 13:19:40 +01:00
erogol 71c382be14 copy model scale stats file with config.json to the trianing folder, fixed for model inits 2021-01-06 13:19:40 +01:00
erogol aa40fe1aa0 SS model refacotring for multi speaker 2021-01-06 13:19:40 +01:00
erogol eb555855e4 small fixes 2021-01-06 13:19:40 +01:00
erogol 5901a00576 argument rename 2021-01-06 13:19:40 +01:00
erogol 4ef083f0f1 select decoder type for SS 2021-01-06 13:19:40 +01:00
erogol d5a0190c4b update copy_config_file to copy_model_files 2021-01-06 13:19:40 +01:00
erogol 8971c59b2d plot eval alignment score right 2021-01-06 13:19:40 +01:00
erogol 3fa408a5ea change order BN + ReLU to ReLU + BN for SS 2021-01-06 13:19:40 +01:00
erogol ac5c9217d1 positional encoding masking for SS 2021-01-06 13:19:40 +01:00
erogol fede46e96e pylint and test fixes 2021-01-06 13:19:40 +01:00
erogol 2abe3df153 compute_attention_masks.py 2021-01-06 13:19:40 +01:00
erogol cf869e8922 add SS files 2021-01-06 13:19:40 +01:00
erogol e4680e1b99 plot float16 alignments 2021-01-06 13:19:40 +01:00
erogol 13c6665c92 inference for SS 2021-01-06 13:19:40 +01:00
erogol 30788960a8 check SS model parameters 2021-01-06 13:19:40 +01:00
erogol 5cae2c5742 make optional position encoding for speedyspeech 2021-01-06 13:19:40 +01:00
erogol dc4a16d62e speedy speehc losses 2021-01-06 13:19:40 +01:00
erogol d62cac7252 fix glow-tts prenet bug fix 2021-01-06 13:19:40 +01:00
erogol a1d5a9ddda config update tyo use noise for augmentation 2021-01-06 13:19:40 +01:00
erogol 022af74d74 update prompt msg 2021-01-06 13:19:40 +01:00
erogol 57ef53bef3 update argumnet check for non tacotron models 2021-01-06 13:19:40 +01:00
erogol 27a75de15f update processors for loading attention maps 2021-01-06 13:19:40 +01:00
erogol fa6907fa0e update glow-tts parameters and fix rel-attn-win size 2021-01-06 13:19:40 +01:00
erogol 7b20d8cbd3 implement residual BN convolution and add it as an alternative encoder for glow-tts. also generic layers to layers/generic 2021-01-06 13:19:40 +01:00
erogol 973754d893 fix for init glow-tts 2021-01-06 13:19:40 +01:00
erogol f81af4eb0d config update disable guided attention for dynamic conv attention 2021-01-06 13:19:40 +01:00
erogol 29b17c0808 bug fix for gradual training 2021-01-06 13:19:40 +01:00
erogol 5c50e104d6 config update 2021-01-06 13:19:40 +01:00
erogol 6478d552dc tacotron training bug fix 2021-01-06 13:19:40 +01:00
erogol 1dd086577a tacotron training bug fix 2021-01-06 13:18:41 +01:00
erogol fa20638083 config for ljspeech dynamic conv attention 2021-01-06 13:18:41 +01:00
erogol 070146e143 add monotonic dynamic convolution attention 2021-01-06 13:18:41 +01:00
erogol 18392bc13a Merge branch 'dev' of https://github.com/mozilla/TTS into dev 2021-01-06 13:18:08 +01:00
Thorsten Mueller f673f8f74d Added support for npy output from tune-wavegrad 2020-12-19 22:51:22 +01:00
Thorsten Mueller 2aa0354b44 Fix for 'NoneType' object has no attribute 'to' 2020-12-19 22:37:03 +01:00
Thorsten Mueller 28a64221ea Improve robostness on cpu / gpu model mix 2020-12-19 22:23:28 +01:00
erogol 8293751a38 remove mozilla from server page 2020-12-17 12:28:28 +01:00
erogol 639fa29261 update speaker id casting for glow-tts 2020-12-14 16:58:47 +01:00
erogol 999120ecdf Merge branch 'dev' of https://github.com/mozilla/TTS into dev 2020-12-12 18:50:14 +01:00
erogol f611e6ac01 Merge branch 'dev' of https://github.com/mozilla/TTS into dev 2020-12-12 18:47:59 +01:00
Jörg Thalheim 62fd4ca70d inflect negative numbers correctly 2020-12-10 16:47:51 +01:00
Jörg Thalheim 6646682650 cleaners: expand english time 2020-12-10 14:53:20 +01:00
Jörg Thalheim 76138687d3 expand more currencies 2020-12-10 14:53:20 +01:00
erogol a2859b7ddc update config args checks 2020-12-10 13:52:57 +01:00
erogol 788cd6f902 fix multi-speaker glow-tts inference 2020-12-10 02:05:48 +01:00
erogol 3d5066e2b8 Merge branch 'dev' of https://github.com/mozilla/TTS into dev 2020-12-10 00:31:03 +01:00
erogol 92cc9630d7 fix glow-tts synthesis for DPP 2020-12-10 00:30:34 +01:00
Eren Gölge 2473b2dc62 Merge pull request #559 from krzim/patch-1. Fix import to grab the encoder model save function 2020-12-10 00:19:32 +01:00
erogol 53679b706d glow-tts distributed fix 2020-12-09 23:39:09 +01:00
erogol 62bc171db5 Merge branch 'dev' of https://github.com/mozilla/TTS into dev 2020-12-09 15:46:57 +01:00
erogol df180148e9 use noise augmentation in TTSDataset 2020-12-09 15:46:25 +01:00
Thorsten Mueller e39628ce2f Limit filenames to 10 chars 2020-12-08 18:44:19 +01:00
erogol 06612ce305 test fixes 2020-12-07 15:57:34 +01:00
erogol 0252a07fa6 Merge branch 'dev' of https://github.com/mozilla/TTS into dev 2020-12-07 11:31:55 +01:00
erogol 482e725752 sync torch calls before logging training results 2020-12-07 11:30:19 +01:00
erogol 7505c0ba27 muliprocess phoneme computation 2020-12-07 11:29:41 +01:00
erogol 20c86489d7 make static methods for faster multiprocess call 2020-12-07 11:29:10 +01:00
erogol affe1c1138 setup training scripts for computing phonemes before training optionally. And define data_loaders before starting training and re-use them instead of re-define for every train and eval calls. This is to enable better instance filtering based on input length. 2020-12-07 11:26:57 +01:00
Alexander Korolev f42ca2b73f Update wavegrad.py. This should fix the issue https://github.com/mozilla/TTS/issues/581 2020-12-04 16:43:39 +01:00
erogol 7c3cdced1a make speaker_mapping a global variable to prevent reload. Fix glow-tts training 2020-12-01 03:23:25 +01:00
Thorsten Mueller 06a389bc08 Added option for saving raw spectograms 2020-11-27 15:49:55 +01:00
erogol a757b203bc fix longer phoneme seqs 2020-11-26 15:05:03 +01:00
erogol 7b0a93d2f8 fix 2020-11-26 11:44:52 +01:00
erogol 0c6f7e4c77 resample audio if flag set true 2020-11-26 11:30:48 +01:00
erogol f6c96b0ac2 Merge branch 'dev' 2020-11-25 15:29:06 +01:00
erogol e3b7157146 remove contextlib 2020-11-25 15:22:01 +01:00
erogol e3eda159d1 wavegrad_dataset update 2020-11-25 14:50:50 +01:00
erogol a1e4ee18f9 convert float16 to float32 for plotting spectrograms 2020-11-25 14:50:28 +01:00
erogol 7541d2ecaa return eval split optional 2020-11-25 14:50:09 +01:00
erogol 4b92ac0f92 tune_wavegrad update 2020-11-25 14:49:48 +01:00
erogol d8c1b5b73d print max lengths in tacotron training 2020-11-25 14:49:07 +01:00
erogol 1229554c42 use native amp 2020-11-25 14:48:54 +01:00
erogol 8a820930c6 compute_embedding update 2020-11-25 14:46:08 +01:00
erogol aa2b31a1b0 use 'enabled' argument to control autocast 2020-11-17 14:22:01 +01:00
erogol d9d04d892b Merge branch 'dev' of https://github.com/mozilla/TTS into dev 2020-11-17 14:17:24 +01:00
erogol 8b0e0846a3 temporary travis check 2020-11-17 14:17:03 +01:00
Qingping Hou b0b97d636f speed up metafile build for voxceleb 2020-11-14 23:45:17 -08:00
erogol a2a142dc39 Merge branch 'dev' of https://github.com/mozilla/TTS into dev 2020-11-14 13:02:19 +01:00
erogol c65712426a change noise scheduling for wavegrad. Compute beta values externally to enable better flexibility 2020-11-14 13:01:10 +01:00
erogol 5a59467f34 scaler fix for wavegrad and wavernn. Save and load scaler 2020-11-14 13:00:35 +01:00