Commit Graph

515 Commits

Author SHA1 Message Date
Eren Gölge 93a6bdfd6c linter fixes and version updates for deps 2021-03-08 02:51:10 +01:00
Eren Gölge a30a231566 unpin cython version and commentout pyworld in audio.py causing dep issues 2021-03-08 02:50:15 +01:00
Thorsten Mueller 3eb00e8d93 Set out_path to be required param. 2021-03-08 02:49:15 +01:00
Alexander Korolev ace430d5e6 fix device mismatch wavegrad training
this should fixe the device mismatch as seen here https://github.com/mozilla/TTS/issues/622#issue-789802916
2021-03-08 02:49:15 +01:00
Eren Gölge 83143fbe39 fix #638 2021-03-08 02:48:31 +01:00
Eren Gölge 30c3bef3f9 move hubconf 2021-03-08 02:48:31 +01:00
Eren Gölge bbea6a0884 hubconf.py and load .models.json from the defualt location by mange.py 2021-03-08 02:48:31 +01:00
Eren Gölge 90d4f08d6c reorder imports 2021-03-08 02:48:31 +01:00
Eren Gölge db231c83fc distill import statement, check python version in setup.py 2021-03-08 02:48:31 +01:00
Thorsten Mueller 915ec1faac Added info if model already downloaded in --list_models 2021-03-08 02:48:31 +01:00
Alexander Korolev b4bc5f6eb1 update fixed stopnet_pos_weight parameter
config parameter c.stopnet_pos_weight has currently no effect as it is not used.
2021-03-08 02:48:31 +01:00
Eren Gölge 534e3c67c6 README update, set default models for synthesize.py and server.py. Disable verbose for ap init. 2021-03-08 02:48:31 +01:00
kirianguiller 7f36d91131 update chinese model 2021-03-01 14:55:05 +01:00
Eren Gölge 547bfc4ce9 bug fix 2021-02-18 18:24:03 +00:00
Eren Gölge adaeec57ec Merge branch 'dev' of https://github.com/mozilla/TTS into dev 2021-02-18 17:21:09 +00:00
Eren Gölge 5b70c8ba4f enable backward compat for loading the best model 2021-02-18 17:20:36 +00:00
Eren Gölge e4f81d6856
Merge pull request #654 from kirianguiller/chinese-implementation
Chinese implementation (merge into dev)
2021-02-18 17:15:32 +01:00
kirianguiller 22a6bbfa80 remove gst handling in synthetizer.py class 2021-02-17 20:53:56 +01:00
kirianguiller 3911b87e54 remove re.Match typing in '_number_replace()' 2021-02-17 20:53:56 +01:00
kirianguiller fb0655d1e7 modify according to PR reviews 2021-02-17 20:53:56 +01:00
kirianguiller c4c7bc1b88 <add> Chinese mandarin implementation (tacotron2) 2021-02-17 20:53:56 +01:00
Eren Gölge d0454461de Merge branch 'pr/gerazov/650-2' into dev 2021-02-17 13:40:45 +00:00
Eren Gölge a8ea0ea6ce Docstrings for audioprocessor 2021-02-17 13:35:41 +00:00
Eren Gölge f6e6314910 add RUSLAN dataset preprocessor 2021-02-17 13:35:23 +00:00
Eren Gölge ce0c5eccbd do not test server and modelManager until fixing #657 2021-02-17 00:35:43 +00:00
gerazov 61c88beb94 refactored keep_all_best 2021-02-15 18:40:17 +01:00
Eren Gölge eb543c027e Merge branch 'dev' of https://github.com/mozilla/TTS into dev 2021-02-15 17:06:40 +00:00
Eren Gölge 8a106e0527 fix #655 2021-02-15 17:06:03 +00:00
Eren Gölge 216945e653
Merge pull request #647 from adonispujols/patch-1
Easy Fix for #454 (which was somehow deleted?)
2021-02-15 13:17:17 +01:00
Eren Gölge 06a3ba2fe2 linter update 2021-02-15 12:10:19 +00:00
Eren Gölge 7f58fa365b Merge branch 'save_characters' into dev 2021-02-15 12:07:28 +00:00
Eren Gölge ff218e2370 remove redundancy 2021-02-15 12:07:02 +00:00
Eren Gölge 80af8ca5e1
Update TTS/utils/arguments.py
Co-authored-by: Jörg Thalheim <Mic92@users.noreply.github.com>
2021-02-15 13:03:59 +01:00
Eren Gölge 3b6ce04332
Update TTS/bin/find_unique_chars.py
Co-authored-by: Jörg Thalheim <Mic92@users.noreply.github.com>
2021-02-15 13:02:29 +01:00
Eren Gölge dc3596dad4 model_manager tests 2021-02-15 11:29:22 +00:00
Eren Gölge 77e630348e author , license and contact info in .model.json 2021-02-15 11:02:21 +00:00
Eren Gölge e1bc823e44 Merge branch 'pr/nmstoker/652' into dev 2021-02-15 10:57:12 +00:00
nmstoker 33bcdc6ff8 Updating models list to include EK1 TTS/vocoder 2021-02-14 23:44:05 +00:00
Eren Gölge 420901f4c2 linter fixes 2021-02-12 14:41:17 +00:00
Eren Gölge 4244096ccb update test_text_processing for espeak-ng 2021-02-12 14:07:26 +00:00
Eren Gölge b28c724c04 remove _phoneme_punctuations 2021-02-12 12:10:57 +00:00
Eren Gölge 7ab527d17e save default model chars to the training config file 2021-02-12 12:06:46 +00:00
Eren Gölge 593cedee14 parse_characters function 2021-02-12 12:05:56 +00:00
Eren Gölge 2abfff17f9 enable saving model characters in io.py 2021-02-12 12:04:41 +00:00
Eren Gölge 918f007a11 docstring update 2021-02-12 12:04:07 +00:00
Eren Gölge e774f68aee save used model characters to the checkpoints 2021-02-12 12:03:42 +00:00
gerazov 0e78e31dbf reformated docstrings in arguments.py 2021-02-12 11:36:01 +01:00
gerazov 310d18325e brushed up printing model load path and best loss path 2021-02-12 10:55:45 +01:00
Eren Gölge 8b6fd76ad2 find unique characters in a dataset 2021-02-12 09:46:11 +00:00
gerazov af46727517 loading last checkpoint/best_model works, deleting last best models options added, loading last best_loss added 2021-02-12 02:12:00 +01:00
Eren Gölge a1e595790d use default vocoders in server.pu 2021-02-11 15:31:39 +00:00
Eren Gölge 8aa6a0decb set an output_sample_rate in synthesizer and use it for writing the wav file 2021-02-11 15:28:07 +00:00
Eren Gölge 0c52d27d65 return the json entry of the downloaded model 2021-02-11 15:27:41 +00:00
Eren Gölge 1649ad3431 save_wav with a custom sampling rate 2021-02-11 15:27:20 +00:00
Eren Gölge 43f54d2dce fix make_symbols 2021-02-11 15:26:52 +00:00
Eren Gölge 0657b38111 use default vocoder in synthesize.py 2021-02-11 15:26:17 +00:00
Eren Gölge 2043a9b5f5 define default vocoders 2021-02-11 15:25:55 +00:00
Eren Gölge ff27690ca7 bug fix 2021-02-11 13:43:29 +00:00
Eren Gölge bc131208be fix spelling of a def argument and parse phonemes from config.json if use_phonemes is True 2021-02-11 13:04:47 +00:00
Eren Gölge f1799dbd60 docstring update 2021-02-11 11:25:31 +00:00
Eren Gölge 3baec4ea96 add missing phonemes to test_config.json 2021-02-11 11:14:39 +00:00
Eren Gölge a3d1e65b34 Merge branch 'pr/adonispujols/646' into dev 2021-02-11 10:37:29 +00:00
Eren Gölge 3c2e13ca5c fix the default vocoder name 2021-02-11 10:36:52 +00:00
Adonis Pujols 48011a8b58 add encoding="utf-8" 2021-02-11 05:26:06 -05:00
Adonis Pujols b29a7e9645 spelling error. should be multiband not mulitband 2021-02-11 04:49:28 -05:00
Adonis Pujols 6c824a6629 spelling error. should be multiband not mulitband 2021-02-11 04:48:53 -05:00
Eren Gölge b08b8ca2a1 add russian phoneme char 2021-02-10 13:30:59 +00:00
Eren Gölge 9cad435288 css10 dataset preprocessor 2021-02-09 15:11:26 +00:00
Eren Gölge cea5e517f2 download github model releases by model manager 2021-02-09 14:24:14 +00:00
Eren Gölge c619859a3f linter fixes 2021-02-09 11:43:17 +00:00
gerazov e507373b55 final final fixes 2021-02-06 23:08:47 +01:00
gerazov ad17dc9e76 final fixes 2021-02-06 23:05:01 +01:00
gerazov 8fdd08ea15 updated to current dev 2021-02-06 22:59:52 +01:00
gerazov 2705d27b28 changed train scripts 2021-02-06 22:29:30 +01:00
gerazov 4f8f274d6e restructured arg parsing and processing to utils 2021-02-06 22:25:56 +01:00
Eren Gölge e7e880f514 fix gdown 2021-02-05 13:42:24 +00:00
Eren Gölge f4f6290eec Merge branch 'pr/gerazov/641' into dev 2021-02-05 13:14:49 +00:00
Eren Gölge d49757faaa linter updates 2021-02-05 13:10:43 +00:00
Branislav Gerazov f063545325 improve robustness of defining wavernn in config file 2021-02-05 13:26:33 +01:00
Branislav Gerazov 24ffa9e9f6 update wavernn test config, delete cap=True 2021-02-05 13:10:02 +01:00
Branislav Gerazov cb77aef36c waveRNN fix 2021-02-04 09:52:03 +01:00
Thorsten Mueller d74866cb8e Merge remote-tracking branch 'upstream/dev' into dev
Fix for circleci error mentioned in PR https://github.com/mozilla/TTS/pull/637
2021-02-02 19:40:18 +01:00
Thorsten Mueller a82152eef3 Ups. Added missing , 2021-02-02 19:29:16 +01:00
Thorsten Mueller 4cb4fcf02c Set out_path to be required param. 2021-02-02 19:29:16 +01:00
Thorsten Mueller c75ea74914 Added info if model already downloaded in --list_models 2021-02-02 19:29:16 +01:00
Eren Gölge 2edab4b3f9 disable pw in audio that causes numpy issue 2021-02-01 17:05:03 +00:00
Eren Gölge 5c46543765 linter fixes and version updates for deps 2021-02-01 13:18:56 +00:00
Eren Gölge 8774e37444 unpin cython version and commentout pyworld in audio.py causing dep issues 2021-02-01 11:34:05 +00:00
Eren Gölge 5beed0ddcd Merge branch 'dev' of https://github.com/mozilla/TTS into dev 2021-02-01 11:27:14 +00:00
Eren Gölge c7407571fa fix #638 2021-02-01 10:05:55 +00:00
Eren Gölge dfdac1def9
Merge pull request #636 from thorstenMueller/dev
Set out_path to be required param in compute_statistics.py.
2021-01-29 18:08:31 +01:00
Thorsten Mueller 44c4a49745 Set out_path to be required param. 2021-01-29 17:23:38 +01:00
Eren Gölge 536366dc0a
Merge pull request #635 from SanjaESC/patch-1
fix device mismatch wavegrad training
2021-01-29 16:42:25 +01:00
Eren Gölge 0354b6f35e move hubconf 2021-01-29 15:28:32 +00:00
Eren Gölge aa5f24608a hubconf.py and load .models.json from the defualt location by mange.py 2021-01-29 15:28:26 +00:00
Alexander Korolev e81ebec7a8
fix device mismatch wavegrad training
this should fixe the device mismatch as seen here https://github.com/mozilla/TTS/issues/622#issue-789802916
2021-01-29 15:18:59 +01:00
Eren Gölge a926aa106d reorder imports 2021-01-29 01:36:21 +01:00
Eren Gölge 8a6eee7fec distill import statement, check python version in setup.py 2021-01-28 17:04:08 +01:00
Eren Gölge 131a163c95
Merge pull request #628 from thorstenMueller/dev
Added info if model already downloaded in --list_models
2021-01-28 13:10:06 +01:00
Alexander Korolev ca28e05ed7
update fixed stopnet_pos_weight parameter
config parameter c.stopnet_pos_weight has currently no effect as it is not used.
2021-01-27 16:33:25 +01:00
Thorsten Mueller ccbd542eb0 Added info if model already downloaded in --list_models 2021-01-27 16:19:02 +01:00
Eren Gölge 25c86ca715 README update, set default models for synthesize.py and server.py. Disable verbose for ap init. 2021-01-27 11:47:03 +01:00
Eren Gölge 4f32e77006 platform indep. way to fetch user data folder 2021-01-26 17:32:43 +01:00
Eren Gölge 0117c811a9 add a button to index.html to see the model details 2021-01-26 12:33:27 +01:00
Eren Gölge a3adcaccdb Merge branch 'pr/thorstenMueller/623' into dev 2021-01-26 12:19:39 +01:00
Eren Gölge b464cab9b8 setup.py update and pylint fixes 2021-01-26 02:57:50 +01:00
Eren Gölge 660d61aeeb maximum_path_numpy and CYTHON adabtable import 2021-01-26 02:57:07 +01:00
Eren Gölge 877f0bbfba manifest.in update 2021-01-26 02:56:55 +01:00
Eren Gölge 82e029529e fix manifest file 2021-01-25 13:27:54 +01:00
Eren Gölge 57b668fd86 fixing dome pypi issues 2021-01-25 13:06:12 +01:00
Eren Gölge 60c1bb93d9 fixes before first PyPI release 2021-01-25 11:16:20 +01:00
Thorsten Mueller afb7db2a1d Removed unneeded check and removed specific taco2 model name. 2021-01-22 16:22:50 +01:00
Eren Gölge fae10309e4
Merge pull request #624 from SanjaESC/patch-3
Update train_tacotron.py
2021-01-22 13:29:09 +01:00
Eren Gölge 5ee73c2bae Merge branch 'dev' of https://github.com/mozilla/TTS into dev 2021-01-22 13:26:27 +01:00
Eren Gölge 5fb611ef40 static image for server index.html 2021-01-22 03:01:53 +01:00
Eren Gölge ca647cf222 Model Manager to download released models 2021-01-22 02:35:43 +01:00
Eren Gölge ca8ad9c21e rename audio._normalize to audio.normalize 2021-01-22 02:33:19 +01:00
Eren Gölge c990b3a59c linter fixes and test fixes 2021-01-22 02:32:35 +01:00
Alexander Korolev f251dc8c0e
Update train_tacotron.py
When attempting to fine-tune a model with "prenet_type": "bn" that was originally trained with "prenet_type": "original", a RuntimeError is thrown that stops the training.

By catching the RuntimeError, the required layers can be partially restored and the training will continue without any problems.
2021-01-21 21:16:30 +01:00
Eren Gölge 0ab2eb2664 use synthesizer in both synthesize.py and server.pu 2021-01-21 15:54:33 +01:00
Eren Gölge 9addfabc43 wavernn load_checkpoint function 2021-01-21 15:31:13 +01:00
Eren Gölge 50fee59a2c update synthesizer.py for better interfacing to different models 2021-01-21 15:30:49 +01:00
Eren Gölge 007a4d7139 remove 3rd paty wavernn support from server.py and add ModelManager arguments 2021-01-21 15:30:16 +01:00
Eren Gölge 6b6e989fd2 update server readme 2021-01-21 15:29:46 +01:00
Thorsten Mueller e414582be6 Added option for server ui details page. 2021-01-20 21:56:40 +01:00
root 1bc8fbbd3c set eval mode whe nloading models 2021-01-20 02:14:18 +00:00
root 5bd7238153 interpolate spectrogram in vocoder generic utils for matching sample rates 2021-01-20 02:13:01 +00:00
root ca3743539a load_checkpoint func for vocoder models 2021-01-20 02:12:29 +00:00
root ea39715305 read_json_with_comments 2021-01-20 02:11:55 +00:00
root 563bc921d8 optional verbose for audio.py init 2021-01-20 02:11:24 +00:00
root 1faf565e3a add load_checkpoint func to tts models 2021-01-20 02:10:56 +00:00
root 5c87753e88 glow-tts fix for saving inverse weight 2021-01-20 02:09:42 +00:00
root 3d30dae8f3 .models.json and synthesize.py update for interfacing with model manager 2021-01-20 02:08:58 +00:00
gerazov b2b4828f17 set requires_grad=False 2021-01-16 19:46:04 +01:00
gerazov c96f7a2614 TorchSTFT to device fix 2021-01-16 12:21:16 +01:00
root 7beaacc55b update compute_attention_masks.py 2021-01-13 10:03:57 +00:00
erogol 428c224b88 commet update 2021-01-12 17:31:04 +01:00
erogol bbc8d665a1 move attention layers to a sperate file 2021-01-11 17:27:30 +01:00
erogol 79c841ccd3 mass refactoring and update 2021-01-11 17:26:58 +01:00
erogol 1d961d6f8a cladd renaming 2021-01-11 17:26:11 +01:00
erogol c0a2aa68d3 formatting 2021-01-11 17:25:39 +01:00
erogol b206162d11 more docstrings 2021-01-11 17:25:04 +01:00
erogol 6e9043c5d2 rename convbnblocks and handle none mask 2021-01-11 17:22:34 +01:00
erogol 921fa5db92 remove attentions from common layers 2021-01-11 15:06:42 +01:00
erogol cc2b1e043d docstrings for common layers 2021-01-11 15:06:12 +01:00
erogol a6f40fef2e stage missing files 2021-01-08 16:02:56 +01:00
erogol d382d759b3 small fixes and test fixes 2021-01-08 15:48:40 +01:00
erogol a6259041d3 docstring for speedyspeech 2021-01-07 14:35:22 +01:00
erogol de2a542f83 glow-tts bug fix 2021-01-07 13:40:32 +01:00
erogol 14d33662ea input shapes for tacotron models 2021-01-06 13:19:40 +01:00
erogol f288e9a260 docstrings for taoctron models 2021-01-06 13:19:40 +01:00
erogol 5a45af48f1 fix 2021-01-06 13:19:40 +01:00
erogol e7fad928e7 doc strings for the all glow-tts layers 2021-01-06 13:19:40 +01:00
erogol d3b7284be4 glow-tts comments and refactoring 2021-01-06 13:19:40 +01:00
erogol 7586fbc4de SS refactoring 2021-01-06 13:19:40 +01:00
erogol e82d31b6ac glow ttss refactoring 2021-01-06 13:19:40 +01:00
erogol 29f4329d7f update glow-tts layers and add some comments 2021-01-06 13:19:40 +01:00
erogol 29cf933831 update SS condif 2021-01-06 13:19:40 +01:00
erogol 228ada04b5 update glow-tts ljspeech config 2021-01-06 13:19:40 +01:00
erogol f352b3534c make noise augmentation optional 2021-01-06 13:19:40 +01:00
erogol 71c382be14 copy model scale stats file with config.json to the trianing folder, fixed for model inits 2021-01-06 13:19:40 +01:00
erogol aa40fe1aa0 SS model refacotring for multi speaker 2021-01-06 13:19:40 +01:00
erogol eb555855e4 small fixes 2021-01-06 13:19:40 +01:00
erogol 5901a00576 argument rename 2021-01-06 13:19:40 +01:00
erogol 4ef083f0f1 select decoder type for SS 2021-01-06 13:19:40 +01:00
erogol d5a0190c4b update copy_config_file to copy_model_files 2021-01-06 13:19:40 +01:00
erogol 8971c59b2d plot eval alignment score right 2021-01-06 13:19:40 +01:00
erogol 3fa408a5ea change order BN + ReLU to ReLU + BN for SS 2021-01-06 13:19:40 +01:00
erogol ac5c9217d1 positional encoding masking for SS 2021-01-06 13:19:40 +01:00
erogol fede46e96e pylint and test fixes 2021-01-06 13:19:40 +01:00
erogol 2abe3df153 compute_attention_masks.py 2021-01-06 13:19:40 +01:00
erogol cf869e8922 add SS files 2021-01-06 13:19:40 +01:00
erogol e4680e1b99 plot float16 alignments 2021-01-06 13:19:40 +01:00
erogol 13c6665c92 inference for SS 2021-01-06 13:19:40 +01:00
erogol 30788960a8 check SS model parameters 2021-01-06 13:19:40 +01:00
erogol 5cae2c5742 make optional position encoding for speedyspeech 2021-01-06 13:19:40 +01:00
erogol dc4a16d62e speedy speehc losses 2021-01-06 13:19:40 +01:00
erogol d62cac7252 fix glow-tts prenet bug fix 2021-01-06 13:19:40 +01:00
erogol a1d5a9ddda config update tyo use noise for augmentation 2021-01-06 13:19:40 +01:00
erogol 022af74d74 update prompt msg 2021-01-06 13:19:40 +01:00
erogol 57ef53bef3 update argumnet check for non tacotron models 2021-01-06 13:19:40 +01:00
erogol 27a75de15f update processors for loading attention maps 2021-01-06 13:19:40 +01:00
erogol fa6907fa0e update glow-tts parameters and fix rel-attn-win size 2021-01-06 13:19:40 +01:00
erogol 7b20d8cbd3 implement residual BN convolution and add it as an alternative encoder for glow-tts. also generic layers to layers/generic 2021-01-06 13:19:40 +01:00
erogol 973754d893 fix for init glow-tts 2021-01-06 13:19:40 +01:00
erogol f81af4eb0d config update disable guided attention for dynamic conv attention 2021-01-06 13:19:40 +01:00
erogol 29b17c0808 bug fix for gradual training 2021-01-06 13:19:40 +01:00
erogol 5c50e104d6 config update 2021-01-06 13:19:40 +01:00
erogol 6478d552dc tacotron training bug fix 2021-01-06 13:19:40 +01:00
erogol 1dd086577a tacotron training bug fix 2021-01-06 13:18:41 +01:00
erogol fa20638083 config for ljspeech dynamic conv attention 2021-01-06 13:18:41 +01:00
erogol 070146e143 add monotonic dynamic convolution attention 2021-01-06 13:18:41 +01:00
erogol 18392bc13a Merge branch 'dev' of https://github.com/mozilla/TTS into dev 2021-01-06 13:18:08 +01:00
Thorsten Mueller f673f8f74d Added support for npy output from tune-wavegrad 2020-12-19 22:51:22 +01:00
Thorsten Mueller 2aa0354b44 Fix for 'NoneType' object has no attribute 'to' 2020-12-19 22:37:03 +01:00
Thorsten Mueller 28a64221ea Improve robostness on cpu / gpu model mix 2020-12-19 22:23:28 +01:00
erogol 8293751a38 remove mozilla from server page 2020-12-17 12:28:28 +01:00
erogol 639fa29261 update speaker id casting for glow-tts 2020-12-14 16:58:47 +01:00
erogol 999120ecdf Merge branch 'dev' of https://github.com/mozilla/TTS into dev 2020-12-12 18:50:14 +01:00
erogol f611e6ac01 Merge branch 'dev' of https://github.com/mozilla/TTS into dev 2020-12-12 18:47:59 +01:00