Commit Graph

1186 Commits

Author SHA1 Message Date
Edresson 1496f271dc update Compute embeddings script 2021-05-27 00:45:18 -03:00
Edresson bc5307caa0 add unit tests for SoftmaxAngleProtoLoss and ResnetSpeakerEncoder and bugfix 2021-05-26 20:35:58 -03:00
Edresson c90037c2e9 solve merge problems 2021-05-26 16:01:30 -03:00
Katsuya Iida f921a05bdb Fixed lint errors 2021-05-26 19:02:16 +09:00
Edresson Casanova f89cb6aec2 Merge branch 'dev' into dev 2021-05-25 17:30:25 -03:00
Edresson d570c2d790 pylint fix and data loader bug fix 2021-05-26 01:11:37 -03:00
Katsuya Iida 0536aa6d0f Japanese Tacotron 2 model 2021-05-22 17:12:19 +09:00
Eren Gölge 5482a0f62d type def for gradual_training 2021-05-19 14:03:26 +02:00
Eren Gölge df6a98d0c3 type def for gradual_training 2021-05-19 14:00:44 +02:00
Eren Gölge 16576d6408 bump version number 2021-05-19 12:35:10 +02:00
Eren Gölge 8a7c40736c set use_phonemes false 2021-05-19 01:27:26 +02:00
Eren Gölge ccfaa6b1d5 add `needs_phonemizer` field to models.json. If set true these models are only compatible with v0.0.13 or below. 2021-05-18 17:57:28 +02:00
Eren Gölge a14fcf2a13 remove text_processing test 2021-05-18 17:57:28 +02:00
Eren Gölge d7fae3f515 remove all espeaker and phonemizer deps 2021-05-18 17:57:28 +02:00
Eren Gölge ced05e812a move chinese phonemizer 2021-05-18 17:57:28 +02:00
Eren Gölge 218af1d9a2 change `list` to `List` in config 2021-05-18 17:30:27 +02:00
Eren Gölge 4df31f7fbd unused_speakers argument for ignoring speaker ids in multi-speaker training 2021-05-18 14:50:03 +02:00
Eren Gölge c2c7dff805 use relaxted coqpit parser 2021-05-18 14:49:47 +02:00
Edresson 856ea19758 bug fix in dataloader and update inference 2021-05-18 03:43:16 -03:00
Eren Gölge d1b469935d tacotron DDC LJSpeech recipe 2021-05-17 11:42:14 +02:00
Eren Gölge 34a42d379f update tacotron_config.py for checking `r` and the docstring 2021-05-17 11:35:30 +02:00
Eren Gölge 12722501bb styling 2021-05-15 23:48:31 +02:00
Eren Gölge 8b1014d188 add docstrings with default value fixes 2021-05-15 23:45:10 +02:00
Eren Gölge da49089a72 update melgan training test batch size 2021-05-12 10:12:11 +02:00
Edresson 3433c2f348 add compute embedding for the new speaker encoder 2021-05-12 03:06:46 -03:00
Eren Gölge 0213e1cbf4 update configs for tts models to match the field typed with the expected values 2021-05-12 00:57:38 +02:00
Eren Gölge 715b0a65a0 update main.yml for python x64; fix test 2021-05-12 00:57:29 +02:00
Edresson 3fcc748b2e implement the Speaker Encoder H/ASP 2021-05-11 16:27:05 -03:00
Eren Gölge 843d1b3d98 linter fixes 2021-05-11 11:30:00 +02:00
Eren Gölge 19fb1d743d style update 2021-05-11 11:30:00 +02:00
Eren Gölge 6e980b49c4 fix synthesizer.py for Coqpit 2021-05-11 11:29:18 +02:00
Eren Gölge db14dcd95a remove old load_config 2021-05-11 11:29:18 +02:00
Eren Gölge a21ac883dd add get_cuda() 2021-05-11 11:29:18 +02:00
Eren Gölge 21dd4d7960 fix load_config imports for Coqpit 2021-05-11 11:29:18 +02:00
Eren Gölge c57f0b46bb reintro use_gst for backwars compat 2021-05-11 11:29:18 +02:00
Eren Gölge 18e76a2309 fix speaker encoder model initialization 2021-05-11 11:29:18 +02:00
Eren Gölge 10de40bba1 make num_workers mandatory config field 2021-05-11 11:29:18 +02:00
Eren Gölge df1ddd3539 allow read_json_with_comments for backward compat 2021-05-11 11:29:18 +02:00
Eren Gölge 9f7599e3c3 fix train_encoder for coqpit 2021-05-11 11:29:18 +02:00
Eren Gölge f8e52965dd add speaker encoder coqpit 2021-05-11 11:29:18 +02:00
Eren Gölge ce2bba543e remove extra from utils and move funcs to io.py 2021-05-11 11:29:18 +02:00
Eren Gölge 812dbc2b06 rm config.json 2021-05-11 11:29:18 +02:00
Eren Gölge 3fde2001b1 train_encoder refactoring for coqpit 2021-05-11 11:29:18 +02:00
Eren Gölge 9ee70af9bb code styling 2021-05-11 11:29:18 +02:00
Eren Gölge 10db2baa06 global shared Coqpit configs 2021-05-11 11:29:18 +02:00
Eren Gölge 3dec62b183 add Coqpits for the vocoder models 2021-05-11 11:29:18 +02:00
Eren Gölge 6f4eed94f5 remove *.json vocoder configs 2021-05-11 11:29:18 +02:00
Eren Gölge 78b3825d0b update train scripts for coqpit 2021-05-11 11:29:18 +02:00
Eren Gölge 757e90b1cc load_config function to initialize the right Coqpit for the given model 2021-05-11 11:29:18 +02:00
Eren Gölge e6f45b9eb7 update train_vocoder_gan.py for coqpit 2021-05-11 11:29:18 +02:00
Eren Gölge bcebd69d09 remove bash tts training tests 2021-05-11 11:29:17 +02:00
Eren Gölge 7663bc63c1 add Coqpit configs for the TTS models 2021-05-11 11:29:17 +02:00
Eren Gölge 7227e8f1d2 update train_align_tts.py for coqpit 2021-05-11 11:29:17 +02:00
Eren Gölge 51a7e06945 glow_tts_config.py and train test on python 2021-05-11 11:29:17 +02:00
Eren Gölge 720fe13056 update glow_tts modules and training script for coqpit use 2021-05-11 11:29:17 +02:00
Eren Gölge 816e7ee698 remove default configs.json as replacing with Coqpit configs 2021-05-11 11:29:17 +02:00
Eren Gölge 35341d5482 move bash script based tests to python with coqpit 2021-05-11 11:29:17 +02:00
Eren Gölge 647163397d coqpit refactoring 2021-05-11 11:29:17 +02:00
Eren Gölge eaa130e813 fix tacotron for coqpit 2021-05-11 11:29:17 +02:00
Eren Gölge 65d7ad4250 refactor train_speedy_speech.py for coqpit 2021-05-11 11:29:17 +02:00
Eren Gölge 4a58fdfd59 comment out check-arguments before copying fields to the configs 2021-05-11 11:29:17 +02:00
Eren Gölge 05d9543ed8 init GST module using gst config in Tacotron models 2021-05-11 11:29:17 +02:00
Eren Gölge 93a00373f6 move split_dataset 2021-05-11 11:29:17 +02:00
Eren Gölge 9c18e40f64 black formatting 2021-05-11 11:29:17 +02:00
Eren Gölge c34c8137d7 update compute_statistics for coqpit 2021-05-11 11:29:17 +02:00
Eren Gölge 79d7215142 config refactor #5 WIP 2021-05-11 11:29:17 +02:00
Eren Gölge dc50f5f0b0 config refactor #4 WIP 2021-05-11 11:28:35 +02:00
Eren Gölge 97bd5f9734 [ci skip] config update #3 WIP 2021-05-11 11:28:35 +02:00
Eren Gölge a21c0b5585 config update 2 WIP 2021-05-11 11:28:35 +02:00
Eren Gölge e092ae40dc config update WIP 2021-05-11 11:28:35 +02:00
Eren Gölge 06f80a4806 update check argument 2021-05-11 11:28:35 +02:00
Eren Gölge bf7ddfa542 Merge pull request #481 from chmodsss/main: Accessing __version__ command 2021-05-11 10:20:48 +02:00
Edresson 85ccad7e0a add Audio data augamentation Addtive and RIR 2021-05-11 00:59:57 -03:00
Edresson 77d85c6cc5 add softmaxproto loss and bug fix in data loader 2021-05-10 17:08:38 -03:00
chmodsss 607d5cf377 [#480] Adding version variable 2021-05-10 19:46:34 +02:00
Adam Froghyar 7ddc885f37 deleted a line the broke GravesAttention 2021-05-10 15:42:59 +02:00
Edresson 78bad25f2b update voxceleb download link 2021-05-07 23:45:15 -03:00
Eren Gölge f7582107da Merge pull request #453 from Edresson/dev: Script for spectrogram extraction using teacher forcing and Glow-TTS inference with MAS. 2021-05-06 17:53:28 +02:00
Edresson 501c8e0302 remove unused vars on extract tts spectrograms script 2021-05-04 19:04:13 -03:00
Eren Gölge 0325c58862 Merge pull request #468 from shaun95/patch-1: Update losses.py 2021-05-03 14:45:24 +02:00
Eren Gölge 8cb27267a4 formatting 2021-05-03 14:26:35 +02:00
Eren Gölge 87d674a038 bumpup librosa version to 0.8.0 2021-05-03 14:25:09 +02:00
shaun 7d0ec62bf1 Update losses.py: The block of code for use_l1_spec_loss is repeated which doubles the amount of L1 loss when enabled. The weight for L1 loss in hifigan_ljspeech configutation will likely need to be doubled to compensate (l1_spec_loss_weight) 2021-05-02 14:14:24 +02:00
Edresson 3ecd556bbe add unit test for extract tts spectrograms script 2021-05-01 13:41:56 -03:00
Edresson 446b1da936 create inference function 2021-04-29 18:18:37 -03:00
Eren Gölge f02f0338c2 fix .models.json and add testing to check released models availability 2021-04-29 09:32:36 +02:00
Eren Gölge fd95e9b8a4 [ci skip] Add sam models 2021-04-28 21:57:31 +02:00
Agrin Hilmkil 351d0ed6ae Remove unnecessary fsspec usage 2021-04-28 11:21:08 +02:00
Agrin Hilmkil 167f86417e Move dev, tf, notebook dependencies to extras 2021-04-28 11:20:06 +02:00
Eren Gölge 1235e54738 test for synthesize.py 2021-04-27 14:17:38 +02:00
Eren Gölge 4719414f2e remove imports 2021-04-27 11:25:17 +02:00
Eren Gölge add97cddc1 move function and remove import 2021-04-27 11:22:56 +02:00
Eren Gölge 734e6a515c bug fix 2021-04-27 10:27:45 +02:00
Eren Gölge 6bdd81667e place holders for sc-glow and hifigan models 2021-04-26 19:53:12 +02:00
Eren Gölge 2f0716073e enable multi-speaker CoquiTTS models for synthesize.py 2021-04-26 19:36:53 +02:00
Eren Gölge b531fa699c remove conflicy noise 2021-04-26 15:27:52 +02:00
Eren Gölge f37b488876 Merge branch 'speaker-manager' of https://github.com/coqui-ai/TTS into speaker-manager 2021-04-26 15:25:25 +02:00
Eren Gölge b82daa5e86 style and linter fixes 2021-04-26 15:22:24 +02:00
Edresson 20e42a3381 add save audio option 2021-04-23 15:00:00 -03:00
Edresson 8228091f92 add script for extraction of tts spectrograms 2021-04-23 14:17:46 -03:00
Eren Gölge 4cf211348d styling and linting 2021-04-23 18:04:37 +02:00
Eren Gölge 7eb0c60d2e let synthesizer to pass speaker encoder file paths to speaker manager 2021-04-23 18:04:37 +02:00
Eren Gölge f69195739e let speaker manager compute mean x_vector from multiple wav files 2021-04-23 18:04:37 +02:00
Eren Gölge 179722e3a7 new arguments to synthesize.py for loading speaker encoder and speaker wavs 2021-04-23 18:04:37 +02:00
Eren Gölge dfa415a8b8 small refactor in server.py 2021-04-23 18:04:37 +02:00
Eren Gölge c80d21f311 load speaker_encoder_ap and compute x_vector directly from the input file in speaker manager 2021-04-23 18:04:37 +02:00
Eren Gölge ad047c8195 html formatting, enable multi-speaker model on the server with a dropdown menu to select the speaker 2021-04-23 18:04:37 +02:00
Eren Gölge f9f3d04d14 remove moved function 2021-04-23 18:04:37 +02:00
Eren Gölge 10c988ac8c update server.py 2021-04-23 18:04:37 +02:00
Eren Gölge 6d0f5e0459 use SpeakerManager in Synthesizer 2021-04-23 18:04:37 +02:00
Eren Gölge e97126314c add ```unique``` argument to make_symbols to fix the incompat. issue of the SC-Glow models 2021-04-23 18:04:37 +02:00
Eren Gölge d08888e603 formating speakers.py 2021-04-23 18:04:37 +02:00
Eren Gölge df422223a3 initial SpeakerManager implementation 2021-04-23 18:04:37 +02:00
Eren Gölge 7a7aeb35f5 fix the glow-tts in setup_model 2021-04-23 18:04:37 +02:00
Eren Gölge d42748082a update argument name external_speaker_embedding_dim -> speaker_embedding_dim; add inference_noise_scale argument to glow-tts 2021-04-23 18:04:37 +02:00
Eren Gölge 2da81f5bb6 add load_chekpoint to speaker encoder 2021-04-23 18:04:37 +02:00
Eren Gölge 1229ccbf07 update argument name in server.py 2021-04-23 18:04:37 +02:00
Eren Gölge af2d36faeb update synthesize.py for multi-speaker setting 2021-04-23 18:04:37 +02:00
Eren Gölge 99dc07a7dd add ```unique``` param to keep scglow models compatible (they are duplicate symbols ins the character set) 2021-04-23 18:04:37 +02:00
Eren Gölge c955a12428 set the default layer size compatible with scglow 2021-04-23 18:04:37 +02:00
Eren Gölge 3ace2440fa fix a mistake from rebase 2021-04-23 18:04:37 +02:00
Eren Gölge aadb2106ec code styling 2021-04-23 18:04:37 +02:00
Eren Gölge af7baa3387 refactoring to allow defining the speaker file externally 2021-04-23 18:04:37 +02:00
kirianguiller 7dccbfdcd5 handle multi speaker and gst in Synthetizer class 2021-04-23 18:04:37 +02:00
Edresson d2b6326b8b change optimizer initialization for compatibility with Hifi-GAN official implementation 2021-04-23 07:54:39 -03:00
WeberJulian 4205284f92 Change name of the functions 2021-04-23 10:09:55 +02:00
WeberJulian a26498181b Change back the default value 2021-04-22 16:10:17 +02:00
Julian Weber 355e1f47ab fix dumb mistake 2021-04-22 15:50:29 +02:00
Julian Weber c125b71f36 fix windows support 2021-04-22 15:14:24 +02:00
Jörg Thalheim f5fd7f78d4 server: also listen to ipv6. The [::] address will listen to both ipv4/ipv6 addresses. 2021-04-22 12:38:55 +02:00
Eren Gölge ef37633cb3 [ci skip] use prenet_dropout by default with Tacotron models 2021-04-22 12:38:55 +02:00
Eren Gölge e1d960da9e use SpeakerManager in Synthesizer 2021-04-21 13:13:27 +02:00
Eren Gölge 04b6881b66 add ```unique``` argument to make_symbols to fix the incompat. issue of the SC-Glow models 2021-04-21 13:12:35 +02:00
Eren Gölge 790946faec formating speakers.py 2021-04-21 13:12:11 +02:00
Eren Gölge ab313814de initial SpeakerManager implementation 2021-04-21 13:11:46 +02:00
Eren Gölge 09890c7421 fix the glow-tts in setup_model 2021-04-21 13:10:40 +02:00
Eren Gölge 8764d02eb2 update argument name external_speaker_embedding_dim -> speaker_embedding_dim; add inference_noise_scale argument to glow-tts 2021-04-21 13:09:44 +02:00
Eren Gölge 8b40720977 add load_chekpoint to speaker encoder 2021-04-21 13:09:04 +02:00
Eren Gölge 37cad38c27 update argument name in server.py 2021-04-21 13:08:45 +02:00
Eren Gölge 9bccee9da8 update synthesize.py for multi-speaker setting 2021-04-21 13:08:25 +02:00
Eren Gölge d2fa8add1f add ```unique``` param to keep scglow models compatible (they are duplicate symbols ins the character set) 2021-04-16 19:40:13 +02:00
Eren Gölge d9612a4351 set the default layer size compatible with scglow 2021-04-16 19:40:13 +02:00
Eren Gölge 1038fd420d fix a mistake from rebase 2021-04-16 19:39:47 +02:00
Eren Gölge 47e356cb48 code styling 2021-04-16 16:01:40 +02:00
Eren Gölge 25328aad00 refactoring to allow defining the speaker file externally 2021-04-16 15:59:57 +02:00
kirianguiller 48ae52a9a3 handle multi speaker and gst in Synthetizer class 2021-04-16 15:54:49 +02:00
Eren Gölge a53958ae3a fix urls for the new models 2021-04-15 17:05:00 +02:00
Eren Gölge 9cc17be53a formatting and a small bug fix in Tacotron model 2021-04-15 16:36:51 +02:00
Eren Gölge 1ad838bc83 add newly released models under .model.json 2021-04-15 16:06:10 +02:00
Eren Gölge 7cada1a949 remove noise 2021-04-15 15:30:45 +02:00
Eren Gölge d60a8d7211 show the real waveform on TB too for GAN vocoder training. 2021-04-15 15:30:06 +02:00
Eren Gölge 5fbe926429 change the default TTS model to TacotronDDC 2021-04-15 15:29:44 +02:00
Eren Gölge 3de5a89154 optionally enable prenet dropout at inference time for tacotron models 2021-04-13 13:24:56 +02:00
Eren Gölge 28a2fed8a3 update hifigan in .model.json 2021-04-12 16:48:05 +02:00
Eren Gölge abaf36861a aligntts model .model.json placeholder 2021-04-12 16:43:52 +02:00
Eren Gölge 480e2f7888 docstring update and better handling make_symbols 2021-04-12 16:40:49 +02:00
Eren Gölge b735076bb4 linter fixes 2021-04-12 13:14:11 +02:00
Eren Gölge b11d1cb845 small fixes 2021-04-12 12:40:55 +02:00
Eren Gölge a7f6045644 Merge branch 'reformat' into hifigan-reformat 2021-04-12 12:00:17 +02:00
Eren Gölge f519012dea reformatting and styling 2021-04-12 11:47:39 +02:00
Eren Gölge 9011dddf77 tacotron DDC placeholder in models.json 2021-04-12 04:06:27 +02:00
Eren Gölge d295d5de97 remove torch.no_grad from TorchSTFT 2021-04-10 19:43:57 +02:00
Eren Gölge 5b70da2e3f restore schedulers only if training is continuing a previous training; inherit nn.Module for TorchSTFT 2021-04-09 19:31:28 +02:00
Eren Gölge 2c71c6d8cd [ci skip]update gan vocoder configs to reflect the recent changes 2021-04-09 17:15:32 +02:00
Eren Gölge 2b529f60c8 update default hifigan config 2021-04-09 11:40:06 +02:00
Eren Gölge 105e0b4d62 vocoder gan training fixes 2021-04-09 11:38:04 +02:00
Eren Gölge 87ee6ceb57 style update #3 2021-04-09 01:17:15 +02:00
Eren Gölge 18d9ec8036 format with black 2021-04-09 00:54:59 +02:00
Eren Gölge e5b9607bc3 isort all imports 2021-04-09 00:45:20 +02:00
Eren Gölge 0e79fa86ad format with black and pylint 2.7.3 2021-04-09 00:38:08 +02:00
Eren Gölge cd69da4868 linter fixes #2 2021-04-08 16:57:46 +02:00
Eren Gölge 4d3e1e9d9a linter fix 2021-04-08 14:57:46 +02:00
Eren Gölge 53f54898bc small fixes 2021-04-08 14:22:47 +02:00
Eren Gölge 006b1d3aaa bug fix 2021-04-08 13:17:45 +02:00
Eren Gölge 3f0993aebe remove junk 2021-04-08 12:17:02 +02:00
Eren Gölge 0ee0458309 remove redundant imports 2021-04-08 11:29:15 +02:00
Eren Gölge 773f1db6fa refactor HifiGAN discriminator 2021-04-08 11:28:30 +02:00
Eren Gölge 15f362d5b1 formatting 2021-04-08 11:28:30 +02:00
Eren Gölge aee24b0704 set different seed in gan_dataset when it is multi-workers 2021-04-08 11:28:30 +02:00
Eren Gölge 6ee211c137 remove stft params causing warning 2021-04-08 11:28:30 +02:00
Eren Gölge 4998ece8d8 allow configuration of optimziers from the config file 2021-04-08 11:28:30 +02:00
Eren Gölge 8daf407652 cache empty 2021-04-08 11:28:30 +02:00
Eren Gölge 3fb78c004a move scheduler updates to the end of the epoch 2021-04-08 11:28:30 +02:00
Eren Gölge 2a872c98aa don't call os.exit as it leaves the process resources standing 2021-04-08 11:27:40 +02:00
Eren Gölge 7cecd2fb2e add hifigan D 2021-04-08 11:27:40 +02:00
Eren Gölge 13dca6e6b6 revert some of Hifigan generator updates 2021-04-08 11:27:40 +02:00
Eren Gölge 02bc776c35 prevenet grad in TorchSTFT 2021-04-08 11:27:40 +02:00
Eren Gölge cf44624df8 more docstring 2021-04-08 11:27:40 +02:00
Eren Gölge d95b1458e8 Linter fixes and docstrings for HiFiGAN 2021-04-08 11:27:40 +02:00
Eren Gölge bd7a1c177b fix #419 2021-04-08 11:26:41 +02:00
Eren Gölge 7726dfca99 change the upper bound in sound normalization 2021-04-08 11:26:01 +02:00
Eren Gölge 57f6bd1afa make using different samples for G and D networks optional 2021-04-08 11:26:01 +02:00
Eren Gölge 67f8248492 placeholder for finetuned sam hifigan model 2021-04-08 11:25:29 +02:00
Eren Gölge 241e968df1 load_checkpoint for hifigan and no_grad for inference 2021-04-08 11:25:29 +02:00
Eren Gölge de3a04f104 some commeting for Generator loss and check if the argument is defines in the config file 2021-04-08 11:25:29 +02:00
Eren Gölge ff07c5f5e3 update TorchSTFT to enable melspec 2021-04-08 11:25:29 +02:00
Eren Gölge 4a5b1d4ac2 update hifigan config 2021-04-08 11:24:21 +02:00
Eren Gölge e0e3b12b26 pass all parameters explicity to _istft 2021-04-08 11:23:20 +02:00
Eren Gölge f0e76ee135 initial models.json entry for universal hifigan 2021-04-08 11:23:20 +02:00
Eren Gölge d57f416957 small fixes 2021-04-08 11:22:30 +02:00
Eren Gölge 8c9e1c9e58 hifigan implementation update 2021-04-08 11:21:43 +02:00
Eren Gölge a14d7bc5db hifigan config update 2021-04-08 11:20:33 +02:00
Eren Gölge 8d4fd79cd7 update hifigan config 2021-04-08 11:20:33 +02:00
rishikksh20 e656e8b108 Remove select size bug 2021-04-08 11:20:33 +02:00
rishikksh20 b533474e3b Remove minor bugs and make code trainable 2021-04-08 11:20:33 +02:00
rishikksh20 ef6ff4e95c Add Exponential LR scheduler check 2021-04-08 11:20:33 +02:00
rishikksh20 1535777f64 1) Add ExponentialLR 2021-04-08 11:18:36 +02:00
rishikksh20 c20a6b1185 * Format the model definition * Update code and integrate training code 2021-04-08 11:18:36 +02:00
rishikksh20 39b5845810 1) Add hifigan json files 2) Rename MPD disc 3) Re-format remove weight norm generator 2021-04-08 11:14:39 +02:00
rishikksh20 7b7c5d635f 1) Combine MSD with Multi-Period disc 2) Add remove weight norm layer on Generator 2021-04-08 11:14:39 +02:00
rishikksh20 4493feb95c Add HiFi-GAN v1 generator and discriminator classes 2021-04-08 11:14:39 +02:00
Eren Gölge c86c559349 docstring and optional padding in TorchSTFT 2021-04-07 12:36:15 +02:00
Eren Gölge f890454de3 linter fixes 2021-04-07 12:36:03 +02:00
Eren Gölge 9782d9ea5d [ci skip] implement #418 2021-04-06 16:24:50 +02:00
Eren Gölge f46a275b22 update docstring 2 2021-04-06 16:24:50 +02:00
Eren Gölge ec94ff3691 update docstring 2021-04-06 16:24:50 +02:00
Eren Gölge 2048095e9a audio.py fix 2021-04-06 16:24:50 +02:00
Eren Gölge e0b3008c31 allow choosing the log function used for amptodb conversion 2021-04-06 16:24:50 +02:00
Eren Gölge 44b4cb5ba5 DCA comment 2021-04-06 16:24:50 +02:00
Eren Gölge b86e7fb2e8 pad short samples when loading precomputed features in vocoder trainign 2021-04-06 16:24:50 +02:00
Eren Gölge 6ad4eba678 gan vocoder train fix in case of restoring models wiht no scheduler is defined 2021-04-06 16:24:50 +02:00
Eren Gölge e3ccfe37ea add DE more urls 2021-04-02 14:54:41 +02:00
Eren Gölge e84f120a04 sam-accenture model preprocessor 2021-04-01 03:41:41 +02:00
Eren Gölge e3c052382b fix loading always best_model when continue 2021-04-01 03:41:15 +02:00
Eren Gölge 48ea20e69f example aligntts config 2021-03-30 14:41:00 +02:00
Eren Gölge b4c2cf80f2 fix eval iter 2021-03-30 14:39:16 +02:00
Eren Gölge a3a840fd78 linter fixes 2021-03-30 14:39:16 +02:00
Eren Gölge 6b2e13bf62 compute normalized logp using torch primitives 2021-03-30 14:39:16 +02:00
Eren Gölge 7a382a5c2b stowed aligntts commit and small refactoring with feed_forward layers 2021-03-30 14:39:16 +02:00
Eren Gölge d542a50818 fix losses for alignTTS 2021-03-30 14:39:16 +02:00
Eren Gölge 18cc7b95ec update l1 and huber to mse loss 2021-03-30 14:39:16 +02:00
Eren Gölge 896d33ed49 update losses to hande alingtts phases 2021-03-30 14:39:16 +02:00
Eren Gölge aec0b78aff duration predictor fix 2 2021-03-30 14:39:16 +02:00
Eren Gölge 07269e639b fix duration predictor in AlignTTS 2021-03-30 14:39:16 +02:00
Eren Gölge c2d29e5cd4 FFTransformer encoder for aligntts 2021-03-30 14:39:16 +02:00
Eren Gölge 460a2d3e26 FFTransformer Decoder for AlignTTS 2021-03-30 14:39:16 +02:00
Eren Gölge 844e8e0ed4 adapt align_tts and model name handling 2021-03-30 14:39:16 +02:00
Eren Gölge aa29f5b199 aligntts loss 2021-03-30 14:39:16 +02:00
Eren Gölge a831468cab align tts MDN layer 2021-03-30 14:39:16 +02:00
Eren Gölge 4396f8e2da continue refactoring 2021-03-30 14:39:16 +02:00
Eren Gölge 892c3c3623 use torch for AngleProtoLoss 2021-03-30 14:39:16 +02:00
Eren Gölge 2b3e12ea49 correct imports after refactoring, add AlignTTS (old SSMAS) and some formatting 2021-03-30 14:39:16 +02:00
Eren Gölge ecb6b0d6ad rename GlowTtts as GlowTTS 2021-03-30 14:39:16 +02:00
Eren Gölge e8cf8cb00e restructure TF tacotron files 2021-03-30 14:39:16 +02:00
Eren Gölge 1ac99ce0d0 if git is not available set git has 'unknown' 2021-03-30 14:39:16 +02:00
Eren Gölge d9c405f0c3 create feedforward folder for SS layers 2021-03-30 14:39:16 +02:00
Eren Gölge a8cf1ae6b4 fix wavenet running with no input mask 2021-03-30 14:39:16 +02:00
Eren Gölge 1c1949d348 utf-8 encoding for certain preprocessors 2021-03-30 14:39:16 +02:00
Eren Gölge ca2f22cdd7 linter fix 2021-03-30 14:36:12 +02:00
Eren Gölge d0dcd7d1b8 let the user define outpu.wav file path fix #393 2021-03-30 14:24:31 +02:00
Eren Gölge 25654233d5 [ci skip]initial commit for the new DE models and stale ot update 2021-03-29 03:23:57 +02:00
Guy Elsmore-Paddock 15459627cc Fix `UnicodeEncodeError` on Windows Platforms
Prevents the following error from appearing when running training on Windows platforms:
```
UnicodeEncodeError: 'charmap' codec can't encode characters in position: character maps to <undefined>
```
2021-03-20 17:30:00 -04:00
Eren Gölge 3947750dd9 Merge branch 'dev' of https://github.com/coqui-ai/TTS into dev 2021-03-18 14:09:47 +01:00
WeberJulian 4a9d2e4309 fix french_cleaners 2021-03-18 13:35:29 +01:00
WeberJulian 596ea2c98a Add resample script 2021-03-18 13:33:37 +01:00
Eren Gölge 6e68637f48 bug fix 2021-03-18 13:33:23 +01:00
Eren Gölge f3e5ddfaaf bug fix in preprocessor 2021-03-18 13:33:23 +01:00
Eren Gölge aeb4f82233 bug fix 2021-03-18 13:33:23 +01:00
Eren Gölge 0514330869 fix mozilla/TTS#685 2021-03-18 13:33:23 +01:00
Eren Gölge f06603a0db force utf8 2021-03-18 13:33:23 +01:00
Eren Gölge 32e8b56c45 linter fix 2021-03-18 13:33:23 +01:00
Eren Gölge 65533f33e9 fix #374 2021-03-18 13:33:00 +01:00
Eren Gölge d790d2fccb linter fix 2021-03-18 13:33:00 +01:00
WeberJulian af96080e17 fix linter issues 2021-03-18 13:33:00 +01:00
WeberJulian bf04383e74 fix french_cleaners 2021-03-18 13:33:00 +01:00
WeberJulian f6cd8e0ecc test case 2021-03-18 13:33:00 +01:00
WeberJulian e954e45e57 linter + test 2021-03-18 13:33:00 +01:00
WeberJulian e598977f3d Using path.join instead of concat 2021-03-18 13:33:00 +01:00
WeberJulian c5ef2de73f Add resample script 2021-03-18 13:33:00 +01:00
Eren Gölge 2690ab2ee5 bug fix 2021-03-16 19:15:28 +01:00
Eren Gölge 4c1aed4a9c bug fix in preprocessor 2021-03-16 19:13:32 +01:00
Eren Gölge 01e35e06c4 bug fix 2021-03-16 19:13:32 +01:00
Eren Gölge aa8bb815a7 fix mozilla/TTS#685 2021-03-16 19:13:32 +01:00
Eren Gölge a8c348ffb2 force utf8 2021-03-16 19:13:32 +01:00
Eren Gölge bf0caba0bc linter fix 2021-03-16 19:13:32 +01:00
Eren Gölge babc94f63f fix #374 2021-03-16 19:13:32 +01:00
Eren Gölge bdfd1f8a89 linter fix 2021-03-16 19:13:32 +01:00
WeberJulian 11e25a7125 fix linter issues 2021-03-16 19:13:01 +01:00
WeberJulian 1574d8dd39 fix french_cleaners 2021-03-16 19:13:01 +01:00
WeberJulian b94373afb8 test case 2021-03-16 19:13:01 +01:00
WeberJulian 93fdc0729c linter + test 2021-03-16 19:13:01 +01:00
WeberJulian 17f197f51e Using path.join instead of concat 2021-03-16 19:13:01 +01:00
WeberJulian d6749f030f Add resample script 2021-03-16 19:13:01 +01:00
Eren Gölge 838ebd6ad5 add the missing russian model 2021-03-16 18:38:35 +01:00
Eren Gölge 5c657715f2 fix #382 2021-03-16 17:31:48 +01:00
Eren Gölge 38a29ce1c9 move all models to github rls 2021-03-10 18:19:32 +01:00
Eren Gölge e5bb317242 fix model manager 2021-03-10 17:01:19 +01:00
Eren Gölge d260fb03a2 fix handling scale_stats.npy for models downloaded from Github rls 2021-03-10 16:40:30 +01:00
Eren Gölge 4aba4e5b1e linter fx 2021-03-10 15:33:11 +01:00
Eren Gölge 6c932c8503 print the desc if required parameters are not provided 2021-03-10 15:19:00 +01:00
Eren Gölge 9e84c8a623 do not copy scale_stats if exist in the output folder 2021-03-10 15:13:55 +01:00
Eren Gölge 7782034e7e fix #369 2021-03-10 15:13:21 +01:00
Eren Gölge 4337e9ff87 pad_mode in torch_stft 2021-03-10 14:41:00 +01:00
Eren Gölge 599149a7e5 downloading models from github releases 2021-03-10 11:09:01 +01:00
Eren Gölge fc19411ac6 update some of the models to github releases 2021-03-10 11:08:15 +01:00
Eren Gölge 19bb9ba851 fix tts endpoint using list-models argument 2021-03-09 14:06:09 +01:00
Eren Gölge 43379eecef fix the nl model and add the vocoder 2021-03-09 14:05:56 +01:00
r-dh 8a4dcd152f Add Dutch model 2021-03-09 13:22:19 +01:00
Eren Gölge 94805236fb Merge branch 'dev' of https://github.com/coqui-ai/TTS into dev 2021-03-08 15:21:06 +01:00
Eren Gölge 5dcc4be560 rebrand demo server 2021-03-08 14:51:04 +01:00
Eren Gölge 947e3d6a93 rename test 2021-03-08 14:50:54 +01:00
Eren Gölge a519ed52f2 deprecate embedding models to the wheel 2021-03-08 14:06:15 +01:00
Eren Gölge c16ad38930 update server rEADME 2021-03-08 14:05:59 +01:00
Eren Gölge 594d8d8f09 linter fixes 2021-03-08 11:22:59 +01:00
Eren Gölge 00b5090974 linter fix 2021-03-08 11:05:30 +01:00
Eren Gölge e15734c3fc linter fix 2021-03-08 05:29:43 +01:00
Eren Gölge 9a48ba3821 a ton of linter updates 2021-03-08 05:06:54 +01:00
Eren Gölge e03a426378 bug fix 2021-03-08 02:59:48 +01:00
kirianguiller 628afe5cb0 remove gst handling in synthetizer.py class 2021-03-08 02:59:48 +01:00
kirianguiller 557239db7f remove re.Match typing in '_number_replace()' 2021-03-08 02:59:48 +01:00
kirianguiller 9ab07f94e2 modify according to PR reviews 2021-03-08 02:59:48 +01:00
kirianguiller 42ba30eb8f <add> Chinese mandarin implementation (tacotron2) 2021-03-08 02:59:24 +01:00
kirianguiller 49665783a6 remove gst handling in synthetizer.py class 2021-03-08 02:57:11 +01:00
kirianguiller e85658ac2b remove re.Match typing in '_number_replace()' 2021-03-08 02:57:11 +01:00
kirianguiller 0d4525322c modify according to PR reviews 2021-03-08 02:57:11 +01:00
kirianguiller e6fd118cf8 <add> Chinese mandarin implementation (tacotron2) 2021-03-08 02:57:11 +01:00
Eren Gölge e3102e753c enable backward compat for loading the best model 2021-03-08 02:57:11 +01:00
gerazov 2451a813a2 refactored keep_all_best 2021-03-08 02:57:11 +01:00
gerazov 8cefa76bae reformated docstrings in arguments.py 2021-03-08 02:57:11 +01:00
gerazov 2db40457e8 brushed up printing model load path and best loss path 2021-03-08 02:56:36 +01:00
gerazov f2e474cd37 loading last checkpoint/best_model works, deleting last best models options added, loading last best_loss added 2021-03-08 02:56:36 +01:00
Eren Gölge 4111df6769 Docstrings for audioprocessor 2021-03-08 02:54:47 +01:00
Eren Gölge 2ca74b8ab3 add RUSLAN dataset preprocessor 2021-03-08 02:54:47 +01:00
Eren Gölge 8993120634 do not test server and modelManager until fixing #657 2021-03-08 02:54:47 +01:00
Adonis Pujols 89b7f01534 add encoding="utf-8" 2021-03-08 02:54:47 +01:00
Eren Gölge ffceccb021 fix #655 2021-03-08 02:54:47 +01:00
Eren Gölge 534c341f16 linter update 2021-03-08 02:54:47 +01:00
Eren Gölge 0e1e60bef0 remove redundancy 2021-03-08 02:54:47 +01:00
Eren Gölge 93a83c0068 Update TTS/utils/arguments.py (Co-authored-by: Jörg Thalheim <Mic92@users.noreply.github.com>) 2021-03-08 02:54:47 +01:00
Eren Gölge 39fbf2fe84 Update TTS/bin/find_unique_chars.py (Co-authored-by: Jörg Thalheim <Mic92@users.noreply.github.com>) 2021-03-08 02:54:47 +01:00
Eren Gölge ee71eb4eb7 linter fixes 2021-03-08 02:54:47 +01:00
Eren Gölge 55fc50b26d update test_text_processing for espeak-ng 2021-03-08 02:54:47 +01:00
Eren Gölge 5b8a6736a7 remove _phoneme_punctuations 2021-03-08 02:54:47 +01:00
Eren Gölge 194f82de51 save default model chars to the training config file 2021-03-08 02:54:47 +01:00
Eren Gölge 62a8eba3b2 parse_characters function 2021-03-08 02:54:47 +01:00
Eren Gölge 0b33acdcca enable saving model characters in io.py 2021-03-08 02:54:47 +01:00
Eren Gölge f9fe167537 docstring update 2021-03-08 02:54:47 +01:00
Eren Gölge 62aeacbdd1 save used model characters to the checkpoints 2021-03-08 02:54:47 +01:00
Eren Gölge e06c93fe81 model_manager tests 2021-03-08 02:54:47 +01:00
Eren Gölge fe41084eb3 author , license and contact info in .model.json 2021-03-08 02:54:47 +01:00
nmstoker ae0d54ddae Updating models list to include EK1 TTS/vocoder 2021-03-08 02:54:47 +01:00
Eren Gölge c6702b5b9f find unique characters in a dataset 2021-03-08 02:54:47 +01:00
Eren Gölge dad3565379 use default vocoders in server.pu 2021-03-08 02:54:47 +01:00
Eren Gölge d30608ab17 set an output_sample_rate in synthesizer and use it for writing the wav file 2021-03-08 02:54:47 +01:00
Eren Gölge 3ccb015cd8 return the json entry of the downloaded model 2021-03-08 02:54:47 +01:00
Eren Gölge 00e0933f43 save_wav with a custom sampling rate 2021-03-08 02:54:47 +01:00
Eren Gölge 9fefc79f0c fix make_symbols 2021-03-08 02:54:47 +01:00
Eren Gölge 8955333e9d use default vocoder in synthesize.py 2021-03-08 02:54:47 +01:00
Eren Gölge 23b282f600 define default vocoders 2021-03-08 02:54:47 +01:00
Eren Gölge 6bd8485d10 bug fix 2021-03-08 02:54:47 +01:00
Eren Gölge 5f1018abee fix spelling of a def argument and parse phonemes from config.json if use_phonemes is True 2021-03-08 02:54:47 +01:00
Eren Gölge 1c1abb8a9b docstring update 2021-03-08 02:54:47 +01:00
Eren Gölge 6cd642c2e1 add missing phonemes to test_config.json 2021-03-08 02:54:47 +01:00
Eren Gölge 43b951018e fix the default vocoder name 2021-03-08 02:54:47 +01:00
Adonis Pujols 81b145c321 spelling error. should be multiband not mulitband 2021-03-08 02:54:47 +01:00
Adonis Pujols 59b1b13e07 spelling error. should be multiband not mulitband 2021-03-08 02:54:47 +01:00
Eren Gölge ee58ff2d38 add russian phoneme char 2021-03-08 02:54:47 +01:00
Eren Gölge 29d928d531 css10 dataset preprocessor 2021-03-08 02:54:47 +01:00
Eren Gölge 49771f2541 download github model releases by model manager 2021-03-08 02:54:21 +01:00
Eren Gölge 3c961370e7 linter fixes 2021-03-08 02:54:21 +01:00
gerazov 2b5cb24db7 final final fixes 2021-03-08 02:54:21 +01:00
gerazov b3c5cc2cdc final fixes 2021-03-08 02:54:21 +01:00
gerazov 10d5a63d49 updated to current dev 2021-03-08 02:54:21 +01:00
gerazov 6f06e31541 changed train scripts 2021-03-08 02:54:21 +01:00
gerazov 2daca15802 restructured arg parsing and processing to utils 2021-03-08 02:54:21 +01:00
Eren Gölge 2fbe4a1b8a fix gdown 2021-03-08 02:54:21 +01:00
Branislav Gerazov ed56944c4a improve robustness of defining wavernn in config file 2021-03-08 02:54:21 +01:00
Branislav Gerazov 5e2bc8c99f update wavernn test config, delete cap=True 2021-03-08 02:54:21 +01:00
Branislav Gerazov b1e3160884 waveRNN fix 2021-03-08 02:54:21 +01:00
Eren Gölge 08581deb61 linter updates 2021-03-08 02:53:02 +01:00
Thorsten Mueller 167901813d Ups. Added missing , 2021-03-08 02:53:02 +01:00
Eren Gölge 93a6bdfd6c linter fixes and version updates for deps 2021-03-08 02:51:10 +01:00
Eren Gölge a30a231566 unpin cython version and comment out pyworld in audio.py causing dep
issues
2021-03-08 02:50:15 +01:00
Thorsten Mueller 3eb00e8d93 Set out_path to be required param. 2021-03-08 02:49:15 +01:00
Alexander Korolev ace430d5e6 fix device mismatch wavegrad training
this should fix the device mismatch as seen here https://github.com/mozilla/TTS/issues/622#issue-789802916
2021-03-08 02:49:15 +01:00
Eren Gölge 83143fbe39 fix #638 2021-03-08 02:48:31 +01:00
Eren Gölge 30c3bef3f9 move hubconf 2021-03-08 02:48:31 +01:00
Eren Gölge bbea6a0884 hubconf.py and load .models.json from the default location by manage.py 2021-03-08 02:48:31 +01:00
Eren Gölge 90d4f08d6c reorder imports 2021-03-08 02:48:31 +01:00
Eren Gölge db231c83fc distill import statement, check python version in setup.py 2021-03-08 02:48:31 +01:00
Thorsten Mueller 915ec1faac Added info if model already downloaded in --list_models 2021-03-08 02:48:31 +01:00
Alexander Korolev b4bc5f6eb1 update fixed stopnet_pos_weight parameter
config parameter c.stopnet_pos_weight has currently no effect as it is not used.
2021-03-08 02:48:31 +01:00
Eren Gölge 534e3c67c6 README update, set default models for synthesize.py and server.py. Disable verbose for ap init. 2021-03-08 02:48:31 +01:00
kirianguiller 7f36d91131 update chinese model 2021-03-01 14:55:05 +01:00
Eren Gölge 547bfc4ce9 bug fix 2021-02-18 18:24:03 +00:00
Eren Gölge adaeec57ec Merge branch 'dev' of https://github.com/mozilla/TTS into dev 2021-02-18 17:21:09 +00:00
Eren Gölge 5b70c8ba4f enable backward compat for loading the best model 2021-02-18 17:20:36 +00:00
Eren Gölge e4f81d6856
Merge pull request #654 from kirianguiller/chinese-implementation
Chinese implementation (merge into dev)
2021-02-18 17:15:32 +01:00
kirianguiller 22a6bbfa80 remove gst handling in synthetizer.py class 2021-02-17 20:53:56 +01:00
kirianguiller 3911b87e54 remove re.Match typing in '_number_replace()' 2021-02-17 20:53:56 +01:00
kirianguiller fb0655d1e7 modify according to PR reviews 2021-02-17 20:53:56 +01:00
kirianguiller c4c7bc1b88 <add> Chinese mandarin implementation (tacotron2) 2021-02-17 20:53:56 +01:00
Eren Gölge d0454461de Merge branch 'pr/gerazov/650-2' into dev 2021-02-17 13:40:45 +00:00
Eren Gölge a8ea0ea6ce Docstrings for audioprocessor 2021-02-17 13:35:41 +00:00
Eren Gölge f6e6314910 add RUSLAN dataset preprocessor 2021-02-17 13:35:23 +00:00
Eren Gölge ce0c5eccbd do not test server and modelManager until fixing #657 2021-02-17 00:35:43 +00:00
gerazov 61c88beb94 refactored keep_all_best 2021-02-15 18:40:17 +01:00
Eren Gölge eb543c027e Merge branch 'dev' of https://github.com/mozilla/TTS into dev 2021-02-15 17:06:40 +00:00
Eren Gölge 8a106e0527 fix #655 2021-02-15 17:06:03 +00:00
Eren Gölge 216945e653
Merge pull request #647 from adonispujols/patch-1
Easy Fix for #454 (which was somehow deleted?)
2021-02-15 13:17:17 +01:00
Eren Gölge 06a3ba2fe2 linter update 2021-02-15 12:10:19 +00:00
Eren Gölge 7f58fa365b Merge branch 'save_characters' into dev 2021-02-15 12:07:28 +00:00
Eren Gölge ff218e2370 remove redundancy 2021-02-15 12:07:02 +00:00
Eren Gölge 80af8ca5e1
Update TTS/utils/arguments.py
Co-authored-by: Jörg Thalheim <Mic92@users.noreply.github.com>
2021-02-15 13:03:59 +01:00
Eren Gölge 3b6ce04332
Update TTS/bin/find_unique_chars.py
Co-authored-by: Jörg Thalheim <Mic92@users.noreply.github.com>
2021-02-15 13:02:29 +01:00
Eren Gölge dc3596dad4 model_manager tests 2021-02-15 11:29:22 +00:00
Eren Gölge 77e630348e author, license and contact info in .model.json 2021-02-15 11:02:21 +00:00
Eren Gölge e1bc823e44 Merge branch 'pr/nmstoker/652' into dev 2021-02-15 10:57:12 +00:00
nmstoker 33bcdc6ff8 Updating models list to include EK1 TTS/vocoder 2021-02-14 23:44:05 +00:00
Eren Gölge 420901f4c2 linter fixes 2021-02-12 14:41:17 +00:00
Eren Gölge 4244096ccb update test_text_processing for espeak-ng 2021-02-12 14:07:26 +00:00
Eren Gölge b28c724c04 remove _phoneme_punctuations 2021-02-12 12:10:57 +00:00
Eren Gölge 7ab527d17e save default model chars to the training config file 2021-02-12 12:06:46 +00:00
Eren Gölge 593cedee14 parse_characters function 2021-02-12 12:05:56 +00:00
Eren Gölge 2abfff17f9 enable saving model characters in io.py 2021-02-12 12:04:41 +00:00
Eren Gölge 918f007a11 docstring update 2021-02-12 12:04:07 +00:00
Eren Gölge e774f68aee save used model characters to the checkpoints 2021-02-12 12:03:42 +00:00
gerazov 0e78e31dbf reformatted docstrings in arguments.py 2021-02-12 11:36:01 +01:00
gerazov 310d18325e brushed up printing model load path and best loss path 2021-02-12 10:55:45 +01:00
Eren Gölge 8b6fd76ad2 find unique characters in a dataset 2021-02-12 09:46:11 +00:00
gerazov af46727517 loading last checkpoint/best_model works, deleting last best models options added, loading last best_loss added 2021-02-12 02:12:00 +01:00
Eren Gölge a1e595790d use default vocoders in server.py 2021-02-11 15:31:39 +00:00
Eren Gölge 8aa6a0decb set an output_sample_rate in synthesizer and use it for writing the wav
file
2021-02-11 15:28:07 +00:00
Eren Gölge 0c52d27d65 return the json entry of the downloaded model 2021-02-11 15:27:41 +00:00
Eren Gölge 1649ad3431 save_wav with a custom sampling rate 2021-02-11 15:27:20 +00:00
Eren Gölge 43f54d2dce fix make_symbols 2021-02-11 15:26:52 +00:00
Eren Gölge 0657b38111 use default vocoder in synthesize.py 2021-02-11 15:26:17 +00:00
Eren Gölge 2043a9b5f5 define default vocoders 2021-02-11 15:25:55 +00:00
Eren Gölge ff27690ca7 bug fix 2021-02-11 13:43:29 +00:00
Eren Gölge bc131208be fix spelling of a def argument and parse phonemes from config.json if
use_phonemes is True
2021-02-11 13:04:47 +00:00
Eren Gölge f1799dbd60 docstring update 2021-02-11 11:25:31 +00:00
Eren Gölge 3baec4ea96 add missing phonemes to test_config.json 2021-02-11 11:14:39 +00:00
Eren Gölge a3d1e65b34 Merge branch 'pr/adonispujols/646' into dev 2021-02-11 10:37:29 +00:00
Eren Gölge 3c2e13ca5c fix the default vocoder name 2021-02-11 10:36:52 +00:00
Adonis Pujols 48011a8b58
add encoding="utf-8" 2021-02-11 05:26:06 -05:00
Adonis Pujols b29a7e9645
spelling error. should be multiband not mulitband 2021-02-11 04:49:28 -05:00
Adonis Pujols 6c824a6629
spelling error. should be multiband not mulitband 2021-02-11 04:48:53 -05:00
Eren Gölge b08b8ca2a1 add russian phoneme char 2021-02-10 13:30:59 +00:00
Eren Gölge 9cad435288 css10 dataset preprocessor 2021-02-09 15:11:26 +00:00
Eren Gölge cea5e517f2 download github model releases by model manager 2021-02-09 14:24:14 +00:00
Eren Gölge c619859a3f linter fixes 2021-02-09 11:43:17 +00:00
gerazov e507373b55 final final fixes 2021-02-06 23:08:47 +01:00
gerazov ad17dc9e76 final fixes 2021-02-06 23:05:01 +01:00
gerazov 8fdd08ea15 updated to current dev 2021-02-06 22:59:52 +01:00
gerazov 2705d27b28 changed train scripts 2021-02-06 22:29:30 +01:00
gerazov 4f8f274d6e restructured arg parsing and processing to utils 2021-02-06 22:25:56 +01:00
Eren Gölge e7e880f514 fix gdown 2021-02-05 13:42:24 +00:00
Eren Gölge f4f6290eec Merge branch 'pr/gerazov/641' into dev 2021-02-05 13:14:49 +00:00
Eren Gölge d49757faaa linter updates 2021-02-05 13:10:43 +00:00
Branislav Gerazov f063545325 improve robustness of defining wavernn in config file 2021-02-05 13:26:33 +01:00
Branislav Gerazov 24ffa9e9f6 update wavernn test config, delete cap=True 2021-02-05 13:10:02 +01:00
Branislav Gerazov cb77aef36c waveRNN fix 2021-02-04 09:52:03 +01:00
Thorsten Mueller d74866cb8e Merge remote-tracking branch 'upstream/dev' into dev
Fix for circleci error mentioned in PR https://github.com/mozilla/TTS/pull/637
2021-02-02 19:40:18 +01:00
Thorsten Mueller a82152eef3 Ups. Added missing , 2021-02-02 19:29:16 +01:00
Thorsten Mueller 4cb4fcf02c Set out_path to be required param. 2021-02-02 19:29:16 +01:00
Thorsten Mueller c75ea74914 Added info if model already downloaded in --list_models 2021-02-02 19:29:16 +01:00
Eren Gölge 2edab4b3f9 disable pw in audio that causes numpy issue 2021-02-01 17:05:03 +00:00
Eren Gölge 5c46543765 linter fixes and version updates for deps 2021-02-01 13:18:56 +00:00
Eren Gölge 8774e37444 unpin cython version and comment out pyworld in audio.py causing dep
issues
2021-02-01 11:34:05 +00:00
Eren Gölge 5beed0ddcd Merge branch 'dev' of https://github.com/mozilla/TTS into dev 2021-02-01 11:27:14 +00:00
Eren Gölge c7407571fa fix #638 2021-02-01 10:05:55 +00:00
Eren Gölge dfdac1def9
Merge pull request #636 from thorstenMueller/dev
Set out_path to be required param in compute_statistics.py.
2021-01-29 18:08:31 +01:00
Thorsten Mueller 44c4a49745 Set out_path to be required param. 2021-01-29 17:23:38 +01:00
Eren Gölge 536366dc0a
Merge pull request #635 from SanjaESC/patch-1
fix device mismatch wavegrad training
2021-01-29 16:42:25 +01:00
Eren Gölge 0354b6f35e move hubconf 2021-01-29 15:28:32 +00:00
Eren Gölge aa5f24608a hubconf.py and load .models.json from the default location by manage.py 2021-01-29 15:28:26 +00:00
Alexander Korolev e81ebec7a8
fix device mismatch wavegrad training
this should fix the device mismatch as seen here https://github.com/mozilla/TTS/issues/622#issue-789802916
2021-01-29 15:18:59 +01:00
Eren Gölge a926aa106d reorder imports 2021-01-29 01:36:21 +01:00
Eren Gölge 8a6eee7fec distill import statement, check python version in setup.py 2021-01-28 17:04:08 +01:00
Eren Gölge 131a163c95
Merge pull request #628 from thorstenMueller/dev
Added info if model already downloaded in --list_models
2021-01-28 13:10:06 +01:00
Alexander Korolev ca28e05ed7
update fixed stopnet_pos_weight parameter
config parameter c.stopnet_pos_weight has currently no effect as it is not used.
2021-01-27 16:33:25 +01:00
Thorsten Mueller ccbd542eb0 Added info if model already downloaded in --list_models 2021-01-27 16:19:02 +01:00
Eren Gölge 25c86ca715 README update, set default models for synthesize.py and server.py. Disable verbose for ap init. 2021-01-27 11:47:03 +01:00
Eren Gölge 4f32e77006 platform indep. way to fetch user data folder 2021-01-26 17:32:43 +01:00
Eren Gölge 0117c811a9 add a button to index.html to see the model details 2021-01-26 12:33:27 +01:00
Eren Gölge a3adcaccdb Merge branch 'pr/thorstenMueller/623' into dev 2021-01-26 12:19:39 +01:00
Eren Gölge b464cab9b8 setup.py update and pylint fixes 2021-01-26 02:57:50 +01:00
Eren Gölge 660d61aeeb maximum_path_numpy and CYTHON adaptable import 2021-01-26 02:57:07 +01:00
Eren Gölge 877f0bbfba manifest.in update 2021-01-26 02:56:55 +01:00
Eren Gölge 82e029529e fix manifest file 2021-01-25 13:27:54 +01:00
Eren Gölge 57b668fd86 fixing some PyPI issues 2021-01-25 13:06:12 +01:00
Eren Gölge 60c1bb93d9 fixes before first PyPI release 2021-01-25 11:16:20 +01:00
Thorsten Mueller afb7db2a1d Removed unneeded check and removed specific taco2 model name. 2021-01-22 16:22:50 +01:00
Eren Gölge fae10309e4
Merge pull request #624 from SanjaESC/patch-3
Update train_tacotron.py
2021-01-22 13:29:09 +01:00
Eren Gölge 5ee73c2bae Merge branch 'dev' of https://github.com/mozilla/TTS into dev 2021-01-22 13:26:27 +01:00
Eren Gölge 5fb611ef40 static image for server index.html 2021-01-22 03:01:53 +01:00
Eren Gölge ca647cf222 Model Manager to download released models 2021-01-22 02:35:43 +01:00
Eren Gölge ca8ad9c21e rename audio._normalize to audio.normalize 2021-01-22 02:33:19 +01:00
Eren Gölge c990b3a59c linter fixes and test fixes 2021-01-22 02:32:35 +01:00
Alexander Korolev f251dc8c0e
Update train_tacotron.py
When attempting to fine-tune a model with "prenet_type": "bn" that was originally trained with "prenet_type": "original", a RuntimeError is thrown that stops the training.

By catching the RuntimeError, the required layers can be partially restored and the training will continue without any problems.
2021-01-21 21:16:30 +01:00
Eren Gölge 0ab2eb2664 use synthesizer in both synthesize.py and server.py 2021-01-21 15:54:33 +01:00
Eren Gölge 9addfabc43 wavernn load_checkpoint function 2021-01-21 15:31:13 +01:00
Eren Gölge 50fee59a2c update synthesizer.py for better interfacing to different models 2021-01-21 15:30:49 +01:00
Eren Gölge 007a4d7139 remove 3rd party wavernn support from server.py and add ModelManager arguments 2021-01-21 15:30:16 +01:00
Eren Gölge 6b6e989fd2 update server readme 2021-01-21 15:29:46 +01:00
Thorsten Mueller e414582be6 Added option for server ui details page. 2021-01-20 21:56:40 +01:00
root 1bc8fbbd3c set eval mode when loading models 2021-01-20 02:14:18 +00:00
root 5bd7238153 interpolate spectrogram in vocoder generic utils for matching sample
rates
2021-01-20 02:13:01 +00:00
root ca3743539a load_checkpoint func for vocoder models 2021-01-20 02:12:29 +00:00
root ea39715305 read_json_with_comments 2021-01-20 02:11:55 +00:00