Commit Graph

322 Commits

Author SHA1 Message Date
Eren Golge df1b8b3ec7 linter and test updates for speaker_encoder, gmm_Attention 2019-11-12 12:42:42 +01:00
Eren Golge 1401a0db6b update GMM attention calp max min 2019-11-12 11:20:53 +01:00
Eren Golge 015f7780f4 Decoder shape comments for Tacotron2, decoupled grad clip for stopnet and the rest of the network. Some variable renaming and bug fix for alignment score logging 2019-11-12 11:20:53 +01:00
Eren Golge adf9ebd629 Graves attention and setting attn type by config.json 2019-11-12 11:18:57 +01:00
Eren Golge 0e0d0345cd call truncated inference 2019-10-29 17:41:07 +01:00
Eren Golge 60b6ec18fe bug fix for synthesis.py 2019-10-29 17:38:59 +01:00
Eren Golge e83a4b07d2 commention model outputs for tacotron, align outputs shapes of tacotron and tracotron2, merge bidirectional decoder 2019-10-28 14:51:19 +01:00
Eren Golge 2dcdc14ea6 UPDATE TRIM SILENCE 2019-10-12 18:34:28 +02:00
Eren Golge 0849e3c42f sound normalization while reading, adapting get_Speaker for multiple datasets 2019-10-04 18:20:30 +02:00
Eren Golge 8dec2a9e95 fix memory leak duee to diagonal alingmnet score 2019-10-02 00:30:25 +02:00
Eren Golge acbafb456b Weighting positive values for stopnet loss, change adam_weight_decay name 2019-09-28 15:44:17 +02:00
Eren Golge 99d7f2a666 update set_weight_decay 2019-09-28 15:31:18 +02:00
Eren Golge 8565c508e4 remove debug line 2019-09-28 01:11:04 +02:00
Eren Golge b76aaf8ad4 skip weight decay for BN and biases, some formatting 2019-09-28 01:09:28 +02:00
Eren Golge 5b6b1f354d add use_gst to enable global style token 2019-09-24 16:24:58 +02:00
Eren Golge d45d963dc1 linter fix 2019-09-11 10:39:59 +02:00
Eren Golge 609d8efa69 compute alignment diagonality score and encapsulate stats averaging with a class in traning 2019-09-11 10:32:07 +02:00
Eren Golge d1828c9573 fix server tests and pylint 2019-09-10 12:09:58 +02:00
Eren Golge 0bb8d780e8 visual.py update 2019-09-05 16:48:36 +02:00
Eren Golge dc69074a56 add RADAM reference 2019-08-30 10:33:46 +02:00
Eren Golge 529348d6dc lint fixes 2019-08-30 10:29:22 +02:00
Eren Golge afdc4bad10 Merge branch 'dev-radam' into dev 2019-08-30 10:15:54 +02:00
Reuben Morais 28644a717e Fix tests 2019-08-29 12:18:33 +02:00
Reuben Morais 3c5aeb5e22 Fix installation by using an explicit symlink 2019-08-29 11:49:53 +02:00
Eren Golge e02fc51fde server update for changing r value 2019-08-23 12:28:05 +02:00
Eren Golge 1a1db23df1 radam 2019-08-22 00:34:46 +02:00
Eren Golge 5ff8544d6a force frame_length to be a multiple hop_length 2019-08-20 13:22:04 +02:00
Eren Golge d99623e285 bug fixes for logging 2019-08-19 16:27:53 +02:00
Eren Golge 5629292bde bug fixes 2019-08-16 15:08:04 +02:00
Eren Golge b22c7d4a29 Merge branch 'dev-gradual-queue' into dev 2019-08-16 13:20:17 +02:00
Eren Golge 5acd9e82bd save model r value for checkpoints 2019-08-16 13:11:51 +02:00
Thomas Werkmeister 215eb014ca enforce list append semantic; prevents numpy add 2019-07-26 13:40:58 +02:00
Eren Golge 85adb2496c Merge branch 'master' of github.com:mozilla/TTS 2019-07-22 20:59:42 +02:00
Eren Golge 91795cc0f1 config update 2019-07-22 15:44:09 +02:00
Eren Golge ee706b50f6 enalbe graudal training by config.json 2019-07-22 02:11:20 +02:00
Thomas Werkmeister f59543d127 fixed usage of bos&eos char with caching 2019-07-19 15:17:35 +02:00
Reuben Morais 9a61dfa155 Address additional lint problems 2019-07-19 11:35:06 +02:00
Reuben Morais 11e7895329 Fix Pylint issues 2019-07-19 09:08:51 +02:00
Eren Gölge 63c0085256
Merge pull request #229 from twerkmeister/patch-2
check for speaker id is None before put on cuda
2019-07-17 16:20:24 +02:00
Thomas Werkmeister ee4d55549d
check for speaker id is None before put on cuda 2019-07-17 14:08:53 +02:00
Eren Golge fd081c49b7 split dataset outside preprocessor 2019-07-16 21:15:04 +02:00
Eren Golge aec7f02817 libri tts config, and bug fix 2019-07-16 15:17:38 +02:00
Eren Golge 1468db0d07 bug fix for multispeaker test run 2019-07-12 10:50:20 +02:00
Eren Golge 5851c5d29b Merge branch 'tacotron-gst' into dev 2019-07-11 15:32:32 +02:00
Eren Golge 89969b0f38 LibriTTS processor and a small notification for silence trimming 2019-07-11 15:25:29 +02:00
Thomas Werkmeister 2f2482f9b4 reading all speakers upfront 2019-07-10 18:38:55 +02:00
Thomas Werkmeister d23e29ea1f extracted id to torch code 2019-07-02 14:40:01 +02:00
Thomas Werkmeister ba8cc8054b disabling multispeaker with num_speakers=0 2019-07-01 14:01:34 +02:00
Thomas Werkmeister 04e452d8cb Merge branch 'tacotron-gst' of github.com:mozilla/TTS into multispeaker 2019-07-01 14:00:22 +02:00
Eren Golge 464cc29756 Make optional reampling of the read wav 2019-06-26 14:11:30 +02:00
Thomas Werkmeister 05ff8801d1 config, benchmark notebook, synthesis fixed 2019-06-26 13:31:16 +02:00
Thomas Werkmeister d172a3d3d5 multispeaker 2019-06-26 12:59:14 +02:00
Eren Golge 51f1cd67e3 bug fix 2019-06-15 01:22:27 +02:00
Eren Golge 037ec13453 config update, audio.py update and modularize synthesize.py 2019-06-14 16:18:49 +02:00
Eren Golge e061ed091a modularize synthesis 2019-06-12 12:12:22 +02:00
Eren Golge 0f8936d744 GST inference 2019-06-12 12:12:01 +02:00
Eren Golge 31fe02412c forward_attn_mask and config update 2019-06-06 11:14:20 +02:00
Eren Golge 127a6b68e0 update mulaw decoder 2019-06-06 11:13:26 +02:00
Eren Golge 63eea4a364 bug fix 2019-06-06 10:24:34 +02:00
Eren Golge 7410daceb2 Adapt TTS for TacotronGST and some changes for Audio.py , better config.json naming 2019-06-05 18:33:57 +02:00
Eren Golge 4678c66599 forward_attn_mask and config update 2019-06-04 00:39:29 +02:00
Eren Golge f096f1052f config updates, update audio.py, update mailabs preprocessor 2019-06-03 15:34:36 +02:00
Eren Golge 70929387c0 Merge branch 'dev-tacotron2' 2019-05-28 14:59:24 +02:00
Eren Golge 0dbed8fef7 New method to convert Tacotron output to mel psectrograms 2019-05-27 14:41:59 +02:00
Eren Golge ba492f43be Set tacotron model parameters to adap to common_layers.py - Prenet and Attention 2019-05-27 14:40:28 +02:00
Eren Golge d4b900f6c9 use soundfile for faster read 2019-05-23 02:02:22 +02:00
Eren Golge e62659da94 update separate stopnet flow to make it faster. 2019-05-17 16:15:43 +02:00
Eren Golge 832dc3eafa bug fix 2019-05-15 12:37:31 +02:00
Eren Golge bb2b705e01 small bug fixes 2019-05-14 13:53:26 +02:00
Eren Golge 5e679f746d save figures in visualize of set 2019-05-12 17:35:44 +02:00
Eren Golge 6331bccefc make dropout oprional #2 2019-05-12 17:35:31 +02:00
Eren Golge e2439fde9a make location attention optional and keep all attention weights in attention class 2019-04-29 11:37:01 +02:00
Eren Golge 01dbfb3a0f Server update s 2019-04-18 17:35:20 +02:00
Eren Golge 3c2d500f53 Changesat windowing and some comments 2019-04-12 16:13:40 +02:00
Eren Golge 9466505f27 Make eos bos chars optional 2019-04-12 16:12:15 +02:00
Eren Golge e2cf35bb10 Make loss masking optional 2019-04-10 16:41:08 +02:00
Eren Golge 8a47b46195 print warning if a layer in ehckpoint is not defined in model definition 2019-04-08 19:32:07 +02:00
Eren Golge 961af0f5cd setup_model externally based on model selection. Make forward attention and prenet type configurable in config.json 2019-04-05 17:49:18 +02:00
Eren Golge 7baaf140f9 Remove start character for phonme sequenceing 2019-04-04 10:49:09 +02:00
Eren Golge 2e361e2306 strip sting after phonemizer 2019-03-29 17:05:44 +01:00
Eren Golge 103971c893 text processing updates with tests 2019-03-29 17:04:10 +01:00
Eren Golge 6edd8bc6dd add git branch and restore_path to copied config file for each run 2019-03-29 17:01:57 +01:00
Eren Golge 1ed4978e69 text processing update 2019-03-27 14:57:36 +01:00
Eren Golge 76d5e065db phoneme_to_sequence bug fix 2019-03-27 14:57:26 +01:00
Eren Golge fdca8402c7 config updates 2019-03-26 15:46:26 +01:00
Eren Golge d8908692c5 refactor partial reinit script as a function. Allow user to select layers to reinit in finutunning 2019-03-23 17:19:40 +01:00
Eren Golge 06a7aeb26d git commit bug fix for phonimizer 2019-03-23 16:44:38 +01:00
Eren Golge f96945443e add start char but remove end char 2019-03-22 23:48:44 +01:00
Eren Golge d6307fbb7f config update 2019-03-22 19:12:58 +01:00
Eren Golge ff7258062c skip the alst empty char in phonemes to sequence. It breaks the alingment 2019-03-20 12:24:04 +01:00
Eren Golge 5acc9db4ac
Add empty character to phonemes 2019-03-12 10:16:42 +01:00
gnosly 95de2cd559 added missing phonemes, synthesizer.py now setup the correct input layer 2019-03-11 21:56:40 +01:00
Eren Golge b9b79fcf0f inference truncated NEED TO BE TESTED 2019-03-11 17:40:09 +01:00
Eren Golge 5754116c19 bos char addded 2019-03-06 22:06:01 +01:00
Eren Golge a2a22d253f synthesis update compatible with multiplt architecture 2019-03-06 13:11:46 +01:00
Eren Golge 08162157ee generic train.py for multiple architectures set on config.json 2019-03-06 13:11:22 +01:00
Eren Golge 1e8fdec084 Modularize functions in Tacotron 2019-03-05 13:25:50 +01:00
Eren Golge bf5f18d11e Formatting changes and distributed training 2019-02-27 09:50:52 +01:00
Eren Golge caae1af4f6 visual updates for phoenemes 2019-02-25 17:20:36 +01:00
Eren Golge 97a16cedbf phoneme punctuation bug fix 2019-02-16 03:20:04 +01:00
Eren Golge eb839a7acd small buggy fix for phoeneme sequencer 2019-02-05 11:57:12 +01:00
Eren Golge 328db7757d one more phoneme char for en-uk 2019-01-18 13:35:51 +01:00
Eren Golge 4749bc211e Add new char to phoneme symbols for en-gb 2019-01-17 15:48:37 +01:00
Eren Golge 7e020d4084 Bug fixes 2019-01-16 16:23:04 +01:00
Eren Golge 915783e10e enable phoneme based synthesizing 2019-01-16 15:53:07 +01:00
Eren Golge b241104778 Make phoneme training configurable through config.json 2019-01-16 13:07:03 +01:00
Eren Golge 9927664f27 Phonemize statements are updated 2019-01-16 12:30:33 +01:00
Eren Golge 524743507c remove debug prints 2019-01-16 12:29:48 +01:00
Eren Golge b9629135db phonemizer updates for utils.text 2019-01-16 12:29:48 +01:00
Eren Golge c754ca89de Move phoneme compuataion to __init__ and put char list to symbols.py 2019-01-16 12:28:28 +01:00
Eren Golge 28d45a8d80 bug fixes 2019-01-16 12:27:38 +01:00
Eren Golge 004dd0f208 useing epitran and new phoneme list 2019-01-16 12:26:39 +01:00
Eren Golge 0e73b6ba45 Debug prints for phoneme extraction 2019-01-16 12:26:21 +01:00
Eren Golge 85a1990cc6 Convesntional update s 2019-01-16 12:26:21 +01:00
Eren Golge 1722b1659a phonem updates 2019-01-16 12:24:40 +01:00
Eren Golge 9c9aea276c phonem extraction for training 2019-01-16 12:23:04 +01:00
Eren Golge 94387c905e remove debug prints 2019-01-16 12:08:12 +01:00
Eren Golge e1cb7c1501 phonemizer updates for utils.text 2019-01-16 12:08:12 +01:00
Eren Golge df49e93684 Move phoneme compuataion to __init__ and put char list to symbols.py 2019-01-16 12:07:33 +01:00
Eren Golge da2f064bc5 bug fixes 2019-01-16 12:07:33 +01:00
Eren Golge 444451dc8e useing epitran and new phoneme list 2019-01-16 12:07:00 +01:00
Eren Golge 7edb53ce63 Debug prints for phoneme extraction 2019-01-16 12:06:59 +01:00
Eren Golge e6750ca652 Convesntional update s 2019-01-16 12:05:29 +01:00
Eren Golge 5f22e2a83a use phoneme to sequence for synthesis 2019-01-16 12:05:29 +01:00
Eren Golge 421787277f phonem updates 2019-01-16 12:00:41 +01:00
Eren Golge da30c3c9b3 change numbers.py to number_norm.py 2019-01-16 11:59:48 +01:00
Eren Golge 8e22147a19 phonem extraction for training 2019-01-16 11:59:48 +01:00
Eren Golge c8d7a6a84e explicit slience removal after voice synthesis in case of wrong stop token 2019-01-06 18:10:54 +01:00
Eren Golge 4abc9ad1bc Logger field naming update for layer stats 2018-12-28 14:22:41 +01:00
Eren Golge 806643300c Place model name to the beginning of the generated output folder name 2018-12-28 14:22:41 +01:00
Eren Golge 481105ccfa logger for tensorboard plotting 2018-12-28 14:18:19 +01:00
Eren Golge 6488d5e305 nug fix 2018-11-28 16:37:59 +01:00
Eren Golge 7730ef6bff Merge branch 'dev' of github.com:mozilla/TTS into dev 2018-11-28 16:34:03 +01:00
Eren Golge bb2a88a984 Rename LR scheduler 2018-11-26 14:09:42 +01:00
Eren Golge f6bf5b3d74 trim silence if enabled 2018-11-23 17:06:22 +01:00
Eren Golge 0f0bde935c trim silence if enabled 2018-11-23 16:58:26 +01:00
Eren Golge 22dcc4f7d0 small print formatting 2018-11-22 17:03:53 +01:00
Eren Golge 161a26c9dd Plot mel spectrogram if required 2018-11-13 12:10:40 +01:00
Eren Golge 6550db5251 Formatting, fixing import statements, logging learning rate, remove optimizer restore cuda call 2018-11-05 14:05:04 +01:00
Eren Golge 440f51b61d correct import statements 2018-11-03 23:19:23 +01:00
Eren Golge 0b6a9995fc change import statements 2018-11-03 19:15:06 +01:00
Eren Golge d96690f83f Config updates and add sigmoid to mel network again 2018-11-02 17:27:31 +01:00
Eren Golge c8a552e627 Batch update after data-loss 2018-11-02 16:13:51 +01:00
Eren 41bfa95736 bug fix 2018-09-21 21:51:38 +02:00
Eren 34eeaee58b Make audio folder and save audio with scipy 2018-09-21 17:38:55 +02:00
Eren a165cd7bda Bug fix audio saving 2018-09-19 15:45:08 +02:00
Eren c52d3f16f9 Bug fix, prevent save_wav to modify given variable 2018-09-19 14:05:10 +02:00
Eren 56c6d0cac8 Remove min max mel freq 2018-09-06 15:26:20 +02:00
Eren bb526c296f Change scheduler AnnealLR and catch audio synthesis error in eval time 2018-08-13 13:13:45 +02:00
Eren 6818e11185 Make lr scheduler configurable 2018-08-12 15:02:06 +02:00