Commit Graph

138 Commits

Author SHA1 Message Date
Eren Gölge 843d1b3d98 linter fixes 2021-05-11 11:30:00 +02:00
Eren Gölge 19fb1d743d style update 2021-05-11 11:30:00 +02:00
Eren Gölge c57f0b46bb reintro use_gst for backwars compat 2021-05-11 11:29:18 +02:00
Eren Gölge 9ee70af9bb code styling 2021-05-11 11:29:18 +02:00
Eren Gölge 7663bc63c1 add Coqpit configs for the TTS models 2021-05-11 11:29:17 +02:00
Eren Gölge 7227e8f1d2 update train_align_tts.py for coqpit 2021-05-11 11:29:17 +02:00
Eren Gölge 51a7e06945 glow_tts_config.py and train test on python 2021-05-11 11:29:17 +02:00
Eren Gölge 816e7ee698 remove default configs.json as replacing with Coqpit configs 2021-05-11 11:29:17 +02:00
Eren Gölge 97bd5f9734 [ci skip] config update #3 WIP 2021-05-11 11:28:35 +02:00
Eren Gölge a21c0b5585 config update 2 WIP 2021-05-11 11:28:35 +02:00
Eren Gölge ef37633cb3 [ci skip] use prenet_dropout by default with Tacotron models 2021-04-22 12:38:55 +02:00
Eren Gölge 48ea20e69f example aligntts config 2021-03-30 14:41:00 +02:00
gerazov 2451a813a2 refactored keep_all_best 2021-03-08 02:57:11 +01:00
gerazov f2e474cd37 loading last checkpoint/best_model works, deleting last best models options added, loading last best_loss added 2021-03-08 02:56:36 +01:00
erogol 29cf933831 update SS condif 2021-01-06 13:19:40 +01:00
erogol 228ada04b5 update glow-tts ljspeech config 2021-01-06 13:19:40 +01:00
erogol ac5c9217d1 positional encoding masking for SS 2021-01-06 13:19:40 +01:00
erogol cf869e8922 add SS files 2021-01-06 13:19:40 +01:00
erogol a1d5a9ddda config update tyo use noise for augmentation 2021-01-06 13:19:40 +01:00
erogol 7b20d8cbd3 implement residual BN convolution and add it as an alternative encoder for glow-tts. also generic layers to layers/generic 2021-01-06 13:19:40 +01:00
erogol f81af4eb0d config update disable guided attention for dynamic conv attention 2021-01-06 13:19:40 +01:00
erogol 5c50e104d6 config update 2021-01-06 13:19:40 +01:00
erogol fa20638083 config for ljspeech dynamic conv attention 2021-01-06 13:18:41 +01:00
erogol 070146e143 add monotonic dynamic convolution attention 2021-01-06 13:18:41 +01:00
erogol affe1c1138 setup training scripts for computing phonemes before training optionally. And define data_loaders before starting training and re-use them instead of re-define for every train and eval calls. This is to enable better instance filtering based on input length. 2020-12-07 11:26:57 +01:00
erogol 1229554c42 use native amp 2020-11-25 14:48:54 +01:00
erogol 183fe56d95 Merge branch 'ssim_loss' into dev 2020-10-29 23:49:09 +01:00
erogol 73581cd94c renaming train scripts and updating tests 2020-10-29 16:50:07 +01:00
erogol a1582a0e12 fix distributed training for train_* scripts 2020-10-29 12:31:43 +01:00
erogol 59e1cf99d0 config update and ssim implementation 2020-10-28 18:30:00 +01:00
erogol 9cef923d99 ssim loss for tacotron models 2020-10-28 15:24:18 +01:00
Eren Gölge f4b8170bd1
Merge pull request #545 from Edresson/dev
GlowTTS zeroshot TTS support
2020-10-27 15:23:41 +01:00
Edresson d9540a5857 add blank token in sequence for encrease glowtts results 2020-10-25 15:08:28 -03:00
ayush-1506 2a3559f02b Fix readme and config file 2020-10-21 13:43:49 +05:30
erogol e0d4b88877 config update 2020-10-08 01:29:30 +02:00
erogol bb9b70ee27 differential spectral loss and loss weight settings 2020-10-08 01:29:30 +02:00
erogol e0b9fa887f glow-tts modules added 2020-09-21 14:15:40 +02:00
erogol df19428ec6 rename the project to old TTS 2020-09-09 12:27:23 +02:00