Commit Graph

68 Commits

Author SHA1 Message Date
erogol d94782a076 reset the way ga_loss is stored in return_dict 2020-11-02 13:18:56 +01:00
erogol a108d0ee81 check nan loss in glow-tts loss 2020-11-02 13:12:19 +01:00
erogol b8ac9aba9d check against NaN loss in tacotron_loss 2020-11-02 12:44:41 +01:00
erogol 183fe56d95 Merge branch 'ssim_loss' into dev 2020-10-29 23:49:09 +01:00
erogol 73581cd94c renaming train scripts and updating tests 2020-10-29 16:50:07 +01:00
erogol 946a0c0fb9 bug fixes for single speaker glow-tts, enable torch based amp. Make amp optional for wavegrad. Bug fixes for synthesis setup for glow-tts 2020-10-29 15:45:50 +01:00
erogol a1582a0e12 fix distributed training for train_* scripts 2020-10-29 12:31:43 +01:00
erogol e723b99888 handle distributed model as saving 2020-10-29 12:30:37 +01:00
erogol fdaed45f58 optional loss masking for stoptoken predictor 2020-10-28 18:40:54 +01:00
erogol e49cc3bbcd bug fix 2020-10-28 18:34:34 +01:00
erogol 59e1cf99d0 config update and ssim implementation 2020-10-28 18:30:00 +01:00
erogol 9cef923d99 ssim loss for tacotron models 2020-10-28 15:24:18 +01:00
Eren Gölge f4b8170bd1
Merge pull request #545 from Edresson/dev
GlowTTS zeroshot TTS support
2020-10-27 15:23:41 +01:00
erogol a6f564c8c8 pylint fixes 2020-10-27 12:35:10 +01:00
erogol 8de7c13708 fix no loss masking loss computation 2020-10-27 12:17:38 +01:00
Edresson 89e9bfe3a2 add text processing blank token test 2020-10-26 17:41:23 -03:00
Edresson d9540a5857 add blank token in sequence for encrease glowtts results 2020-10-25 15:08:28 -03:00
Edresson fbea058c59 add parse speakers function 2020-10-24 16:10:05 -03:00
Edresson 07345099ee GlowTTS zero-shot TTS Support 2020-10-24 15:58:39 -03:00
ayush-1506 2a3559f02b Fix readme and config file 2020-10-21 13:43:49 +05:30
Edresson b7f9ebd32b add check arguments for GlowTTS and multispeaker training bug fix 2020-10-19 17:17:58 -03:00
erogol c2c4126a18 remove merge conflicts 2020-10-08 01:35:27 +02:00
erogol 6f0654f9a8 differential spectral loss 2020-10-08 01:30:42 +02:00
erogol e0d4b88877 config update 2020-10-08 01:29:30 +02:00
erogol 4e93f90108 bug fix 2020-10-08 01:29:30 +02:00
erogol bb9b70ee27 differential spectral loss and loss weight settings 2020-10-08 01:29:30 +02:00
erogol e1eab1ce4b print model r value as loading it 2020-10-07 13:34:21 +02:00
Eren Gölge 4873601694
Merge pull request #531 from WeberJulian/french-cleaners
Adding support for french cleaners
2020-09-30 15:30:50 +02:00
Edresson 99d5a0ac07 add Speaker Conditional GST support 2020-09-29 16:09:27 -03:00
Julian WEBER ea7c2e15c0 Adding french abbreviations 2020-09-29 15:43:39 +02:00
Julian WEBER 54b4031391 Merge remote-tracking branch 'origin/dev' into french-cleaners 2020-09-29 14:24:51 +02:00
Julian WEBER da134eeee4 Subjective improvements 2020-09-29 14:20:52 +02:00
Julian WEBER b2817e9e93 Adding french cleaners 2020-09-29 14:20:24 +02:00
Eren Gölge cf02ace5b7
Merge pull request #530 from mueller91/fix_split_dataset
fix: split_dataset
2020-09-28 12:42:40 +02:00
erogol e097bc6c5d Merge branch 'dev' of https://github.com/mozilla/TTS into dev 2020-09-28 11:15:32 +02:00
Eren Gölge 8e2dc79c3a
Merge pull request #526 from mueller91/dev
Fix: Check storage params only for speaker encoder
2020-09-28 11:15:23 +02:00
erogol 6a70c63f24 correct glow-tts loss 2020-09-27 03:28:42 +02:00
erogol 665f7ca714 linter fix 2020-09-24 12:57:54 +02:00
mueller91 227b9c8864 fix: split_dataset() runtime reduced from O(N * |items|) to O(N) where N is the size of the eval split (max 500)
I notice a significant speedup on the initial loading of large datasets such as common voice (from minutes to seconds)
2020-09-23 23:27:51 +02:00
mueller91 1fe5eb054f Merge branch 'dev' of https://github.com/mozilla/TTS into dev
 Conflicts:
	TTS/bin/train_encoder.py
	requirements.txt
2020-09-22 19:58:53 +02:00
mueller91 df4caec4b7 add: check_config for speaker_encoder 2020-09-22 19:52:09 +02:00
mueller91 0ea7f4e2bd fix: make speaker encoder's storage parameters non-restriced 2020-09-22 10:39:40 +02:00
mueller91 7029452228 fix: make speaker encoder's storage parameters non-restriced 2020-09-22 10:31:42 +02:00
erogol 10258724d1 linter fixes 2020-09-22 03:54:16 +02:00
erogol a6df617eb1 Merge branch 'glow-tts-amp-time_depth_conv' into dev 2020-09-21 14:23:45 +02:00
erogol e0b9fa887f glow-tts modules added 2020-09-21 14:15:40 +02:00
erogol e4c6386603 change import for normalization layer 2020-09-21 13:09:52 +02:00
mueller91 9b4aac94a8 fix: linter issues 2020-09-21 12:13:02 +02:00
erogol c008003506 do not check sample rate as loading stats file for normalization to enable interpolation for different sample rate vocoder 2020-09-18 12:52:19 +02:00
mueller 6b0621c794 cleanup 2020-09-17 16:46:43 +02:00