erogol
5cae2c5742
make optional position encoding for speedyspeech
2021-01-06 13:19:40 +01:00
erogol
dc4a16d62e
speedy speehc losses
2021-01-06 13:19:40 +01:00
erogol
d62cac7252
fix glow-tts prenet bug fix
2021-01-06 13:19:40 +01:00
erogol
a1d5a9ddda
config update tyo use noise for augmentation
2021-01-06 13:19:40 +01:00
erogol
022af74d74
update prompt msg
2021-01-06 13:19:40 +01:00
erogol
57ef53bef3
update argumnet check for non tacotron models
2021-01-06 13:19:40 +01:00
erogol
27a75de15f
update processors for loading attention maps
2021-01-06 13:19:40 +01:00
erogol
fa6907fa0e
update glow-tts parameters and fix rel-attn-win size
2021-01-06 13:19:40 +01:00
erogol
7b20d8cbd3
implement residual BN convolution and add it as an alternative encoder for glow-tts. also generic layers to layers/generic
2021-01-06 13:19:40 +01:00
erogol
973754d893
fix for init glow-tts
2021-01-06 13:19:40 +01:00
erogol
f81af4eb0d
config update disable guided attention for dynamic conv attention
2021-01-06 13:19:40 +01:00
erogol
5c50e104d6
config update
2021-01-06 13:19:40 +01:00
erogol
fa20638083
config for ljspeech dynamic conv attention
2021-01-06 13:18:41 +01:00
erogol
070146e143
add monotonic dynamic convolution attention
2021-01-06 13:18:41 +01:00
erogol
639fa29261
update speaker id casting for glow-tts
2020-12-14 16:58:47 +01:00
erogol
999120ecdf
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
2020-12-12 18:50:14 +01:00
erogol
f611e6ac01
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
2020-12-12 18:47:59 +01:00
Jörg Thalheim
62fd4ca70d
inflect negative numbers correctly
2020-12-10 16:47:51 +01:00
Jörg Thalheim
6646682650
cleaners: expand english time
2020-12-10 14:53:20 +01:00
Jörg Thalheim
76138687d3
expand more currencies
2020-12-10 14:53:20 +01:00
erogol
a2859b7ddc
update config args checks
2020-12-10 13:52:57 +01:00
erogol
788cd6f902
fix multi-speaker glow-tts inference
2020-12-10 02:05:48 +01:00
erogol
92cc9630d7
fix glow-tts synthesis for DPP
2020-12-10 00:30:34 +01:00
erogol
df180148e9
use noise augmentation in TTSDataset
2020-12-09 15:46:25 +01:00
erogol
06612ce305
test fixes
2020-12-07 15:57:34 +01:00
erogol
7505c0ba27
muliprocess phoneme computation
2020-12-07 11:29:41 +01:00
erogol
20c86489d7
make static methods for faster multiprocess call
2020-12-07 11:29:10 +01:00
erogol
affe1c1138
setup training scripts for computing phonemes before training optionally. And define data_loaders before starting training and re-use them instead of re-define for every train and eval calls. This is to enable better instance filtering based on input length.
2020-12-07 11:26:57 +01:00
erogol
7c3cdced1a
make speaker_mapping a global variable to prevent reload. Fix glow-tts training
2020-12-01 03:23:25 +01:00
erogol
a757b203bc
fix longer phoneme seqs
2020-11-26 15:05:03 +01:00
erogol
f6c96b0ac2
Merge branch 'dev'
2020-11-25 15:29:06 +01:00
erogol
a1e4ee18f9
convert float16 to float32 for plotting spectrograms
2020-11-25 14:50:28 +01:00
erogol
7541d2ecaa
return eval split optional
2020-11-25 14:50:09 +01:00
erogol
1229554c42
use native amp
2020-11-25 14:48:54 +01:00
Qingping Hou
b0b97d636f
speed up metafile build for voxceleb
2020-11-14 23:45:17 -08:00
erogol
6cc464ead6
fix ton of tesnting bugs
2020-11-12 16:33:29 +01:00
erogol
9b0f441945
argument for returning no eval split
2020-11-12 12:52:27 +01:00
erogol
21364331d2
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
2020-11-09 13:31:12 +01:00
erogol
d94782a076
reset the way ga_loss is stored in return_dict
2020-11-02 13:18:56 +01:00
erogol
a108d0ee81
check nan loss in glow-tts loss
2020-11-02 13:12:19 +01:00
erogol
b8ac9aba9d
check against NaN loss in tacotron_loss
2020-11-02 12:44:41 +01:00
erogol
183fe56d95
Merge branch 'ssim_loss' into dev
2020-10-29 23:49:09 +01:00
erogol
73581cd94c
renaming train scripts and updating tests
2020-10-29 16:50:07 +01:00
erogol
946a0c0fb9
bug fixes for single speaker glow-tts, enable torch based amp. Make amp optional for wavegrad. Bug fixes for synthesis setup for glow-tts
2020-10-29 15:45:50 +01:00
erogol
a1582a0e12
fix distributed training for train_* scripts
2020-10-29 12:31:43 +01:00
erogol
e723b99888
handle distributed model as saving
2020-10-29 12:30:37 +01:00
Eren Gölge
26c18b61c9
Merge pull request #553 from Edresson/dev
...
bug fix in the inference with GlowTTS
2020-10-28 18:49:31 +01:00
erogol
fdaed45f58
optional loss masking for stoptoken predictor
2020-10-28 18:40:54 +01:00
erogol
e49cc3bbcd
bug fix
2020-10-28 18:34:34 +01:00
erogol
59e1cf99d0
config update and ssim implementation
2020-10-28 18:30:00 +01:00
erogol
9cef923d99
ssim loss for tacotron models
2020-10-28 15:24:18 +01:00
Edresson
f01502a9db
bug fix in glowTTS sythesize
2020-10-27 16:30:16 -03:00
Eren Gölge
f4b8170bd1
Merge pull request #545 from Edresson/dev
...
GlowTTS zeroshot TTS support
2020-10-27 15:23:41 +01:00
erogol
a6f564c8c8
pylint fixes
2020-10-27 12:35:10 +01:00
erogol
8de7c13708
fix no loss masking loss computation
2020-10-27 12:17:38 +01:00
Edresson
89e9bfe3a2
add text processing blank token test
2020-10-26 17:41:23 -03:00
Edresson
d9540a5857
add blank token in sequence for encrease glowtts results
2020-10-25 15:08:28 -03:00
Edresson
fbea058c59
add parse speakers function
2020-10-24 16:10:05 -03:00
Edresson
07345099ee
GlowTTS zero-shot TTS Support
2020-10-24 15:58:39 -03:00
Alexander Korolev
47d74ced1c
Update losses.py
...
Seems like in the latest dev merge, this change was reverted. Any specific reason for this?
Without it the problem as stated here https://github.com/mozilla/TTS/issues/473 occurs.
2020-10-23 14:15:01 +02:00
ayush-1506
2a3559f02b
Fix readme and config file
2020-10-21 13:43:49 +05:30
Edresson
b7f9ebd32b
add check arguments for GlowTTS and multispeaker training bug fix
2020-10-19 17:17:58 -03:00
erogol
c2c4126a18
remove merge conflicts
2020-10-08 01:35:27 +02:00
erogol
6f0654f9a8
differential spectral loss
2020-10-08 01:30:42 +02:00
erogol
e0d4b88877
config update
2020-10-08 01:29:30 +02:00
erogol
4e93f90108
bug fix
2020-10-08 01:29:30 +02:00
erogol
bb9b70ee27
differential spectral loss and loss weight settings
2020-10-08 01:29:30 +02:00
erogol
e1eab1ce4b
print model r value as loading it
2020-10-07 13:34:21 +02:00
Eren Gölge
4873601694
Merge pull request #531 from WeberJulian/french-cleaners
...
Adding support for french cleaners
2020-09-30 15:30:50 +02:00
Edresson
99d5a0ac07
add Speaker Conditional GST support
2020-09-29 16:09:27 -03:00
Julian WEBER
ea7c2e15c0
Adding french abbreviations
2020-09-29 15:43:39 +02:00
Julian WEBER
54b4031391
Merge remote-tracking branch 'origin/dev' into french-cleaners
2020-09-29 14:24:51 +02:00
Julian WEBER
da134eeee4
Subjective improvements
2020-09-29 14:20:52 +02:00
Julian WEBER
b2817e9e93
Adding french cleaners
2020-09-29 14:20:24 +02:00
Eren Gölge
cf02ace5b7
Merge pull request #530 from mueller91/fix_split_dataset
...
fix: split_dataset
2020-09-28 12:42:40 +02:00
erogol
e097bc6c5d
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
2020-09-28 11:15:32 +02:00
Eren Gölge
8e2dc79c3a
Merge pull request #526 from mueller91/dev
...
Fix: Check storage params only for speaker encoder
2020-09-28 11:15:23 +02:00
erogol
6a70c63f24
correct glow-tts loss
2020-09-27 03:28:42 +02:00
erogol
665f7ca714
linter fix
2020-09-24 12:57:54 +02:00
mueller91
227b9c8864
fix: split_dataset() runtime reduced from O(N * |items|) to O(N) where N is the size of the eval split (max 500)
...
I notice a significant speedup on the initial loading of large datasets such as common voice (from minutes to seconds)
2020-09-23 23:27:51 +02:00
mueller91
1fe5eb054f
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
...
Conflicts:
TTS/bin/train_encoder.py
requirements.txt
2020-09-22 19:58:53 +02:00
mueller91
df4caec4b7
add: check_config for speaker_encoder
2020-09-22 19:52:09 +02:00
mueller91
0ea7f4e2bd
fix: make speaker encoder's storage parameters non-restriced
2020-09-22 10:39:40 +02:00
mueller91
7029452228
fix: make speaker encoder's storage parameters non-restriced
2020-09-22 10:31:42 +02:00
erogol
10258724d1
linter fixes
2020-09-22 03:54:16 +02:00
erogol
a6df617eb1
Merge branch 'glow-tts-amp-time_depth_conv' into dev
2020-09-21 14:23:45 +02:00
erogol
e0b9fa887f
glow-tts modules added
2020-09-21 14:15:40 +02:00
erogol
e4c6386603
change import for normalization layer
2020-09-21 13:09:52 +02:00
mueller91
9b4aac94a8
fix: linter issues
2020-09-21 12:13:02 +02:00
erogol
c008003506
do not check sample rate as loading stats file for normalization to enable interpolation for different sample rate vocoder
2020-09-18 12:52:19 +02:00
mueller
6b0621c794
cleanup
2020-09-17 16:46:43 +02:00
mueller
e36a3067e4
add: save wavs instead feats to storage.
...
This is done in order to mitigate staleness when caching and loading from data storage
2020-09-17 14:14:30 +02:00
mueller
1511076fde
add: Configurable encoder dataset storage to reduce disk I/O
...
add: Averaged time for data loader to console and Tensorboard output
2020-09-17 12:29:38 +02:00
erogol
3660c57f1e
time seperable convolution encoder, huber loss for duration predictor
2020-09-17 03:10:58 +02:00
mueller
95d2906307
add: Mozilla Commonvoice, VoxCeleb1+2, LibriTTS to Speaker Encoder Training
2020-09-16 16:49:53 +02:00
mueller
c909ca3855
Improve runtime of __parse_items() from O(|speakers|*|items|) to O(|items|)
2020-09-16 15:55:55 +02:00
erogol
f1a75468c2
fix arguments
2020-09-12 04:00:25 +02:00
erogol
45fbc0d003
convolution encoder with GLU and res connections
2020-09-12 03:40:21 +02:00
erogol
72b8ac0ff6
remove redundant arguments
2020-09-12 03:37:47 +02:00
erogol
15e6ab3912
glow-tts module renaming updates
2020-09-12 03:33:36 +02:00
erogol
1b238f04b2
add gated conv encoder to glow-tts
2020-09-11 19:01:38 +02:00
erogol
14356d3250
glow-tts with relative pos encoding
2020-09-11 19:01:38 +02:00
erogol
43771a3a5c
remove redundant arguments
2020-09-11 19:01:38 +02:00
erogol
1dea2c9034
faster sequence masking
2020-09-11 19:01:38 +02:00
erogol
673ba74a80
glow tts training and inference fixes
2020-09-11 19:01:38 +02:00
erogol
d5c6d60884
synthesis update for glow tts
2020-09-11 19:01:37 +02:00
erogol
89d15bf118
merge glow-tts after rebranding
2020-09-11 19:01:37 +02:00
erogol
540d811dd5
solve pickling models after module name change
2020-09-11 12:03:39 +02:00
erogol
df19428ec6
rename the project to old TTS
2020-09-09 12:27:23 +02:00