Eren Gölge
|
e4f81d6856
|
Merge pull request #654 from kirianguiller/chinese-implementation
Chinese implementation (merge into dev)
|
2021-02-18 17:15:32 +01:00 |
kirianguiller
|
fb0655d1e7
|
modify according to PR reviews
|
2021-02-17 20:53:56 +01:00 |
kirianguiller
|
c4c7bc1b88
|
<add> Chinese mandarin implementation (tacotron2)
|
2021-02-17 20:53:56 +01:00 |
Eren Gölge
|
f6e6314910
|
add RUSLAN dataset preprocessor
|
2021-02-17 13:35:23 +00:00 |
Eren Gölge
|
918f007a11
|
docstring update
|
2021-02-12 12:04:07 +00:00 |
Eren Gölge
|
9cad435288
|
css10 dataset preprocessor
|
2021-02-09 15:11:26 +00:00 |
Eren Gölge
|
d49757faaa
|
linter updates
|
2021-02-05 13:10:43 +00:00 |
erogol
|
27a75de15f
|
update processors for loading attention maps
|
2021-01-06 13:19:40 +01:00 |
erogol
|
df180148e9
|
use noise augmentation in TTSDataset
|
2020-12-09 15:46:25 +01:00 |
erogol
|
7505c0ba27
|
muliprocess phoneme computation
|
2020-12-07 11:29:41 +01:00 |
erogol
|
20c86489d7
|
make static methods for faster multiprocess call
|
2020-12-07 11:29:10 +01:00 |
erogol
|
affe1c1138
|
setup training scripts for computing phonemes before training optionally. And define data_loaders before starting training and re-use them instead of re-define for every train and eval calls. This is to enable better instance filtering based on input length.
|
2020-12-07 11:26:57 +01:00 |
erogol
|
a757b203bc
|
fix longer phoneme seqs
|
2020-11-26 15:05:03 +01:00 |
erogol
|
7541d2ecaa
|
return eval split optional
|
2020-11-25 14:50:09 +01:00 |
Qingping Hou
|
b0b97d636f
|
speed up metafile build for voxceleb
|
2020-11-14 23:45:17 -08:00 |
erogol
|
9b0f441945
|
argument for returning no eval split
|
2020-11-12 12:52:27 +01:00 |
Edresson
|
d9540a5857
|
add blank token in sequence for encrease glowtts results
|
2020-10-25 15:08:28 -03:00 |
erogol
|
10258724d1
|
linter fixes
|
2020-09-22 03:54:16 +02:00 |
erogol
|
a6df617eb1
|
Merge branch 'glow-tts-amp-time_depth_conv' into dev
|
2020-09-21 14:23:45 +02:00 |
mueller91
|
9b4aac94a8
|
fix: linter issues
|
2020-09-21 12:13:02 +02:00 |
mueller
|
e36a3067e4
|
add: save wavs instead feats to storage.
This is done in order to mitigate staleness when caching and loading from data storage
|
2020-09-17 14:14:30 +02:00 |
mueller
|
1511076fde
|
add: Configurable encoder dataset storage to reduce disk I/O
add: Averaged time for data loader to console and Tensorboard output
|
2020-09-17 12:29:38 +02:00 |
mueller
|
95d2906307
|
add: Mozilla Commonvoice, VoxCeleb1+2, LibriTTS to Speaker Encoder Training
|
2020-09-16 16:49:53 +02:00 |
mueller
|
c909ca3855
|
Improve runtime of __parse_items() from O(|speakers|*|items|) to O(|items|)
|
2020-09-16 15:55:55 +02:00 |
erogol
|
89d15bf118
|
merge glow-tts after rebranding
|
2020-09-11 19:01:37 +02:00 |
erogol
|
df19428ec6
|
rename the project to old TTS
|
2020-09-09 12:27:23 +02:00 |