erogol
|
e0d4b88877
|
config update
|
2020-10-08 01:29:30 +02:00 |
erogol
|
4e93f90108
|
bug fix
|
2020-10-08 01:29:30 +02:00 |
erogol
|
bb9b70ee27
|
differential spectral loss and loss weight settings
|
2020-10-08 01:29:30 +02:00 |
erogol
|
e1eab1ce4b
|
print model r value as loading it
|
2020-10-07 13:34:21 +02:00 |
erogol
|
48a40c4730
|
remove unused import
|
2020-10-06 11:32:24 +02:00 |
erogol
|
a2606fbc22
|
format utils
|
2020-10-06 11:02:54 +02:00 |
Eren Gölge
|
4873601694
|
Merge pull request #531 from WeberJulian/french-cleaners
Adding support for french cleaners
|
2020-09-30 15:30:50 +02:00 |
Edresson
|
99d5a0ac07
|
add Speaker Conditional GST support
|
2020-09-29 16:09:27 -03:00 |
Julian WEBER
|
ea7c2e15c0
|
Adding french abbreviations
|
2020-09-29 15:43:39 +02:00 |
Julian WEBER
|
54b4031391
|
Merge remote-tracking branch 'origin/dev' into french-cleaners
|
2020-09-29 14:24:51 +02:00 |
Julian WEBER
|
da134eeee4
|
Subjective improvements
|
2020-09-29 14:20:52 +02:00 |
Julian WEBER
|
b2817e9e93
|
Adding french cleaners
|
2020-09-29 14:20:24 +02:00 |
Eren Gölge
|
cf02ace5b7
|
Merge pull request #530 from mueller91/fix_split_dataset
fix: split_dataset
|
2020-09-28 12:42:40 +02:00 |
erogol
|
154f90bc44
|
format speaker encoder imports
|
2020-09-28 11:19:19 +02:00 |
erogol
|
e097bc6c5d
|
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
|
2020-09-28 11:15:32 +02:00 |
Eren Gölge
|
8e2dc79c3a
|
Merge pull request #526 from mueller91/dev
Fix: Check storage params only for speaker encoder
|
2020-09-28 11:15:23 +02:00 |
erogol
|
6a70c63f24
|
correct glow-tts loss
|
2020-09-27 03:28:42 +02:00 |
erogol
|
665f7ca714
|
linter fix
|
2020-09-24 12:57:54 +02:00 |
mueller91
|
227b9c8864
|
fix: split_dataset() runtime reduced from O(N * |items|) to O(N) where N is the size of the eval split (max 500)
I notice a significant speedup on the initial loading of large datasets such as common voice (from minutes to seconds)
|
2020-09-23 23:27:51 +02:00 |
mueller91
|
cfeeef7a7f
|
fix: broken imports and missing files after merging in latest commits from mozilla/dev into mueller91/dev.
speaker_encoder's config.json and visuals.py are missing in the current dev branch of MozillaTTS, and some imports are broken.
|
2020-09-22 20:10:41 +02:00 |
mueller91
|
1fe5eb054f
|
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
Conflicts:
TTS/bin/train_encoder.py
requirements.txt
|
2020-09-22 19:58:53 +02:00 |
mueller91
|
df4caec4b7
|
add: check_config for speaker_encoder
|
2020-09-22 19:52:09 +02:00 |
WeberJulian
|
3c212be5a8
|
fix: fixing the RenamingUnpickler fix
|
2020-09-22 17:36:05 +02:00 |
mueller91
|
0ea7f4e2bd
|
fix: make speaker encoder's storage parameters non-restriced
|
2020-09-22 10:39:40 +02:00 |
mueller91
|
7029452228
|
fix: make speaker encoder's storage parameters non-restriced
|
2020-09-22 10:31:42 +02:00 |
erogol
|
10258724d1
|
linter fixes
|
2020-09-22 03:54:16 +02:00 |
erogol
|
a6df617eb1
|
Merge branch 'glow-tts-amp-time_depth_conv' into dev
|
2020-09-21 14:23:45 +02:00 |
erogol
|
8150d5727e
|
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
|
2020-09-21 14:21:55 +02:00 |
erogol
|
e0b9fa887f
|
glow-tts modules added
|
2020-09-21 14:15:40 +02:00 |
erogol
|
e4c6386603
|
change import for normalization layer
|
2020-09-21 13:09:52 +02:00 |
mueller91
|
9b4aac94a8
|
fix: linter issues
|
2020-09-21 12:13:02 +02:00 |
erogol
|
c008003506
|
do not check sample rate as loading stats file for normalization to enable interpolation for different sample rate vocoder
|
2020-09-18 12:52:19 +02:00 |
mueller
|
6b0621c794
|
cleanup
|
2020-09-17 16:46:43 +02:00 |
mueller
|
a273b1a210
|
add: add random noise to dataset
|
2020-09-17 14:23:40 +02:00 |
mueller
|
e36a3067e4
|
add: save wavs instead feats to storage.
This is done in order to mitigate staleness when caching and loading from data storage
|
2020-09-17 14:14:30 +02:00 |
mueller
|
1511076fde
|
add: Configurable encoder dataset storage to reduce disk I/O
add: Averaged time for data loader to console and Tensorboard output
|
2020-09-17 12:29:38 +02:00 |
erogol
|
3660c57f1e
|
time seperable convolution encoder, huber loss for duration predictor
|
2020-09-17 03:10:58 +02:00 |
mueller
|
95d2906307
|
add: Mozilla Commonvoice, VoxCeleb1+2, LibriTTS to Speaker Encoder Training
|
2020-09-16 16:49:53 +02:00 |
mueller
|
c909ca3855
|
Improve runtime of __parse_items() from O(|speakers|*|items|) to O(|items|)
|
2020-09-16 15:55:55 +02:00 |
mueller
|
d733b90255
|
Improve runtime of __parse_items() from O(|speakers|*|items|) to O(|items|)
|
2020-09-16 15:09:02 +02:00 |
maxbachmann
|
60ce862113
|
use difflib for string matching
|
2020-09-14 23:55:34 +02:00 |
erogol
|
f1a75468c2
|
fix arguments
|
2020-09-12 04:00:25 +02:00 |
erogol
|
7c2c4d6f27
|
pass x_mask to layer norm
|
2020-09-12 03:41:37 +02:00 |
erogol
|
45fbc0d003
|
convolution encoder with GLU and res connections
|
2020-09-12 03:40:21 +02:00 |
erogol
|
498a3ea36f
|
fix condition check
|
2020-09-12 03:39:01 +02:00 |
erogol
|
72b8ac0ff6
|
remove redundant arguments
|
2020-09-12 03:37:47 +02:00 |
erogol
|
15e6ab3912
|
glow-tts module renaming updates
|
2020-09-12 03:33:36 +02:00 |
erogol
|
1b238f04b2
|
add gated conv encoder to glow-tts
|
2020-09-11 19:01:38 +02:00 |
erogol
|
14356d3250
|
glow-tts with relative pos encoding
|
2020-09-11 19:01:38 +02:00 |
erogol
|
43771a3a5c
|
remove redundant arguments
|
2020-09-11 19:01:38 +02:00 |
erogol
|
1dea2c9034
|
faster sequence masking
|
2020-09-11 19:01:38 +02:00 |
erogol
|
673ba74a80
|
glow tts training and inference fixes
|
2020-09-11 19:01:38 +02:00 |
erogol
|
d5c6d60884
|
synthesis update for glow tts
|
2020-09-11 19:01:37 +02:00 |
erogol
|
89d15bf118
|
merge glow-tts after rebranding
|
2020-09-11 19:01:37 +02:00 |
erogol
|
f9001a4bdd
|
refactor and fix compat issues for speaker encoder
|
2020-09-11 17:17:07 +02:00 |
erogol
|
540d811dd5
|
solve pickling models after module name change
|
2020-09-11 12:03:39 +02:00 |
erogol
|
df19428ec6
|
rename the project to old TTS
|
2020-09-09 12:27:23 +02:00 |