Thorsten Mueller
3eb00e8d93
Set out_path to be required param.
2021-03-08 02:49:15 +01:00
Alexander Korolev
ace430d5e6
fix device mismatch wavegrad training
...
this should fixe the device mismatch as seen here https://github.com/mozilla/TTS/issues/622#issue-789802916
2021-03-08 02:49:15 +01:00
Eren Gölge
83143fbe39
fix #638
2021-03-08 02:48:31 +01:00
Eren Gölge
30c3bef3f9
move hubconf
2021-03-08 02:48:31 +01:00
Eren Gölge
bbea6a0884
hubconf.py and load .models.json from the defualt location by mange.py
2021-03-08 02:48:31 +01:00
Eren Gölge
90d4f08d6c
reorder imports
2021-03-08 02:48:31 +01:00
Eren Gölge
db231c83fc
distill import statement, check python version in setup.py
2021-03-08 02:48:31 +01:00
Thorsten Mueller
915ec1faac
Added info if model already downloaded in --list_models
2021-03-08 02:48:31 +01:00
Alexander Korolev
b4bc5f6eb1
update fixed stopnet_pos_weight parameter
...
config parameter c.stopnet_pos_weight has currently no effect as it is not used.
2021-03-08 02:48:31 +01:00
Eren Gölge
534e3c67c6
README update, set default models for synthesize.py and server.py. Disable verbose for ap init.
2021-03-08 02:48:31 +01:00
kirianguiller
7f36d91131
update chinese model
2021-03-01 14:55:05 +01:00
Eren Gölge
547bfc4ce9
bug fix
2021-02-18 18:24:03 +00:00
Eren Gölge
adaeec57ec
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
2021-02-18 17:21:09 +00:00
Eren Gölge
5b70c8ba4f
enable backward compat for loading the best model
2021-02-18 17:20:36 +00:00
Eren Gölge
e4f81d6856
Merge pull request #654 from kirianguiller/chinese-implementation
...
Chinese implementation (merge into dev)
2021-02-18 17:15:32 +01:00
kirianguiller
22a6bbfa80
remove gst handling in synthetizer.py class
2021-02-17 20:53:56 +01:00
kirianguiller
3911b87e54
remove re.Match typing in '_number_replace()'
2021-02-17 20:53:56 +01:00
kirianguiller
fb0655d1e7
modify according to PR reviews
2021-02-17 20:53:56 +01:00
kirianguiller
c4c7bc1b88
<add> Chinese mandarin implementation (tacotron2)
2021-02-17 20:53:56 +01:00
Eren Gölge
d0454461de
Merge branch 'pr/gerazov/650-2' into dev
2021-02-17 13:40:45 +00:00
Eren Gölge
a8ea0ea6ce
Docstrings for audioprocessor
2021-02-17 13:35:41 +00:00
Eren Gölge
f6e6314910
add RUSLAN dataset preprocessor
2021-02-17 13:35:23 +00:00
Eren Gölge
ce0c5eccbd
do not test server and modelManager until fixing #657
2021-02-17 00:35:43 +00:00
gerazov
61c88beb94
refactored keep_all_best
2021-02-15 18:40:17 +01:00
Eren Gölge
eb543c027e
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
2021-02-15 17:06:40 +00:00
Eren Gölge
8a106e0527
fix #655
2021-02-15 17:06:03 +00:00
Eren Gölge
216945e653
Merge pull request #647 from adonispujols/patch-1
...
Easy Fix for #454 (which was somehow deleted?)
2021-02-15 13:17:17 +01:00
Eren Gölge
06a3ba2fe2
linter update
2021-02-15 12:10:19 +00:00
Eren Gölge
7f58fa365b
Merge branch 'save_characters' into dev
2021-02-15 12:07:28 +00:00
Eren Gölge
ff218e2370
remove redundancy
2021-02-15 12:07:02 +00:00
Eren Gölge
80af8ca5e1
Update TTS/utils/arguments.py
...
Co-authored-by: Jörg Thalheim <Mic92@users.noreply.github.com>
2021-02-15 13:03:59 +01:00
Eren Gölge
3b6ce04332
Update TTS/bin/find_unique_chars.py
...
Co-authored-by: Jörg Thalheim <Mic92@users.noreply.github.com>
2021-02-15 13:02:29 +01:00
Eren Gölge
dc3596dad4
model_manager tests
2021-02-15 11:29:22 +00:00
Eren Gölge
77e630348e
author , license and contact info in .model.json
2021-02-15 11:02:21 +00:00
Eren Gölge
e1bc823e44
Merge branch 'pr/nmstoker/652' into dev
2021-02-15 10:57:12 +00:00
nmstoker
33bcdc6ff8
Updating models list to include EK1 TTS/vocoder
2021-02-14 23:44:05 +00:00
Eren Gölge
420901f4c2
linter fixes
2021-02-12 14:41:17 +00:00
Eren Gölge
4244096ccb
update test_text_processing for espeak-ng
2021-02-12 14:07:26 +00:00
Eren Gölge
b28c724c04
remove _phoneme_punctuations
2021-02-12 12:10:57 +00:00
Eren Gölge
7ab527d17e
save default model chars to the training config file
2021-02-12 12:06:46 +00:00
Eren Gölge
593cedee14
parse_characters function
2021-02-12 12:05:56 +00:00
Eren Gölge
2abfff17f9
enable saving model characters in io.py
2021-02-12 12:04:41 +00:00
Eren Gölge
918f007a11
docstring update
2021-02-12 12:04:07 +00:00
Eren Gölge
e774f68aee
save used model characters to the checkpoints
2021-02-12 12:03:42 +00:00
gerazov
0e78e31dbf
reformated docstrings in arguments.py
2021-02-12 11:36:01 +01:00
gerazov
310d18325e
brushed up printing model load path and best loss path
2021-02-12 10:55:45 +01:00
Eren Gölge
8b6fd76ad2
find unique characters in a dataset
2021-02-12 09:46:11 +00:00
gerazov
af46727517
loading last checkpoint/best_model works, deleting last best models options added, loading last best_loss added
2021-02-12 02:12:00 +01:00
Eren Gölge
a1e595790d
use default vocoders in server.pu
2021-02-11 15:31:39 +00:00
Eren Gölge
8aa6a0decb
set an output_sample_rate in synthesizer and use it for writing the wav
...
file
2021-02-11 15:28:07 +00:00
Eren Gölge
0c52d27d65
return the json entry of the downloaded model
2021-02-11 15:27:41 +00:00
Eren Gölge
1649ad3431
save_wav with a custom sampling rate
2021-02-11 15:27:20 +00:00
Eren Gölge
43f54d2dce
fix make_symbols
2021-02-11 15:26:52 +00:00
Eren Gölge
0657b38111
use default vocoder in synthesize.py
2021-02-11 15:26:17 +00:00
Eren Gölge
2043a9b5f5
define default vocoders
2021-02-11 15:25:55 +00:00
Eren Gölge
ff27690ca7
bug fix
2021-02-11 13:43:29 +00:00
Eren Gölge
bc131208be
fix spelling of a def argument and parse phonemes from config.json if
...
use_phonemes is True
2021-02-11 13:04:47 +00:00
Eren Gölge
f1799dbd60
docstring update
2021-02-11 11:25:31 +00:00
Eren Gölge
3baec4ea96
add missing phonemes to test_config.json
2021-02-11 11:14:39 +00:00
Eren Gölge
a3d1e65b34
Merge branch 'pr/adonispujols/646' into dev
2021-02-11 10:37:29 +00:00
Eren Gölge
3c2e13ca5c
fix the default vocoder name
2021-02-11 10:36:52 +00:00
Adonis Pujols
48011a8b58
add encoding="utf-8"
2021-02-11 05:26:06 -05:00
Adonis Pujols
b29a7e9645
spelling error. should be multiband not mulitband
2021-02-11 04:49:28 -05:00
Adonis Pujols
6c824a6629
spelling error. should be multiband not mulitband
2021-02-11 04:48:53 -05:00
Eren Gölge
b08b8ca2a1
add russian phoneme char
2021-02-10 13:30:59 +00:00
Eren Gölge
9cad435288
css10 dataset preprocessor
2021-02-09 15:11:26 +00:00
Eren Gölge
cea5e517f2
download github model releases by model manager
2021-02-09 14:24:14 +00:00
Eren Gölge
c619859a3f
linter fixes
2021-02-09 11:43:17 +00:00
gerazov
e507373b55
final final fixes
2021-02-06 23:08:47 +01:00
gerazov
ad17dc9e76
final fixes
2021-02-06 23:05:01 +01:00
gerazov
8fdd08ea15
updated to current dev
2021-02-06 22:59:52 +01:00
gerazov
2705d27b28
changed train scripts
2021-02-06 22:29:30 +01:00
gerazov
4f8f274d6e
restructured arg parsing and processing to utils
2021-02-06 22:25:56 +01:00
Eren Gölge
e7e880f514
fix gdown
2021-02-05 13:42:24 +00:00
Eren Gölge
f4f6290eec
Merge branch 'pr/gerazov/641' into dev
2021-02-05 13:14:49 +00:00
Eren Gölge
d49757faaa
linter updates
2021-02-05 13:10:43 +00:00
Branislav Gerazov
f063545325
improve robustness of defining wavernn in config file
2021-02-05 13:26:33 +01:00
Branislav Gerazov
24ffa9e9f6
update wavernn test config, delete cap=True
2021-02-05 13:10:02 +01:00
Branislav Gerazov
cb77aef36c
waveRNN fix
2021-02-04 09:52:03 +01:00
Thorsten Mueller
d74866cb8e
Merge remote-tracking branch 'upstream/dev' into dev
...
Fix for circleci error mentioned in PR https://github.com/mozilla/TTS/pull/637
2021-02-02 19:40:18 +01:00
Thorsten Mueller
a82152eef3
Ups. Added missing ,
2021-02-02 19:29:16 +01:00
Thorsten Mueller
4cb4fcf02c
Set out_path to be required param.
2021-02-02 19:29:16 +01:00
Thorsten Mueller
c75ea74914
Added info if model already downloaded in --list_models
2021-02-02 19:29:16 +01:00
Eren Gölge
2edab4b3f9
disable pw in audio that causes numpy issue
2021-02-01 17:05:03 +00:00
Eren Gölge
5c46543765
linter fixes and version updates for deps
2021-02-01 13:18:56 +00:00
Eren Gölge
8774e37444
unpin cython version and commentout pyworld in audio.py causing dep
...
issues
2021-02-01 11:34:05 +00:00
Eren Gölge
5beed0ddcd
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
2021-02-01 11:27:14 +00:00
Eren Gölge
c7407571fa
fix #638
2021-02-01 10:05:55 +00:00
Eren Gölge
dfdac1def9
Merge pull request #636 from thorstenMueller/dev
...
Set out_path to be required param in compute_statistics.py.
2021-01-29 18:08:31 +01:00
Thorsten Mueller
44c4a49745
Set out_path to be required param.
2021-01-29 17:23:38 +01:00
Eren Gölge
536366dc0a
Merge pull request #635 from SanjaESC/patch-1
...
fix device mismatch wavegrad training
2021-01-29 16:42:25 +01:00
Eren Gölge
0354b6f35e
move hubconf
2021-01-29 15:28:32 +00:00
Eren Gölge
aa5f24608a
hubconf.py and load .models.json from the defualt location by mange.py
2021-01-29 15:28:26 +00:00
Alexander Korolev
e81ebec7a8
fix device mismatch wavegrad training
...
this should fixe the device mismatch as seen here https://github.com/mozilla/TTS/issues/622#issue-789802916
2021-01-29 15:18:59 +01:00
Eren Gölge
a926aa106d
reorder imports
2021-01-29 01:36:21 +01:00
Eren Gölge
8a6eee7fec
distill import statement, check python version in setup.py
2021-01-28 17:04:08 +01:00
Eren Gölge
131a163c95
Merge pull request #628 from thorstenMueller/dev
...
Added info if model already downloaded in --list_models
2021-01-28 13:10:06 +01:00
Alexander Korolev
ca28e05ed7
update fixed stopnet_pos_weight parameter
...
config parameter c.stopnet_pos_weight has currently no effect as it is not used.
2021-01-27 16:33:25 +01:00
Thorsten Mueller
ccbd542eb0
Added info if model already downloaded in --list_models
2021-01-27 16:19:02 +01:00
Eren Gölge
25c86ca715
README update, set default models for synthesize.py and server.py. Disable verbose for ap init.
2021-01-27 11:47:03 +01:00
Eren Gölge
4f32e77006
platform indep. way to fetch user data folder
2021-01-26 17:32:43 +01:00
Eren Gölge
0117c811a9
add a button to index.html to see the model details
2021-01-26 12:33:27 +01:00
Eren Gölge
a3adcaccdb
Merge branch 'pr/thorstenMueller/623' into dev
2021-01-26 12:19:39 +01:00
Eren Gölge
b464cab9b8
setup.py update and pylint fixes
2021-01-26 02:57:50 +01:00
Eren Gölge
660d61aeeb
maximum_path_numpy and CYTHON adabtable import
2021-01-26 02:57:07 +01:00
Eren Gölge
877f0bbfba
manifest.in update
2021-01-26 02:56:55 +01:00
Eren Gölge
82e029529e
fix manifest file
2021-01-25 13:27:54 +01:00
Eren Gölge
57b668fd86
fixing dome pypi issues
2021-01-25 13:06:12 +01:00
Eren Gölge
60c1bb93d9
fixes before first PyPI release
2021-01-25 11:16:20 +01:00
Thorsten Mueller
afb7db2a1d
Removed unneeded check and removed specific taco2 model name.
2021-01-22 16:22:50 +01:00
Eren Gölge
fae10309e4
Merge pull request #624 from SanjaESC/patch-3
...
Update train_tacotron.py
2021-01-22 13:29:09 +01:00
Eren Gölge
5ee73c2bae
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
2021-01-22 13:26:27 +01:00
Eren Gölge
5fb611ef40
static image for server index.html
2021-01-22 03:01:53 +01:00
Eren Gölge
ca647cf222
Model Manager to download released models
2021-01-22 02:35:43 +01:00
Eren Gölge
ca8ad9c21e
rename audio._normalize to audio.normalize
2021-01-22 02:33:19 +01:00
Eren Gölge
c990b3a59c
linter fixes and test fixes
2021-01-22 02:32:35 +01:00
Alexander Korolev
f251dc8c0e
Update train_tacotron.py
...
When attempting to fine-tune a model with "prenet_type": "bn" that was originally trained with "prenet_type": "original", a RuntimeError is thrown that stops the training.
By catching the RuntimeError, the required layers can be partially restored and the training will continue without any problems.
2021-01-21 21:16:30 +01:00
Eren Gölge
0ab2eb2664
use synthesizer in both synthesize.py and server.pu
2021-01-21 15:54:33 +01:00
Eren Gölge
9addfabc43
wavernn load_checkpoint function
2021-01-21 15:31:13 +01:00
Eren Gölge
50fee59a2c
update synthesizer.py for better interfacing to different models
2021-01-21 15:30:49 +01:00
Eren Gölge
007a4d7139
remove 3rd paty wavernn support from server.py and add ModelManager arguments
2021-01-21 15:30:16 +01:00
Eren Gölge
6b6e989fd2
update server readme
2021-01-21 15:29:46 +01:00
Thorsten Mueller
e414582be6
Added option for server ui details page.
2021-01-20 21:56:40 +01:00
root
1bc8fbbd3c
set eval mode whe nloading models
2021-01-20 02:14:18 +00:00
root
5bd7238153
interpolate spectrogram in vocoder generic utils for matching sample
...
rates
2021-01-20 02:13:01 +00:00
root
ca3743539a
load_checkpoint func for vocoder models
2021-01-20 02:12:29 +00:00
root
ea39715305
read_json_with_comments
2021-01-20 02:11:55 +00:00
root
563bc921d8
optional verbose for audio.py init
2021-01-20 02:11:24 +00:00
root
1faf565e3a
add load_checkpoint func to tts models
2021-01-20 02:10:56 +00:00
root
5c87753e88
glow-tts fix for saving inverse weight
2021-01-20 02:09:42 +00:00
root
3d30dae8f3
.models.json and synthesize.py update for interfacing with model manager
2021-01-20 02:08:58 +00:00
gerazov
b2b4828f17
set requires_grad=False
2021-01-16 19:46:04 +01:00
gerazov
c96f7a2614
TorchSTFT to device fix
2021-01-16 12:21:16 +01:00
root
7beaacc55b
update compute_attention_masks.py
2021-01-13 10:03:57 +00:00
erogol
428c224b88
commet update
2021-01-12 17:31:04 +01:00
erogol
bbc8d665a1
move attention layers to a sperate file
2021-01-11 17:27:30 +01:00
erogol
79c841ccd3
mass refactoring and update
2021-01-11 17:26:58 +01:00
erogol
1d961d6f8a
cladd renaming
2021-01-11 17:26:11 +01:00
erogol
c0a2aa68d3
formatting
2021-01-11 17:25:39 +01:00
erogol
b206162d11
more docstrings
2021-01-11 17:25:04 +01:00
erogol
6e9043c5d2
rename convbnblocks and handle none mask
2021-01-11 17:22:34 +01:00
erogol
921fa5db92
remove attentions from common layers
2021-01-11 15:06:42 +01:00
erogol
cc2b1e043d
docstrings for common layers
2021-01-11 15:06:12 +01:00
erogol
a6f40fef2e
stage missing files
2021-01-08 16:02:56 +01:00
erogol
d382d759b3
small fixes and test fixes
2021-01-08 15:48:40 +01:00
erogol
a6259041d3
docstring for speedyspeech
2021-01-07 14:35:22 +01:00
erogol
de2a542f83
glow-tts bug fix
2021-01-07 13:40:32 +01:00
erogol
14d33662ea
input shapes for tacotron models
2021-01-06 13:19:40 +01:00
erogol
f288e9a260
docstrings for taoctron models
2021-01-06 13:19:40 +01:00
erogol
5a45af48f1
fix
2021-01-06 13:19:40 +01:00
erogol
e7fad928e7
doc strings for the all glow-tts layers
2021-01-06 13:19:40 +01:00
erogol
d3b7284be4
glow-tts comments and refactoring
2021-01-06 13:19:40 +01:00
erogol
7586fbc4de
SS refactoring
2021-01-06 13:19:40 +01:00
erogol
e82d31b6ac
glow ttss refactoring
2021-01-06 13:19:40 +01:00
erogol
29f4329d7f
update glow-tts layers and add some comments
2021-01-06 13:19:40 +01:00
erogol
29cf933831
update SS condif
2021-01-06 13:19:40 +01:00
erogol
228ada04b5
update glow-tts ljspeech config
2021-01-06 13:19:40 +01:00
erogol
f352b3534c
make noise augmentation optional
2021-01-06 13:19:40 +01:00
erogol
71c382be14
copy model scale stats file with config.json to the trianing folder, fixed for model inits
2021-01-06 13:19:40 +01:00
erogol
aa40fe1aa0
SS model refacotring for multi speaker
2021-01-06 13:19:40 +01:00
erogol
eb555855e4
small fixes
2021-01-06 13:19:40 +01:00
erogol
5901a00576
argument rename
2021-01-06 13:19:40 +01:00
erogol
4ef083f0f1
select decoder type for SS
2021-01-06 13:19:40 +01:00
erogol
d5a0190c4b
update copy_config_file to copy_model_files
2021-01-06 13:19:40 +01:00
erogol
8971c59b2d
plot eval alignment score right
2021-01-06 13:19:40 +01:00
erogol
3fa408a5ea
change order BN + ReLU to ReLU + BN for SS
2021-01-06 13:19:40 +01:00
erogol
ac5c9217d1
positional encoding masking for SS
2021-01-06 13:19:40 +01:00
erogol
fede46e96e
pylint and test fixes
2021-01-06 13:19:40 +01:00
erogol
2abe3df153
compute_attention_masks.py
2021-01-06 13:19:40 +01:00
erogol
cf869e8922
add SS files
2021-01-06 13:19:40 +01:00
erogol
e4680e1b99
plot float16 alignments
2021-01-06 13:19:40 +01:00
erogol
13c6665c92
inference for SS
2021-01-06 13:19:40 +01:00
erogol
30788960a8
check SS model parameters
2021-01-06 13:19:40 +01:00
erogol
5cae2c5742
make optional position encoding for speedyspeech
2021-01-06 13:19:40 +01:00
erogol
dc4a16d62e
speedy speehc losses
2021-01-06 13:19:40 +01:00
erogol
d62cac7252
fix glow-tts prenet bug fix
2021-01-06 13:19:40 +01:00
erogol
a1d5a9ddda
config update tyo use noise for augmentation
2021-01-06 13:19:40 +01:00
erogol
022af74d74
update prompt msg
2021-01-06 13:19:40 +01:00
erogol
57ef53bef3
update argumnet check for non tacotron models
2021-01-06 13:19:40 +01:00
erogol
27a75de15f
update processors for loading attention maps
2021-01-06 13:19:40 +01:00
erogol
fa6907fa0e
update glow-tts parameters and fix rel-attn-win size
2021-01-06 13:19:40 +01:00
erogol
7b20d8cbd3
implement residual BN convolution and add it as an alternative encoder for glow-tts. also generic layers to layers/generic
2021-01-06 13:19:40 +01:00
erogol
973754d893
fix for init glow-tts
2021-01-06 13:19:40 +01:00
erogol
f81af4eb0d
config update disable guided attention for dynamic conv attention
2021-01-06 13:19:40 +01:00
erogol
29b17c0808
bug fix for gradual training
2021-01-06 13:19:40 +01:00
erogol
5c50e104d6
config update
2021-01-06 13:19:40 +01:00
erogol
6478d552dc
tacotron training bug fix
2021-01-06 13:19:40 +01:00
erogol
1dd086577a
tacotron training bug fix
2021-01-06 13:18:41 +01:00
erogol
fa20638083
config for ljspeech dynamic conv attention
2021-01-06 13:18:41 +01:00
erogol
070146e143
add monotonic dynamic convolution attention
2021-01-06 13:18:41 +01:00
erogol
18392bc13a
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
2021-01-06 13:18:08 +01:00
Thorsten Mueller
f673f8f74d
Added support for npy output from tune-wavegrad
2020-12-19 22:51:22 +01:00
Thorsten Mueller
2aa0354b44
Fix for 'NoneType' object has no attribute 'to'
2020-12-19 22:37:03 +01:00
Thorsten Mueller
28a64221ea
Improve robostness on cpu / gpu model mix
2020-12-19 22:23:28 +01:00
erogol
8293751a38
remove mozilla from server page
2020-12-17 12:28:28 +01:00
erogol
639fa29261
update speaker id casting for glow-tts
2020-12-14 16:58:47 +01:00
erogol
999120ecdf
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
2020-12-12 18:50:14 +01:00
erogol
f611e6ac01
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
2020-12-12 18:47:59 +01:00
Jörg Thalheim
62fd4ca70d
inflect negative numbers correctly
2020-12-10 16:47:51 +01:00
Jörg Thalheim
6646682650
cleaners: expand english time
2020-12-10 14:53:20 +01:00
Jörg Thalheim
76138687d3
expand more currencies
2020-12-10 14:53:20 +01:00
erogol
a2859b7ddc
update config args checks
2020-12-10 13:52:57 +01:00
erogol
788cd6f902
fix multi-speaker glow-tts inference
2020-12-10 02:05:48 +01:00
erogol
3d5066e2b8
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
2020-12-10 00:31:03 +01:00
erogol
92cc9630d7
fix glow-tts synthesis for DPP
2020-12-10 00:30:34 +01:00
Eren Gölge
2473b2dc62
Merge pull request #559 from krzim/patch-1
...
Fix import to grab the encoder model save function
2020-12-10 00:19:32 +01:00
erogol
53679b706d
glow-tts distributed fix
2020-12-09 23:39:09 +01:00
erogol
62bc171db5
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
2020-12-09 15:46:57 +01:00
erogol
df180148e9
use noise augmentation in TTSDataset
2020-12-09 15:46:25 +01:00
Thorsten Mueller
e39628ce2f
Limit filenames to 10 chars
2020-12-08 18:44:19 +01:00
erogol
06612ce305
test fixes
2020-12-07 15:57:34 +01:00
erogol
0252a07fa6
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
2020-12-07 11:31:55 +01:00
erogol
482e725752
sync torch calls before logging training results
2020-12-07 11:30:19 +01:00
erogol
7505c0ba27
muliprocess phoneme computation
2020-12-07 11:29:41 +01:00
erogol
20c86489d7
make static methods for faster multiprocess call
2020-12-07 11:29:10 +01:00
erogol
affe1c1138
setup training scripts for computing phonemes before training optionally. And define data_loaders before starting training and re-use them instead of re-define for every train and eval calls. This is to enable better instance filtering based on input length.
2020-12-07 11:26:57 +01:00
Alexander Korolev
f42ca2b73f
Update wavegrad.py
...
This should fix the issue https://github.com/mozilla/TTS/issues/581
2020-12-04 16:43:39 +01:00
erogol
7c3cdced1a
make speaker_mapping a global variable to prevent reload. Fix glow-tts training
2020-12-01 03:23:25 +01:00
Thorsten Mueller
06a389bc08
Added option for saving raw spectograms
2020-11-27 15:49:55 +01:00
erogol
a757b203bc
fix longer phoneme seqs
2020-11-26 15:05:03 +01:00
erogol
7b0a93d2f8
fix
2020-11-26 11:44:52 +01:00
erogol
0c6f7e4c77
resample audio if flag set true
2020-11-26 11:30:48 +01:00
erogol
f6c96b0ac2
Merge branch 'dev'
2020-11-25 15:29:06 +01:00
erogol
e3b7157146
remove contextlib
2020-11-25 15:22:01 +01:00
erogol
e3eda159d1
wavegrad_dataset update
2020-11-25 14:50:50 +01:00
erogol
a1e4ee18f9
convert float16 to float32 for plotting spectrograms
2020-11-25 14:50:28 +01:00
erogol
7541d2ecaa
return eval split optional
2020-11-25 14:50:09 +01:00
erogol
4b92ac0f92
tune_wavegrad update
2020-11-25 14:49:48 +01:00
erogol
d8c1b5b73d
print max lengths in tacotron training
2020-11-25 14:49:07 +01:00
erogol
1229554c42
use native amp
2020-11-25 14:48:54 +01:00
erogol
8a820930c6
compute_embedding update
2020-11-25 14:46:08 +01:00
erogol
aa2b31a1b0
use 'enabled' argument to control autocast
2020-11-17 14:22:01 +01:00
erogol
d9d04d892b
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
2020-11-17 14:17:24 +01:00
erogol
8b0e0846a3
temporary travis check
2020-11-17 14:17:03 +01:00
Qingping Hou
b0b97d636f
speed up metafile build for voxceleb
2020-11-14 23:45:17 -08:00
erogol
a2a142dc39
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
2020-11-14 13:02:19 +01:00
erogol
c65712426a
change noise scheduling for wavegrad. Compute beta values externally to enable better flexibility
2020-11-14 13:01:10 +01:00
erogol
5a59467f34
scaler fix for wavegrad and wavernn. Save and load scaler
2020-11-14 13:00:35 +01:00
erogol
d8511efa8f
use native amp for tacotron training
2020-11-14 12:59:28 +01:00
Qingping Hou
0cc3650ef6
support loading config in yaml
2020-11-14 00:13:53 -08:00
erogol
6cc464ead6
fix ton of tesnting bugs
2020-11-12 16:33:29 +01:00
erogol
25551c4634
change wavernn generate to inference
2020-11-12 12:52:52 +01:00
erogol
9b0f441945
argument for returning no eval split
2020-11-12 12:52:27 +01:00
erogol
a7aefd5c50
use pytorch amp for mixed precision training for Tacotron
2020-11-12 12:51:56 +01:00
erogol
67e2b664e5
compute embeddings and create speakers.json
2020-11-12 12:51:17 +01:00
erogol
f8fd300b3e
bug fix
2020-11-10 12:53:39 +01:00
erogol
016d3503da
compute embeddings with speaker encoder
2020-11-10 12:51:02 +01:00
erogol
21364331d2
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
2020-11-09 13:31:12 +01:00
erogol
c76a617072
linter updates
2020-11-09 13:18:35 +01:00
erogol
ea976b0543
python compat update for contextlib
2020-11-06 13:34:11 +01:00
erogol
c80225544e
tune wavegrad to fine the best noise schedule for inferece
2020-11-06 13:04:46 +01:00
erogol
d94782a076
reset the way ga_loss is stored in return_dict
2020-11-02 13:18:56 +01:00
erogol
a108d0ee81
check nan loss in glow-tts loss
2020-11-02 13:12:19 +01:00
erogol
b8ac9aba9d
check against NaN loss in tacotron_loss
2020-11-02 12:44:41 +01:00
erogol
ef04d7fae7
bug fix for wavernn training
2020-10-30 14:08:41 +01:00
erogol
a44ef58aea
wavegrad weight norm refactoring
2020-10-30 13:23:24 +01:00
erogol
183fe56d95
Merge branch 'ssim_loss' into dev
2020-10-29 23:49:09 +01:00
krzim
2202e171c5
Fix import to grab the encoder model save function
...
I saw that this was recently changed but I'm not sure if it should have been. This is the correct function given the arguments provided to it in the train loop.
2020-10-29 18:03:11 -04:00
erogol
73581cd94c
renaming train scripts and updating tests
2020-10-29 16:50:07 +01:00
erogol
39c71ee8a9
wavegrad refactoring, fixing tests for glow-tts and wavegrad
2020-10-29 15:47:15 +01:00
erogol
946a0c0fb9
bug fixes for single speaker glow-tts, enable torch based amp. Make amp optional for wavegrad. Bug fixes for synthesis setup for glow-tts
2020-10-29 15:45:50 +01:00
erogol
14c2381207
weight norm and torch based amp training for wavegrad
2020-10-29 12:31:43 +01:00
erogol
b76a0be97a
wavegrad model and layers refactoring
2020-10-29 12:31:43 +01:00
erogol
dc2825dfb2
wavegrad dataset update
2020-10-29 12:31:43 +01:00
erogol
5b5b9fcfdd
wavegrad config updates
2020-10-29 12:31:43 +01:00
erogol
c8a4c771a8
train wavegrad updates
2020-10-29 12:31:43 +01:00
erogol
670f44aa18
enable compute stats by vocoder config
2020-10-29 12:31:43 +01:00
erogol
f79bbbbd00
use Adam for wavegras instead of RAdam
2020-10-29 12:31:43 +01:00
erogol
7bcdb7ac35
wavegrad updates
2020-10-29 12:31:43 +01:00
erogol
a1582a0e12
fix distributed training for train_* scripts
2020-10-29 12:31:43 +01:00
erogol
193b81b273
add universal_fullband_melgan config
2020-10-29 12:30:37 +01:00
erogol
e02cd6a220
initial wavegrad layers model and trainig script
2020-10-29 12:30:37 +01:00
erogol
ac57eea928
add wavegrad to vocoder generators
2020-10-29 12:30:37 +01:00
erogol
e723b99888
handle distributed model as saving
2020-10-29 12:30:37 +01:00
Eren Gölge
26c18b61c9
Merge pull request #553 from Edresson/dev
...
bug fix in the inference with GlowTTS
2020-10-28 18:49:31 +01:00
erogol
fdaed45f58
optional loss masking for stoptoken predictor
2020-10-28 18:40:54 +01:00
erogol
e49cc3bbcd
bug fix
2020-10-28 18:34:34 +01:00
erogol
59e1cf99d0
config update and ssim implementation
2020-10-28 18:30:00 +01:00
erogol
9cef923d99
ssim loss for tacotron models
2020-10-28 15:24:18 +01:00
erogol
9d0ae2bfb4
wavernn dataloader handling for short samples and mixed precision training
2020-10-28 12:31:01 +01:00
Edresson
f01502a9db
bug fix in glowTTS sythesize
2020-10-27 16:30:16 -03:00
Eren Gölge
f4b8170bd1
Merge pull request #545 from Edresson/dev
...
GlowTTS zeroshot TTS support
2020-10-27 15:23:41 +01:00
erogol
a6f564c8c8
pylint fixes
2020-10-27 12:35:10 +01:00
erogol
0becef4b58
small updates
2020-10-27 12:17:38 +01:00
sanjaesc
2ee47e9568
fix pylint once again
2020-10-27 12:17:38 +01:00
sanjaesc
1e646135ca
add model params to config
2020-10-27 12:17:38 +01:00
sanjaesc
bef3f2020b
compute audio feat on dataload
2020-10-27 12:17:38 +01:00
sanjaesc
7c72562fe7
fix travis + pylint tests
2020-10-27 12:17:38 +01:00
sanjaesc
91e5f8b63d
added to device cpu/gpu + formatting
2020-10-27 12:17:38 +01:00
sanjaesc
016a77fcf2
fix formatting + pylint
2020-10-27 12:17:38 +01:00
erogol
8de7c13708
fix no loss masking loss computation
2020-10-27 12:17:38 +01:00
sanjaesc
e8294cb9db
fixing pylint errors
2020-10-27 12:17:38 +01:00
sanjaesc
878b7c373e
added feature preprocessing if not set in config
2020-10-27 12:17:38 +01:00
sanjaesc
e495e03ea1
some minor changes to wavernn
2020-10-27 12:17:38 +01:00
Alex K
9c3c7ce2f8
wavernn stuff...
2020-10-27 12:17:38 +01:00
Alex K
6378fa2b07
add initial wavernn support
2020-10-27 12:17:38 +01:00
Edresson
89e9bfe3a2
add text processing blank token test
2020-10-26 17:41:23 -03:00
Edresson
d9540a5857
add blank token in sequence for encrease glowtts results
2020-10-25 15:08:28 -03:00
Edresson
fbea058c59
add parse speakers function
2020-10-24 16:10:05 -03:00
Edresson
07345099ee
GlowTTS zero-shot TTS Support
2020-10-24 15:58:39 -03:00
Alexander Korolev
47d74ced1c
Update losses.py
...
Seems like in the latest dev merge, this change was reverted. Any specific reason for this?
Without it the problem as stated here https://github.com/mozilla/TTS/issues/473 occurs.
2020-10-23 14:15:01 +02:00
ayush-1506
2a3559f02b
Fix readme and config file
2020-10-21 13:43:49 +05:30
Edresson
b7f9ebd32b
add check arguments for GlowTTS and multispeaker training bug fix
2020-10-19 17:17:58 -03:00
erogol
c2c4126a18
remove merge conflicts
2020-10-08 01:35:27 +02:00
erogol
c5074cfd8e
general purpose distribute.py
2020-10-08 01:30:42 +02:00
erogol
6f0654f9a8
differential spectral loss
2020-10-08 01:30:42 +02:00
erogol
e0d4b88877
config update
2020-10-08 01:29:30 +02:00
erogol
4e93f90108
bug fix
2020-10-08 01:29:30 +02:00
erogol
bb9b70ee27
differential spectral loss and loss weight settings
2020-10-08 01:29:30 +02:00
erogol
e1eab1ce4b
print model r value as loading it
2020-10-07 13:34:21 +02:00
erogol
48a40c4730
remove unused import
2020-10-06 11:32:24 +02:00
erogol
a2606fbc22
format utils
2020-10-06 11:02:54 +02:00
Eren Gölge
4873601694
Merge pull request #531 from WeberJulian/french-cleaners
...
Adding support for french cleaners
2020-09-30 15:30:50 +02:00
Edresson
99d5a0ac07
add Speaker Conditional GST support
2020-09-29 16:09:27 -03:00
Julian WEBER
ea7c2e15c0
Adding french abbreviations
2020-09-29 15:43:39 +02:00
Julian WEBER
54b4031391
Merge remote-tracking branch 'origin/dev' into french-cleaners
2020-09-29 14:24:51 +02:00
Julian WEBER
da134eeee4
Subjective improvements
2020-09-29 14:20:52 +02:00
Julian WEBER
b2817e9e93
Adding french cleaners
2020-09-29 14:20:24 +02:00
Eren Gölge
cf02ace5b7
Merge pull request #530 from mueller91/fix_split_dataset
...
fix: split_dataset
2020-09-28 12:42:40 +02:00
erogol
154f90bc44
format speaker encoder imports
2020-09-28 11:19:19 +02:00
erogol
e097bc6c5d
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
2020-09-28 11:15:32 +02:00
Eren Gölge
8e2dc79c3a
Merge pull request #526 from mueller91/dev
...
Fix: Check storage params only for speaker encoder
2020-09-28 11:15:23 +02:00
erogol
6a70c63f24
correct glow-tts loss
2020-09-27 03:28:42 +02:00
erogol
665f7ca714
linter fix
2020-09-24 12:57:54 +02:00
mueller91
227b9c8864
fix: split_dataset() runtime reduced from O(N * |items|) to O(N) where N is the size of the eval split (max 500)
...
I notice a significant speedup on the initial loading of large datasets such as common voice (from minutes to seconds)
2020-09-23 23:27:51 +02:00
mueller91
cfeeef7a7f
fix: broken imports and missing files after merging in latest commits from mozilla/dev into mueller91/dev.
...
speaker_encoder's config.json and visuals.py are missing in the current dev branch of MozillaTTS, and some imports are broken.
2020-09-22 20:10:41 +02:00
mueller91
1fe5eb054f
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
...
Conflicts:
TTS/bin/train_encoder.py
requirements.txt
2020-09-22 19:58:53 +02:00
mueller91
df4caec4b7
add: check_config for speaker_encoder
2020-09-22 19:52:09 +02:00
WeberJulian
3c212be5a8
fix: fixing the RenamingUnpickler fix
2020-09-22 17:36:05 +02:00
mueller91
0ea7f4e2bd
fix: make speaker encoder's storage parameters non-restriced
2020-09-22 10:39:40 +02:00
mueller91
7029452228
fix: make speaker encoder's storage parameters non-restriced
2020-09-22 10:31:42 +02:00
erogol
10258724d1
linter fixes
2020-09-22 03:54:16 +02:00
erogol
a6df617eb1
Merge branch 'glow-tts-amp-time_depth_conv' into dev
2020-09-21 14:23:45 +02:00
erogol
8150d5727e
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
2020-09-21 14:21:55 +02:00
erogol
e0b9fa887f
glow-tts modules added
2020-09-21 14:15:40 +02:00
erogol
e4c6386603
change import for normalization layer
2020-09-21 13:09:52 +02:00
mueller91
9b4aac94a8
fix: linter issues
2020-09-21 12:13:02 +02:00
erogol
c008003506
do not check sample rate as loading stats file for normalization to enable interpolation for different sample rate vocoder
2020-09-18 12:52:19 +02:00
mueller
6b0621c794
cleanup
2020-09-17 16:46:43 +02:00
mueller
a273b1a210
add: add random noise to dataset
2020-09-17 14:23:40 +02:00
mueller
e36a3067e4
add: save wavs instead feats to storage.
...
This is done in order to mitigate staleness when caching and loading from data storage
2020-09-17 14:14:30 +02:00
mueller
1511076fde
add: Configurable encoder dataset storage to reduce disk I/O
...
add: Averaged time for data loader to console and Tensorboard output
2020-09-17 12:29:38 +02:00
erogol
3660c57f1e
time seperable convolution encoder, huber loss for duration predictor
2020-09-17 03:10:58 +02:00
mueller
95d2906307
add: Mozilla Commonvoice, VoxCeleb1+2, LibriTTS to Speaker Encoder Training
2020-09-16 16:49:53 +02:00
mueller
c909ca3855
Improve runtime of __parse_items() from O(|speakers|*|items|) to O(|items|)
2020-09-16 15:55:55 +02:00
mueller
d733b90255
Improve runtime of __parse_items() from O(|speakers|*|items|) to O(|items|)
2020-09-16 15:09:02 +02:00
maxbachmann
60ce862113
use difflib for string matching
2020-09-14 23:55:34 +02:00
erogol
f1a75468c2
fix arguments
2020-09-12 04:00:25 +02:00
erogol
7c2c4d6f27
pass x_mask to layer norm
2020-09-12 03:41:37 +02:00
erogol
45fbc0d003
convolution encoder with GLU and res connections
2020-09-12 03:40:21 +02:00