Edresson
77d85c6cc5
add softmaxproto loss and bug fix in data loader
2021-05-10 17:08:38 -03:00
chmodsss
607d5cf377
[ #480 ] Adding version variable
2021-05-10 19:46:34 +02:00
Adam Froghyar
7ddc885f37
deleted a line the broke GravesAttention
2021-05-10 15:42:59 +02:00
Edresson
78bad25f2b
update voxceleb download link
2021-05-07 23:45:15 -03:00
Eren Gölge
f7582107da
Merge pull request #453 from Edresson/dev
...
Script for spectrogram extraction using teacher forcing and Glow-TTS inference with MAS.
2021-05-06 17:53:28 +02:00
Edresson
501c8e0302
remove unused vars on extract tts spectrograms script
2021-05-04 19:04:13 -03:00
Eren Gölge
0325c58862
Merge pull request #468 from shaun95/patch-1
...
Update losses.py
2021-05-03 14:45:24 +02:00
Eren Gölge
8cb27267a4
formatting
2021-05-03 14:26:35 +02:00
Eren Gölge
87d674a038
bumpup librosa version to 0.8.0
2021-05-03 14:25:09 +02:00
shaun
7d0ec62bf1
Update losses.py
...
The block of code for use_l1_spec_loss is repeated which doubles the amount of L1 loss when enabled.
The weight for L1 loss in hifigan_ljspeech configutation will likely need to be doubled to compensate (l1_spec_loss_weight)
2021-05-02 14:14:24 +02:00
Edresson
3ecd556bbe
add unit test for extract tts spectrograms script
2021-05-01 13:41:56 -03:00
Edresson
446b1da936
create inference function
2021-04-29 18:18:37 -03:00
Eren Gölge
f02f0338c2
fix .models.json and add testing to check released models availability
2021-04-29 09:32:36 +02:00
Eren Gölge
fd95e9b8a4
[ci skip] Add sam models
2021-04-28 21:57:31 +02:00
Agrin Hilmkil
351d0ed6ae
Remove unnecessary fsspec usage
2021-04-28 11:21:08 +02:00
Agrin Hilmkil
167f86417e
Move dev, tf, notebook dependencies to extras
2021-04-28 11:20:06 +02:00
Eren Gölge
1235e54738
test for synthesize.py
2021-04-27 14:17:38 +02:00
Eren Gölge
4719414f2e
remove imports
2021-04-27 11:25:17 +02:00
Eren Gölge
add97cddc1
move function and remove import
2021-04-27 11:22:56 +02:00
Eren Gölge
734e6a515c
bug fix
2021-04-27 10:27:45 +02:00
Eren Gölge
6bdd81667e
place holders for sc-glow and hifigan models
2021-04-26 19:53:12 +02:00
Eren Gölge
2f0716073e
enable multi-speaker CoquiTTS models for synthesize.py
2021-04-26 19:36:53 +02:00
Eren Gölge
b531fa699c
remove conflicy noise
2021-04-26 15:27:52 +02:00
Eren Gölge
f37b488876
Merge branch 'speaker-manager' of https://github.com/coqui-ai/TTS into speaker-manager
2021-04-26 15:25:25 +02:00
Eren Gölge
b82daa5e86
style and linter fixes
2021-04-26 15:22:24 +02:00
Edresson
20e42a3381
add save audio option
2021-04-23 15:00:00 -03:00
Edresson
8228091f92
add script for extraction of tts spectrograms
2021-04-23 14:17:46 -03:00
Eren Gölge
4cf211348d
styling and linting
2021-04-23 18:04:37 +02:00
Eren Gölge
7eb0c60d2e
let synthesizer to pass speaker encoder file paths to speaker manager
2021-04-23 18:04:37 +02:00
Eren Gölge
f69195739e
let speaker manager compute mean x_vector from multiple wav files
2021-04-23 18:04:37 +02:00
Eren Gölge
179722e3a7
new arguments to synthesize.py for loading speaker encoder and speaker wavs
2021-04-23 18:04:37 +02:00
Eren Gölge
dfa415a8b8
small refactor in server.py
2021-04-23 18:04:37 +02:00
Eren Gölge
c80d21f311
load speaker_encoder_ap and compute x_vector directly from the input file in speaker manager
2021-04-23 18:04:37 +02:00
Eren Gölge
ad047c8195
html formatting, enable multi-speaker model on the server with a dropdown menu to select the speaker
2021-04-23 18:04:37 +02:00
Eren Gölge
f9f3d04d14
remove moved function
2021-04-23 18:04:37 +02:00
Eren Gölge
10c988ac8c
update server.py
2021-04-23 18:04:37 +02:00
Eren Gölge
6d0f5e0459
use SpeakerManager in Synthesizer
2021-04-23 18:04:37 +02:00
Eren Gölge
e97126314c
add ```unique``` argument to make_symbols to fix the incompat. issue of the
...
SC-Glow models
2021-04-23 18:04:37 +02:00
Eren Gölge
d08888e603
formating speakers.py
2021-04-23 18:04:37 +02:00
Eren Gölge
df422223a3
initial SpeakerManager implementation
2021-04-23 18:04:37 +02:00
Eren Gölge
7a7aeb35f5
fix the glow-tts in setup_model
2021-04-23 18:04:37 +02:00
Eren Gölge
d42748082a
update argument name external_speaker_embedding_dim -> speaker_embedding_dim
...
add inference_noise_scale argument to glow-tts
2021-04-23 18:04:37 +02:00
Eren Gölge
2da81f5bb6
add load_chekpoint to speaker encoder
2021-04-23 18:04:37 +02:00
Eren Gölge
1229ccbf07
update argument name in server.py
2021-04-23 18:04:37 +02:00
Eren Gölge
af2d36faeb
update synthesize.py for multi-speaker setting
2021-04-23 18:04:37 +02:00
Eren Gölge
99dc07a7dd
add ```unique``` param to keep scglow models compatible (they are duplicate symbols ins the character set)
2021-04-23 18:04:37 +02:00
Eren Gölge
c955a12428
set the default layer size compatible with scglow
2021-04-23 18:04:37 +02:00
Eren Gölge
3ace2440fa
fix a mistake from rebase
2021-04-23 18:04:37 +02:00
Eren Gölge
aadb2106ec
code styling
2021-04-23 18:04:37 +02:00
Eren Gölge
af7baa3387
refactoring to allow defining the speaker file externally
2021-04-23 18:04:37 +02:00
kirianguiller
7dccbfdcd5
handle multi speaker and gst in Synthetizer class
2021-04-23 18:04:37 +02:00
Edresson
d2b6326b8b
change optimizer initialization for compatibility with Hifi-GAN official implementation
2021-04-23 07:54:39 -03:00
WeberJulian
4205284f92
Change name of the functions
2021-04-23 10:09:55 +02:00
WeberJulian
a26498181b
Change back the default value
2021-04-22 16:10:17 +02:00
Julian Weber
355e1f47ab
fix dumb mistake
2021-04-22 15:50:29 +02:00
Julian Weber
c125b71f36
fix windows support
2021-04-22 15:14:24 +02:00
Jörg Thalheim
f5fd7f78d4
server: also listen to ipv6
...
The [::] address will listen to both ipv4/ipv6 addresses.
2021-04-22 12:38:55 +02:00
Eren Gölge
ef37633cb3
[ci skip] use prenet_dropout by default with Tacotron models
2021-04-22 12:38:55 +02:00
Eren Gölge
e1d960da9e
use SpeakerManager in Synthesizer
2021-04-21 13:13:27 +02:00
Eren Gölge
04b6881b66
add ```unique``` argument to make_symbols to fix the incompat. issue of the
...
SC-Glow models
2021-04-21 13:12:35 +02:00
Eren Gölge
790946faec
formating speakers.py
2021-04-21 13:12:11 +02:00
Eren Gölge
ab313814de
initial SpeakerManager implementation
2021-04-21 13:11:46 +02:00
Eren Gölge
09890c7421
fix the glow-tts in setup_model
2021-04-21 13:10:40 +02:00
Eren Gölge
8764d02eb2
update argument name external_speaker_embedding_dim -> speaker_embedding_dim
...
add inference_noise_scale argument to glow-tts
2021-04-21 13:09:44 +02:00
Eren Gölge
8b40720977
add load_chekpoint to speaker encoder
2021-04-21 13:09:04 +02:00
Eren Gölge
37cad38c27
update argument name in server.py
2021-04-21 13:08:45 +02:00
Eren Gölge
9bccee9da8
update synthesize.py for multi-speaker setting
2021-04-21 13:08:25 +02:00
Eren Gölge
d2fa8add1f
add ```unique``` param to keep scglow models compatible (they are duplicate symbols ins the character set)
2021-04-16 19:40:13 +02:00
Eren Gölge
d9612a4351
set the default layer size compatible with scglow
2021-04-16 19:40:13 +02:00
Eren Gölge
1038fd420d
fix a mistake from rebase
2021-04-16 19:39:47 +02:00
Eren Gölge
47e356cb48
code styling
2021-04-16 16:01:40 +02:00
Eren Gölge
25328aad00
refactoring to allow defining the speaker file externally
2021-04-16 15:59:57 +02:00
kirianguiller
48ae52a9a3
handle multi speaker and gst in Synthetizer class
2021-04-16 15:54:49 +02:00
Eren Gölge
a53958ae3a
fix urls for the new models
2021-04-15 17:05:00 +02:00
Eren Gölge
9cc17be53a
formatting and a small bug fix in Tacotron model
2021-04-15 16:36:51 +02:00
Eren Gölge
1ad838bc83
add newly released models under .model.json
2021-04-15 16:06:10 +02:00
Eren Gölge
7cada1a949
remove noise
2021-04-15 15:30:45 +02:00
Eren Gölge
d60a8d7211
show the real waveform on TB too for GAN vocoder training.
2021-04-15 15:30:06 +02:00
Eren Gölge
5fbe926429
change the default TTS model to TacotronDDC
2021-04-15 15:29:44 +02:00
Eren Gölge
3de5a89154
optionally enable prenet dropout at inference time for tacotron models
2021-04-13 13:24:56 +02:00
Eren Gölge
28a2fed8a3
update hifigan in .model.json
2021-04-12 16:48:05 +02:00
Eren Gölge
abaf36861a
aligntts model .model.json placeholder
2021-04-12 16:43:52 +02:00
Eren Gölge
480e2f7888
docstring update and better handling make_symbols
2021-04-12 16:40:49 +02:00
Eren Gölge
b735076bb4
linter fixes
2021-04-12 13:14:11 +02:00
Eren Gölge
b11d1cb845
small fixes
2021-04-12 12:40:55 +02:00
Eren Gölge
a7f6045644
Merge branch 'reformat' into hifigan-reformat
2021-04-12 12:00:17 +02:00
Eren Gölge
f519012dea
reformatting and styling
2021-04-12 11:47:39 +02:00
Eren Gölge
9011dddf77
tacotron DDC placeholder in models.json
2021-04-12 04:06:27 +02:00
Eren Gölge
d295d5de97
remove torch.no_grad from TorchSTFT
2021-04-10 19:43:57 +02:00
Eren Gölge
5b70da2e3f
restore schedulers only if training is continuing a previous training
...
inherit nn.Module for TorchSTFT
2021-04-09 19:31:28 +02:00
Eren Gölge
2c71c6d8cd
[ci skip]update gan vocoder configs to reflect the recent changes
2021-04-09 17:15:32 +02:00
Eren Gölge
2b529f60c8
update default hifigan config
2021-04-09 11:40:06 +02:00
Eren Gölge
105e0b4d62
vocoder gan training fixes
2021-04-09 11:38:04 +02:00
Eren Gölge
87ee6ceb57
style update #3
2021-04-09 01:17:15 +02:00
Eren Gölge
18d9ec8036
format with black
2021-04-09 00:54:59 +02:00
Eren Gölge
e5b9607bc3
isort all imports
2021-04-09 00:45:20 +02:00
Eren Gölge
0e79fa86ad
format with black and pylint 2.7.3
2021-04-09 00:38:08 +02:00
Eren Gölge
cd69da4868
linter fixes #2
2021-04-08 16:57:46 +02:00
Eren Gölge
4d3e1e9d9a
linter fix
2021-04-08 14:57:46 +02:00
Eren Gölge
53f54898bc
small fixes
2021-04-08 14:22:47 +02:00
Eren Gölge
006b1d3aaa
bug fix
2021-04-08 13:17:45 +02:00
Eren Gölge
3f0993aebe
remove junk
2021-04-08 12:17:02 +02:00
Eren Gölge
0ee0458309
remove redundant imports
2021-04-08 11:29:15 +02:00
Eren Gölge
773f1db6fa
refactor HifiGAN discriminator
2021-04-08 11:28:30 +02:00
Eren Gölge
15f362d5b1
formatting
2021-04-08 11:28:30 +02:00
Eren Gölge
aee24b0704
set different seed in gan_dataset when it is multi-workers
2021-04-08 11:28:30 +02:00
Eren Gölge
6ee211c137
remove stft params causing warning
2021-04-08 11:28:30 +02:00
Eren Gölge
4998ece8d8
allow configuration of optimziers from the config file
2021-04-08 11:28:30 +02:00
Eren Gölge
8daf407652
cache empty
2021-04-08 11:28:30 +02:00
Eren Gölge
3fb78c004a
move scheduler updates to the end of the epoch
2021-04-08 11:28:30 +02:00
Eren Gölge
2a872c98aa
don't call os.exit as it leaves the process resources standing
2021-04-08 11:27:40 +02:00
Eren Gölge
7cecd2fb2e
add hifigan D
2021-04-08 11:27:40 +02:00
Eren Gölge
13dca6e6b6
revert some of Hifigan generator updates
2021-04-08 11:27:40 +02:00
Eren Gölge
02bc776c35
prevenet grad in TorchSTFT
2021-04-08 11:27:40 +02:00
Eren Gölge
cf44624df8
more docstring
2021-04-08 11:27:40 +02:00
Eren Gölge
d95b1458e8
Linter fixes and docstrings for HiFiGAN
2021-04-08 11:27:40 +02:00
Eren Gölge
bd7a1c177b
fix #419
2021-04-08 11:26:41 +02:00
Eren Gölge
7726dfca99
change the upper bound in sound normalization
2021-04-08 11:26:01 +02:00
Eren Gölge
57f6bd1afa
make using different samples for G and D networks optional
2021-04-08 11:26:01 +02:00
Eren Gölge
67f8248492
placeholder for finetuned sam hifigan model
2021-04-08 11:25:29 +02:00
Eren Gölge
241e968df1
load_checkpoint for hifigan and no_grad for inference
2021-04-08 11:25:29 +02:00
Eren Gölge
de3a04f104
some commeting for Generator loss and check if the argument is defines in the config file
2021-04-08 11:25:29 +02:00
Eren Gölge
ff07c5f5e3
update TorchSTFT to enable melspec
2021-04-08 11:25:29 +02:00
Eren Gölge
4a5b1d4ac2
update hifigan config
2021-04-08 11:24:21 +02:00
Eren Gölge
e0e3b12b26
pass all parameters explicity to _istft
2021-04-08 11:23:20 +02:00
Eren Gölge
f0e76ee135
initial models.json entry for universal hifigan
2021-04-08 11:23:20 +02:00
Eren Gölge
d57f416957
small fixes
2021-04-08 11:22:30 +02:00
Eren Gölge
8c9e1c9e58
hifigan implementation update
2021-04-08 11:21:43 +02:00
Eren Gölge
a14d7bc5db
hifigan config update
2021-04-08 11:20:33 +02:00
Eren Gölge
8d4fd79cd7
update hifigan config
2021-04-08 11:20:33 +02:00
rishikksh20
e656e8b108
Remove select size bug
2021-04-08 11:20:33 +02:00
rishikksh20
b533474e3b
Remove minor bugs and make code trainable
2021-04-08 11:20:33 +02:00
rishikksh20
ef6ff4e95c
Add Exponential LR scheduler check
2021-04-08 11:20:33 +02:00
rishikksh20
1535777f64
1) Add ExponentialLR
2021-04-08 11:18:36 +02:00
rishikksh20
c20a6b1185
* Format the model definition
...
* Update code and integrate training code
2021-04-08 11:18:36 +02:00
rishikksh20
39b5845810
1) Add hifigan json files
...
2) Rename MPD disc
3) Re-format remove weight norm generator
2021-04-08 11:14:39 +02:00
rishikksh20
7b7c5d635f
1) Combine MSD with Multi-Period disc
...
2) Add remove weight norm layer on Generator
2021-04-08 11:14:39 +02:00
rishikksh20
4493feb95c
Add HiFi-GAN v1 generator and discriminator classes
2021-04-08 11:14:39 +02:00
Eren Gölge
c86c559349
docstring and optional padding in TorchSTFT
2021-04-07 12:36:15 +02:00
Eren Gölge
f890454de3
linter fixes
2021-04-07 12:36:03 +02:00
Eren Gölge
9782d9ea5d
[ci skip] implement #418
2021-04-06 16:24:50 +02:00
Eren Gölge
f46a275b22
update docstring 2
2021-04-06 16:24:50 +02:00
Eren Gölge
ec94ff3691
update docstring
2021-04-06 16:24:50 +02:00
Eren Gölge
2048095e9a
audio.py fix
2021-04-06 16:24:50 +02:00
Eren Gölge
e0b3008c31
allow choosing the log function used for amptodb conversion
2021-04-06 16:24:50 +02:00
Eren Gölge
44b4cb5ba5
DCA comment
2021-04-06 16:24:50 +02:00
Eren Gölge
b86e7fb2e8
pad short samples when loading precomputed features in vocoder trainign
2021-04-06 16:24:50 +02:00
Eren Gölge
6ad4eba678
gan vocoder train fix in case of restoring models wiht no scheduler is defined
2021-04-06 16:24:50 +02:00
Eren Gölge
e3ccfe37ea
add DE more urls
2021-04-02 14:54:41 +02:00
Eren Gölge
e84f120a04
sam-accenture model preprocessor
2021-04-01 03:41:41 +02:00
Eren Gölge
e3c052382b
fix loading always best_model when continue
2021-04-01 03:41:15 +02:00
Eren Gölge
48ea20e69f
example aligntts config
2021-03-30 14:41:00 +02:00
Eren Gölge
b4c2cf80f2
fix eval iter
2021-03-30 14:39:16 +02:00
Eren Gölge
a3a840fd78
linter fixes
2021-03-30 14:39:16 +02:00
Eren Gölge
6b2e13bf62
compute normalized logp using torch primitives
2021-03-30 14:39:16 +02:00
Eren Gölge
7a382a5c2b
stowed aligntts commit and small refactoring with feed_forward layers
2021-03-30 14:39:16 +02:00
Eren Gölge
d542a50818
fix losses for alignTTS
2021-03-30 14:39:16 +02:00
Eren Gölge
18cc7b95ec
update l1 and huber to mse loss
2021-03-30 14:39:16 +02:00
Eren Gölge
896d33ed49
update losses to hande alingtts phases
2021-03-30 14:39:16 +02:00
Eren Gölge
aec0b78aff
duration predictor fix 2
2021-03-30 14:39:16 +02:00
Eren Gölge
07269e639b
fix duration predictor in AlignTTS
2021-03-30 14:39:16 +02:00
Eren Gölge
c2d29e5cd4
FFTransformer encoder for aligntts
2021-03-30 14:39:16 +02:00
Eren Gölge
460a2d3e26
FFTransformer Decoder for AlignTTS
2021-03-30 14:39:16 +02:00
Eren Gölge
844e8e0ed4
adapt align_tts and model name handling
2021-03-30 14:39:16 +02:00
Eren Gölge
aa29f5b199
aligntts loss
2021-03-30 14:39:16 +02:00
Eren Gölge
a831468cab
align tts MDN layer
2021-03-30 14:39:16 +02:00
Eren Gölge
4396f8e2da
continue refactoring
2021-03-30 14:39:16 +02:00
Eren Gölge
892c3c3623
use torch for AngleProtoLoss
2021-03-30 14:39:16 +02:00
Eren Gölge
2b3e12ea49
correct imports after refactoring, add AlignTTS (old SSMAS) and some formatting
2021-03-30 14:39:16 +02:00
Eren Gölge
ecb6b0d6ad
rename GlowTtts as GlowTTS
2021-03-30 14:39:16 +02:00
Eren Gölge
e8cf8cb00e
restructure TF tacotron files
2021-03-30 14:39:16 +02:00
Eren Gölge
1ac99ce0d0
if git is not available set git has 'unknown'
2021-03-30 14:39:16 +02:00
Eren Gölge
d9c405f0c3
create feedforward folder for SS layers
2021-03-30 14:39:16 +02:00
Eren Gölge
a8cf1ae6b4
fix wavenet running with no input mask
2021-03-30 14:39:16 +02:00
Eren Gölge
1c1949d348
utf-8 encoding for certain preprocessors
2021-03-30 14:39:16 +02:00
Eren Gölge
ca2f22cdd7
linter fix
2021-03-30 14:36:12 +02:00
Eren Gölge
d0dcd7d1b8
let the user define outpu.wav file path fix #393
2021-03-30 14:24:31 +02:00
Eren Gölge
25654233d5
[ci skip]initial commit for the new DE models and stale ot update
2021-03-29 03:23:57 +02:00
Guy Elsmore-Paddock
15459627cc
Fix `UnicodeEncodeError` on Windows Platforms
...
Prevents the following error from appearing when running training on Windows platforms:
```
UnicodeEncodeError: 'charmap' codec can't encode characters in position: character maps to <undefined>
```
2021-03-20 17:30:00 -04:00
Eren Gölge
3947750dd9
Merge branch 'dev' of https://github.com/coqui-ai/TTS into dev
2021-03-18 14:09:47 +01:00
WeberJulian
4a9d2e4309
fix french_cleaners
2021-03-18 13:35:29 +01:00
WeberJulian
596ea2c98a
Add resample script
2021-03-18 13:33:37 +01:00
Eren Gölge
6e68637f48
bug fix
2021-03-18 13:33:23 +01:00
Eren Gölge
f3e5ddfaaf
bug fix in preprocessor
2021-03-18 13:33:23 +01:00
Eren Gölge
aeb4f82233
bug fix
2021-03-18 13:33:23 +01:00
Eren Gölge
0514330869
fix mozilla/TTS#685
2021-03-18 13:33:23 +01:00
Eren Gölge
f06603a0db
force utf8
2021-03-18 13:33:23 +01:00
Eren Gölge
32e8b56c45
linter fix
2021-03-18 13:33:23 +01:00
Eren Gölge
65533f33e9
fix #374
2021-03-18 13:33:00 +01:00
Eren Gölge
d790d2fccb
linter fix
2021-03-18 13:33:00 +01:00
WeberJulian
af96080e17
fix linter issues
2021-03-18 13:33:00 +01:00
WeberJulian
bf04383e74
fix french_cleaners
2021-03-18 13:33:00 +01:00
WeberJulian
f6cd8e0ecc
test case
2021-03-18 13:33:00 +01:00
WeberJulian
e954e45e57
linter + test
2021-03-18 13:33:00 +01:00
WeberJulian
e598977f3d
Using path.join instead of concat
2021-03-18 13:33:00 +01:00
WeberJulian
c5ef2de73f
Add resample script
2021-03-18 13:33:00 +01:00
Eren Gölge
2690ab2ee5
bug fix
2021-03-16 19:15:28 +01:00
Eren Gölge
4c1aed4a9c
bug fix in preprocessor
2021-03-16 19:13:32 +01:00
Eren Gölge
01e35e06c4
bug fix
2021-03-16 19:13:32 +01:00
Eren Gölge
aa8bb815a7
fix mozilla/TTS#685
2021-03-16 19:13:32 +01:00
Eren Gölge
a8c348ffb2
force utf8
2021-03-16 19:13:32 +01:00
Eren Gölge
bf0caba0bc
linter fix
2021-03-16 19:13:32 +01:00
Eren Gölge
babc94f63f
fix #374
2021-03-16 19:13:32 +01:00
Eren Gölge
bdfd1f8a89
linter fix
2021-03-16 19:13:32 +01:00
WeberJulian
11e25a7125
fix linter issues
2021-03-16 19:13:01 +01:00
WeberJulian
1574d8dd39
fix french_cleaners
2021-03-16 19:13:01 +01:00
WeberJulian
b94373afb8
test case
2021-03-16 19:13:01 +01:00
WeberJulian
93fdc0729c
linter + test
2021-03-16 19:13:01 +01:00
WeberJulian
17f197f51e
Using path.join instead of concat
2021-03-16 19:13:01 +01:00
WeberJulian
d6749f030f
Add resample script
2021-03-16 19:13:01 +01:00
Eren Gölge
838ebd6ad5
add the missing russian model
2021-03-16 18:38:35 +01:00
Eren Gölge
5c657715f2
fix #382
2021-03-16 17:31:48 +01:00
Eren Gölge
38a29ce1c9
move all models to github rls
2021-03-10 18:19:32 +01:00
Eren Gölge
e5bb317242
fix model manager
2021-03-10 17:01:19 +01:00
Eren Gölge
d260fb03a2
fix handling scale_stats.npy for models downloaded from Github rls
2021-03-10 16:40:30 +01:00
Eren Gölge
4aba4e5b1e
linter fx
2021-03-10 15:33:11 +01:00
Eren Gölge
6c932c8503
print the desc if required parameters are not provided
2021-03-10 15:19:00 +01:00
Eren Gölge
9e84c8a623
do not copy scale_stats if exist in the output folder
2021-03-10 15:13:55 +01:00
Eren Gölge
7782034e7e
fix #369
2021-03-10 15:13:21 +01:00
Eren Gölge
4337e9ff87
pad_mode in torch_stft
2021-03-10 14:41:00 +01:00
Eren Gölge
599149a7e5
downloading models from github releases
2021-03-10 11:09:01 +01:00
Eren Gölge
fc19411ac6
update some of the models to github releases
2021-03-10 11:08:15 +01:00
Eren Gölge
19bb9ba851
fix tts endpoint using list-models argument
2021-03-09 14:06:09 +01:00
Eren Gölge
43379eecef
fix the nl model and add the vocoder
2021-03-09 14:05:56 +01:00
r-dh
8a4dcd152f
Add Dutch model
2021-03-09 13:22:19 +01:00
Eren Gölge
94805236fb
Merge branch 'dev' of https://github.com/coqui-ai/TTS into dev
2021-03-08 15:21:06 +01:00
Eren Gölge
5dcc4be560
rebrand demo server
2021-03-08 14:51:04 +01:00
Eren Gölge
947e3d6a93
rename test
2021-03-08 14:50:54 +01:00
Eren Gölge
a519ed52f2
deprecate embedding models to the wheel
2021-03-08 14:06:15 +01:00
Eren Gölge
c16ad38930
update server rEADME
2021-03-08 14:05:59 +01:00
Eren Gölge
594d8d8f09
linter fixes
2021-03-08 11:22:59 +01:00
Eren Gölge
00b5090974
linter fix
2021-03-08 11:05:30 +01:00
Eren Gölge
e15734c3fc
linter fix
2021-03-08 05:29:43 +01:00
Eren Gölge
9a48ba3821
a ton of linter updates
2021-03-08 05:06:54 +01:00
Eren Gölge
e03a426378
bug fix
2021-03-08 02:59:48 +01:00
kirianguiller
628afe5cb0
remove gst handling in synthetizer.py class
2021-03-08 02:59:48 +01:00
kirianguiller
557239db7f
remove re.Match typing in '_number_replace()'
2021-03-08 02:59:48 +01:00
kirianguiller
9ab07f94e2
modify according to PR reviews
2021-03-08 02:59:48 +01:00
kirianguiller
42ba30eb8f
<add> Chinese mandarin implementation (tacotron2)
2021-03-08 02:59:24 +01:00
kirianguiller
49665783a6
remove gst handling in synthetizer.py class
2021-03-08 02:57:11 +01:00
kirianguiller
e85658ac2b
remove re.Match typing in '_number_replace()'
2021-03-08 02:57:11 +01:00
kirianguiller
0d4525322c
modify according to PR reviews
2021-03-08 02:57:11 +01:00
kirianguiller
e6fd118cf8
<add> Chinese mandarin implementation (tacotron2)
2021-03-08 02:57:11 +01:00
Eren Gölge
e3102e753c
enable backward compat for loading the best model
2021-03-08 02:57:11 +01:00
gerazov
2451a813a2
refactored keep_all_best
2021-03-08 02:57:11 +01:00
gerazov
8cefa76bae
reformated docstrings in arguments.py
2021-03-08 02:57:11 +01:00
gerazov
2db40457e8
brushed up printing model load path and best loss path
2021-03-08 02:56:36 +01:00
gerazov
f2e474cd37
loading last checkpoint/best_model works, deleting last best models options added, loading last best_loss added
2021-03-08 02:56:36 +01:00
Eren Gölge
4111df6769
Docstrings for audioprocessor
2021-03-08 02:54:47 +01:00
Eren Gölge
2ca74b8ab3
add RUSLAN dataset preprocessor
2021-03-08 02:54:47 +01:00
Eren Gölge
8993120634
do not test server and modelManager until fixing #657
2021-03-08 02:54:47 +01:00
Adonis Pujols
89b7f01534
add encoding="utf-8"
2021-03-08 02:54:47 +01:00
Eren Gölge
ffceccb021
fix #655
2021-03-08 02:54:47 +01:00
Eren Gölge
534c341f16
linter update
2021-03-08 02:54:47 +01:00
Eren Gölge
0e1e60bef0
remove redundancy
2021-03-08 02:54:47 +01:00
Eren Gölge
93a83c0068
Update TTS/utils/arguments.py
...
Co-authored-by: Jörg Thalheim <Mic92@users.noreply.github.com>
2021-03-08 02:54:47 +01:00
Eren Gölge
39fbf2fe84
Update TTS/bin/find_unique_chars.py
...
Co-authored-by: Jörg Thalheim <Mic92@users.noreply.github.com>
2021-03-08 02:54:47 +01:00
Eren Gölge
ee71eb4eb7
linter fixes
2021-03-08 02:54:47 +01:00
Eren Gölge
55fc50b26d
update test_text_processing for espeak-ng
2021-03-08 02:54:47 +01:00
Eren Gölge
5b8a6736a7
remove _phoneme_punctuations
2021-03-08 02:54:47 +01:00
Eren Gölge
194f82de51
save default model chars to the training config file
2021-03-08 02:54:47 +01:00
Eren Gölge
62a8eba3b2
parse_characters function
2021-03-08 02:54:47 +01:00
Eren Gölge
0b33acdcca
enable saving model characters in io.py
2021-03-08 02:54:47 +01:00
Eren Gölge
f9fe167537
docstring update
2021-03-08 02:54:47 +01:00
Eren Gölge
62aeacbdd1
save used model characters to the checkpoints
2021-03-08 02:54:47 +01:00
Eren Gölge
e06c93fe81
model_manager tests
2021-03-08 02:54:47 +01:00
Eren Gölge
fe41084eb3
author , license and contact info in .model.json
2021-03-08 02:54:47 +01:00
nmstoker
ae0d54ddae
Updating models list to include EK1 TTS/vocoder
2021-03-08 02:54:47 +01:00
Eren Gölge
c6702b5b9f
find unique characters in a dataset
2021-03-08 02:54:47 +01:00
Eren Gölge
dad3565379
use default vocoders in server.pu
2021-03-08 02:54:47 +01:00
Eren Gölge
d30608ab17
set an output_sample_rate in synthesizer and use it for writing the wav
...
file
2021-03-08 02:54:47 +01:00
Eren Gölge
3ccb015cd8
return the json entry of the downloaded model
2021-03-08 02:54:47 +01:00
Eren Gölge
00e0933f43
save_wav with a custom sampling rate
2021-03-08 02:54:47 +01:00
Eren Gölge
9fefc79f0c
fix make_symbols
2021-03-08 02:54:47 +01:00
Eren Gölge
8955333e9d
use default vocoder in synthesize.py
2021-03-08 02:54:47 +01:00
Eren Gölge
23b282f600
define default vocoders
2021-03-08 02:54:47 +01:00
Eren Gölge
6bd8485d10
bug fix
2021-03-08 02:54:47 +01:00
Eren Gölge
5f1018abee
fix spelling of a def argument and parse phonemes from config.json if
...
use_phonemes is True
2021-03-08 02:54:47 +01:00
Eren Gölge
1c1abb8a9b
docstring update
2021-03-08 02:54:47 +01:00
Eren Gölge
6cd642c2e1
add missing phonemes to test_config.json
2021-03-08 02:54:47 +01:00
Eren Gölge
43b951018e
fix the default vocoder name
2021-03-08 02:54:47 +01:00
Adonis Pujols
81b145c321
spelling error. should be multiband not mulitband
2021-03-08 02:54:47 +01:00
Adonis Pujols
59b1b13e07
spelling error. should be multiband not mulitband
2021-03-08 02:54:47 +01:00
Eren Gölge
ee58ff2d38
add russian phoneme char
2021-03-08 02:54:47 +01:00
Eren Gölge
29d928d531
css10 dataset preprocessor
2021-03-08 02:54:47 +01:00
Eren Gölge
49771f2541
download github model releases by model manager
2021-03-08 02:54:21 +01:00
Eren Gölge
3c961370e7
linter fixes
2021-03-08 02:54:21 +01:00
gerazov
2b5cb24db7
final final fixes
2021-03-08 02:54:21 +01:00
gerazov
b3c5cc2cdc
final fixes
2021-03-08 02:54:21 +01:00
gerazov
10d5a63d49
updated to current dev
2021-03-08 02:54:21 +01:00
gerazov
6f06e31541
changed train scripts
2021-03-08 02:54:21 +01:00
gerazov
2daca15802
restructured arg parsing and processing to utils
2021-03-08 02:54:21 +01:00
Eren Gölge
2fbe4a1b8a
fix gdown
2021-03-08 02:54:21 +01:00
Branislav Gerazov
ed56944c4a
improve robustness of defining wavernn in config file
2021-03-08 02:54:21 +01:00
Branislav Gerazov
5e2bc8c99f
update wavernn test config, delete cap=True
2021-03-08 02:54:21 +01:00
Branislav Gerazov
b1e3160884
waveRNN fix
2021-03-08 02:54:21 +01:00
Eren Gölge
08581deb61
linter updates
2021-03-08 02:53:02 +01:00
Thorsten Mueller
167901813d
Ups. Added missing ,
2021-03-08 02:53:02 +01:00
Eren Gölge
93a6bdfd6c
linter fixes and version updates for deps
2021-03-08 02:51:10 +01:00
Eren Gölge
a30a231566
unpin cython version and commentout pyworld in audio.py causing dep
...
issues
2021-03-08 02:50:15 +01:00
Thorsten Mueller
3eb00e8d93
Set out_path to be required param.
2021-03-08 02:49:15 +01:00
Alexander Korolev
ace430d5e6
fix device mismatch wavegrad training
...
this should fixe the device mismatch as seen here https://github.com/mozilla/TTS/issues/622#issue-789802916
2021-03-08 02:49:15 +01:00
Eren Gölge
83143fbe39
fix #638
2021-03-08 02:48:31 +01:00
Eren Gölge
30c3bef3f9
move hubconf
2021-03-08 02:48:31 +01:00
Eren Gölge
bbea6a0884
hubconf.py and load .models.json from the defualt location by mange.py
2021-03-08 02:48:31 +01:00
Eren Gölge
90d4f08d6c
reorder imports
2021-03-08 02:48:31 +01:00
Eren Gölge
db231c83fc
distill import statement, check python version in setup.py
2021-03-08 02:48:31 +01:00
Thorsten Mueller
915ec1faac
Added info if model already downloaded in --list_models
2021-03-08 02:48:31 +01:00
Alexander Korolev
b4bc5f6eb1
update fixed stopnet_pos_weight parameter
...
config parameter c.stopnet_pos_weight has currently no effect as it is not used.
2021-03-08 02:48:31 +01:00
Eren Gölge
534e3c67c6
README update, set default models for synthesize.py and server.py. Disable verbose for ap init.
2021-03-08 02:48:31 +01:00
kirianguiller
7f36d91131
update chinese model
2021-03-01 14:55:05 +01:00
Eren Gölge
547bfc4ce9
bug fix
2021-02-18 18:24:03 +00:00
Eren Gölge
adaeec57ec
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
2021-02-18 17:21:09 +00:00
Eren Gölge
5b70c8ba4f
enable backward compat for loading the best model
2021-02-18 17:20:36 +00:00
Eren Gölge
e4f81d6856
Merge pull request #654 from kirianguiller/chinese-implementation
...
Chinese implementation (merge into dev)
2021-02-18 17:15:32 +01:00
kirianguiller
22a6bbfa80
remove gst handling in synthetizer.py class
2021-02-17 20:53:56 +01:00
kirianguiller
3911b87e54
remove re.Match typing in '_number_replace()'
2021-02-17 20:53:56 +01:00
kirianguiller
fb0655d1e7
modify according to PR reviews
2021-02-17 20:53:56 +01:00
kirianguiller
c4c7bc1b88
<add> Chinese mandarin implementation (tacotron2)
2021-02-17 20:53:56 +01:00
Eren Gölge
d0454461de
Merge branch 'pr/gerazov/650-2' into dev
2021-02-17 13:40:45 +00:00
Eren Gölge
a8ea0ea6ce
Docstrings for audioprocessor
2021-02-17 13:35:41 +00:00
Eren Gölge
f6e6314910
add RUSLAN dataset preprocessor
2021-02-17 13:35:23 +00:00
Eren Gölge
ce0c5eccbd
do not test server and modelManager until fixing #657
2021-02-17 00:35:43 +00:00
gerazov
61c88beb94
refactored keep_all_best
2021-02-15 18:40:17 +01:00
Eren Gölge
eb543c027e
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
2021-02-15 17:06:40 +00:00
Eren Gölge
8a106e0527
fix #655
2021-02-15 17:06:03 +00:00
Eren Gölge
216945e653
Merge pull request #647 from adonispujols/patch-1
...
Easy Fix for #454 (which was somehow deleted?)
2021-02-15 13:17:17 +01:00
Eren Gölge
06a3ba2fe2
linter update
2021-02-15 12:10:19 +00:00
Eren Gölge
7f58fa365b
Merge branch 'save_characters' into dev
2021-02-15 12:07:28 +00:00
Eren Gölge
ff218e2370
remove redundancy
2021-02-15 12:07:02 +00:00
Eren Gölge
80af8ca5e1
Update TTS/utils/arguments.py
...
Co-authored-by: Jörg Thalheim <Mic92@users.noreply.github.com>
2021-02-15 13:03:59 +01:00
Eren Gölge
3b6ce04332
Update TTS/bin/find_unique_chars.py
...
Co-authored-by: Jörg Thalheim <Mic92@users.noreply.github.com>
2021-02-15 13:02:29 +01:00
Eren Gölge
dc3596dad4
model_manager tests
2021-02-15 11:29:22 +00:00
Eren Gölge
77e630348e
author , license and contact info in .model.json
2021-02-15 11:02:21 +00:00
Eren Gölge
e1bc823e44
Merge branch 'pr/nmstoker/652' into dev
2021-02-15 10:57:12 +00:00
nmstoker
33bcdc6ff8
Updating models list to include EK1 TTS/vocoder
2021-02-14 23:44:05 +00:00
Eren Gölge
420901f4c2
linter fixes
2021-02-12 14:41:17 +00:00
Eren Gölge
4244096ccb
update test_text_processing for espeak-ng
2021-02-12 14:07:26 +00:00
Eren Gölge
b28c724c04
remove _phoneme_punctuations
2021-02-12 12:10:57 +00:00
Eren Gölge
7ab527d17e
save default model chars to the training config file
2021-02-12 12:06:46 +00:00
Eren Gölge
593cedee14
parse_characters function
2021-02-12 12:05:56 +00:00
Eren Gölge
2abfff17f9
enable saving model characters in io.py
2021-02-12 12:04:41 +00:00
Eren Gölge
918f007a11
docstring update
2021-02-12 12:04:07 +00:00
Eren Gölge
e774f68aee
save used model characters to the checkpoints
2021-02-12 12:03:42 +00:00
gerazov
0e78e31dbf
reformated docstrings in arguments.py
2021-02-12 11:36:01 +01:00
gerazov
310d18325e
brushed up printing model load path and best loss path
2021-02-12 10:55:45 +01:00
Eren Gölge
8b6fd76ad2
find unique characters in a dataset
2021-02-12 09:46:11 +00:00
gerazov
af46727517
loading last checkpoint/best_model works, deleting last best models options added, loading last best_loss added
2021-02-12 02:12:00 +01:00
Eren Gölge
a1e595790d
use default vocoders in server.pu
2021-02-11 15:31:39 +00:00
Eren Gölge
8aa6a0decb
set an output_sample_rate in synthesizer and use it for writing the wav
...
file
2021-02-11 15:28:07 +00:00
Eren Gölge
0c52d27d65
return the json entry of the downloaded model
2021-02-11 15:27:41 +00:00
Eren Gölge
1649ad3431
save_wav with a custom sampling rate
2021-02-11 15:27:20 +00:00
Eren Gölge
43f54d2dce
fix make_symbols
2021-02-11 15:26:52 +00:00
Eren Gölge
0657b38111
use default vocoder in synthesize.py
2021-02-11 15:26:17 +00:00
Eren Gölge
2043a9b5f5
define default vocoders
2021-02-11 15:25:55 +00:00
Eren Gölge
ff27690ca7
bug fix
2021-02-11 13:43:29 +00:00
Eren Gölge
bc131208be
fix spelling of a def argument and parse phonemes from config.json if
...
use_phonemes is True
2021-02-11 13:04:47 +00:00
Eren Gölge
f1799dbd60
docstring update
2021-02-11 11:25:31 +00:00
Eren Gölge
3baec4ea96
add missing phonemes to test_config.json
2021-02-11 11:14:39 +00:00
Eren Gölge
a3d1e65b34
Merge branch 'pr/adonispujols/646' into dev
2021-02-11 10:37:29 +00:00
Eren Gölge
3c2e13ca5c
fix the default vocoder name
2021-02-11 10:36:52 +00:00
Adonis Pujols
48011a8b58
add encoding="utf-8"
2021-02-11 05:26:06 -05:00
Adonis Pujols
b29a7e9645
spelling error. should be multiband not mulitband
2021-02-11 04:49:28 -05:00
Adonis Pujols
6c824a6629
spelling error. should be multiband not mulitband
2021-02-11 04:48:53 -05:00
Eren Gölge
b08b8ca2a1
add russian phoneme char
2021-02-10 13:30:59 +00:00
Eren Gölge
9cad435288
css10 dataset preprocessor
2021-02-09 15:11:26 +00:00
Eren Gölge
cea5e517f2
download github model releases by model manager
2021-02-09 14:24:14 +00:00
Eren Gölge
c619859a3f
linter fixes
2021-02-09 11:43:17 +00:00
gerazov
e507373b55
final final fixes
2021-02-06 23:08:47 +01:00
gerazov
ad17dc9e76
final fixes
2021-02-06 23:05:01 +01:00
gerazov
8fdd08ea15
updated to current dev
2021-02-06 22:59:52 +01:00
gerazov
2705d27b28
changed train scripts
2021-02-06 22:29:30 +01:00
gerazov
4f8f274d6e
restructured arg parsing and processing to utils
2021-02-06 22:25:56 +01:00
Eren Gölge
e7e880f514
fix gdown
2021-02-05 13:42:24 +00:00
Eren Gölge
f4f6290eec
Merge branch 'pr/gerazov/641' into dev
2021-02-05 13:14:49 +00:00
Eren Gölge
d49757faaa
linter updates
2021-02-05 13:10:43 +00:00
Branislav Gerazov
f063545325
improve robustness of defining wavernn in config file
2021-02-05 13:26:33 +01:00
Branislav Gerazov
24ffa9e9f6
update wavernn test config, delete cap=True
2021-02-05 13:10:02 +01:00
Branislav Gerazov
cb77aef36c
waveRNN fix
2021-02-04 09:52:03 +01:00
Thorsten Mueller
d74866cb8e
Merge remote-tracking branch 'upstream/dev' into dev
...
Fix for circleci error mentioned in PR https://github.com/mozilla/TTS/pull/637
2021-02-02 19:40:18 +01:00
Thorsten Mueller
a82152eef3
Ups. Added missing ,
2021-02-02 19:29:16 +01:00
Thorsten Mueller
4cb4fcf02c
Set out_path to be required param.
2021-02-02 19:29:16 +01:00
Thorsten Mueller
c75ea74914
Added info if model already downloaded in --list_models
2021-02-02 19:29:16 +01:00
Eren Gölge
2edab4b3f9
disable pw in audio that causes numpy issue
2021-02-01 17:05:03 +00:00
Eren Gölge
5c46543765
linter fixes and version updates for deps
2021-02-01 13:18:56 +00:00
Eren Gölge
8774e37444
unpin cython version and commentout pyworld in audio.py causing dep
...
issues
2021-02-01 11:34:05 +00:00
Eren Gölge
5beed0ddcd
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
2021-02-01 11:27:14 +00:00
Eren Gölge
c7407571fa
fix #638
2021-02-01 10:05:55 +00:00
Eren Gölge
dfdac1def9
Merge pull request #636 from thorstenMueller/dev
...
Set out_path to be required param in compute_statistics.py.
2021-01-29 18:08:31 +01:00
Thorsten Mueller
44c4a49745
Set out_path to be required param.
2021-01-29 17:23:38 +01:00
Eren Gölge
536366dc0a
Merge pull request #635 from SanjaESC/patch-1
...
fix device mismatch wavegrad training
2021-01-29 16:42:25 +01:00
Eren Gölge
0354b6f35e
move hubconf
2021-01-29 15:28:32 +00:00
Eren Gölge
aa5f24608a
hubconf.py and load .models.json from the defualt location by mange.py
2021-01-29 15:28:26 +00:00
Alexander Korolev
e81ebec7a8
fix device mismatch wavegrad training
...
this should fixe the device mismatch as seen here https://github.com/mozilla/TTS/issues/622#issue-789802916
2021-01-29 15:18:59 +01:00
Eren Gölge
a926aa106d
reorder imports
2021-01-29 01:36:21 +01:00
Eren Gölge
8a6eee7fec
distill import statement, check python version in setup.py
2021-01-28 17:04:08 +01:00
Eren Gölge
131a163c95
Merge pull request #628 from thorstenMueller/dev
...
Added info if model already downloaded in --list_models
2021-01-28 13:10:06 +01:00
Alexander Korolev
ca28e05ed7
update fixed stopnet_pos_weight parameter
...
config parameter c.stopnet_pos_weight has currently no effect as it is not used.
2021-01-27 16:33:25 +01:00
Thorsten Mueller
ccbd542eb0
Added info if model already downloaded in --list_models
2021-01-27 16:19:02 +01:00
Eren Gölge
25c86ca715
README update, set default models for synthesize.py and server.py. Disable verbose for ap init.
2021-01-27 11:47:03 +01:00
Eren Gölge
4f32e77006
platform indep. way to fetch user data folder
2021-01-26 17:32:43 +01:00
Eren Gölge
0117c811a9
add a button to index.html to see the model details
2021-01-26 12:33:27 +01:00
Eren Gölge
a3adcaccdb
Merge branch 'pr/thorstenMueller/623' into dev
2021-01-26 12:19:39 +01:00
Eren Gölge
b464cab9b8
setup.py update and pylint fixes
2021-01-26 02:57:50 +01:00
Eren Gölge
660d61aeeb
maximum_path_numpy and CYTHON adabtable import
2021-01-26 02:57:07 +01:00
Eren Gölge
877f0bbfba
manifest.in update
2021-01-26 02:56:55 +01:00
Eren Gölge
82e029529e
fix manifest file
2021-01-25 13:27:54 +01:00
Eren Gölge
57b668fd86
fixing dome pypi issues
2021-01-25 13:06:12 +01:00
Eren Gölge
60c1bb93d9
fixes before first PyPI release
2021-01-25 11:16:20 +01:00
Thorsten Mueller
afb7db2a1d
Removed unneeded check and removed specific taco2 model name.
2021-01-22 16:22:50 +01:00
Eren Gölge
fae10309e4
Merge pull request #624 from SanjaESC/patch-3
...
Update train_tacotron.py
2021-01-22 13:29:09 +01:00
Eren Gölge
5ee73c2bae
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
2021-01-22 13:26:27 +01:00
Eren Gölge
5fb611ef40
static image for server index.html
2021-01-22 03:01:53 +01:00
Eren Gölge
ca647cf222
Model Manager to download released models
2021-01-22 02:35:43 +01:00
Eren Gölge
ca8ad9c21e
rename audio._normalize to audio.normalize
2021-01-22 02:33:19 +01:00
Eren Gölge
c990b3a59c
linter fixes and test fixes
2021-01-22 02:32:35 +01:00
Alexander Korolev
f251dc8c0e
Update train_tacotron.py
...
When attempting to fine-tune a model with "prenet_type": "bn" that was originally trained with "prenet_type": "original", a RuntimeError is thrown that stops the training.
By catching the RuntimeError, the required layers can be partially restored and the training will continue without any problems.
2021-01-21 21:16:30 +01:00
Eren Gölge
0ab2eb2664
use synthesizer in both synthesize.py and server.pu
2021-01-21 15:54:33 +01:00
Eren Gölge
9addfabc43
wavernn load_checkpoint function
2021-01-21 15:31:13 +01:00
Eren Gölge
50fee59a2c
update synthesizer.py for better interfacing to different models
2021-01-21 15:30:49 +01:00
Eren Gölge
007a4d7139
remove 3rd paty wavernn support from server.py and add ModelManager arguments
2021-01-21 15:30:16 +01:00
Eren Gölge
6b6e989fd2
update server readme
2021-01-21 15:29:46 +01:00
Thorsten Mueller
e414582be6
Added option for server ui details page.
2021-01-20 21:56:40 +01:00
root
1bc8fbbd3c
set eval mode whe nloading models
2021-01-20 02:14:18 +00:00
root
5bd7238153
interpolate spectrogram in vocoder generic utils for matching sample
...
rates
2021-01-20 02:13:01 +00:00
root
ca3743539a
load_checkpoint func for vocoder models
2021-01-20 02:12:29 +00:00
root
ea39715305
read_json_with_comments
2021-01-20 02:11:55 +00:00
root
563bc921d8
optional verbose for audio.py init
2021-01-20 02:11:24 +00:00
root
1faf565e3a
add load_checkpoint func to tts models
2021-01-20 02:10:56 +00:00
root
5c87753e88
glow-tts fix for saving inverse weight
2021-01-20 02:09:42 +00:00
root
3d30dae8f3
.models.json and synthesize.py update for interfacing with model manager
2021-01-20 02:08:58 +00:00
gerazov
b2b4828f17
set requires_grad=False
2021-01-16 19:46:04 +01:00
gerazov
c96f7a2614
TorchSTFT to device fix
2021-01-16 12:21:16 +01:00
root
7beaacc55b
update compute_attention_masks.py
2021-01-13 10:03:57 +00:00
erogol
428c224b88
commet update
2021-01-12 17:31:04 +01:00
erogol
bbc8d665a1
move attention layers to a sperate file
2021-01-11 17:27:30 +01:00
erogol
79c841ccd3
mass refactoring and update
2021-01-11 17:26:58 +01:00
erogol
1d961d6f8a
cladd renaming
2021-01-11 17:26:11 +01:00
erogol
c0a2aa68d3
formatting
2021-01-11 17:25:39 +01:00
erogol
b206162d11
more docstrings
2021-01-11 17:25:04 +01:00
erogol
6e9043c5d2
rename convbnblocks and handle none mask
2021-01-11 17:22:34 +01:00
erogol
921fa5db92
remove attentions from common layers
2021-01-11 15:06:42 +01:00
erogol
cc2b1e043d
docstrings for common layers
2021-01-11 15:06:12 +01:00
erogol
a6f40fef2e
stage missing files
2021-01-08 16:02:56 +01:00
erogol
d382d759b3
small fixes and test fixes
2021-01-08 15:48:40 +01:00
erogol
a6259041d3
docstring for speedyspeech
2021-01-07 14:35:22 +01:00
erogol
de2a542f83
glow-tts bug fix
2021-01-07 13:40:32 +01:00
erogol
14d33662ea
input shapes for tacotron models
2021-01-06 13:19:40 +01:00
erogol
f288e9a260
docstrings for taoctron models
2021-01-06 13:19:40 +01:00
erogol
5a45af48f1
fix
2021-01-06 13:19:40 +01:00
erogol
e7fad928e7
doc strings for the all glow-tts layers
2021-01-06 13:19:40 +01:00
erogol
d3b7284be4
glow-tts comments and refactoring
2021-01-06 13:19:40 +01:00
erogol
7586fbc4de
SS refactoring
2021-01-06 13:19:40 +01:00
erogol
e82d31b6ac
glow ttss refactoring
2021-01-06 13:19:40 +01:00
erogol
29f4329d7f
update glow-tts layers and add some comments
2021-01-06 13:19:40 +01:00
erogol
29cf933831
update SS condif
2021-01-06 13:19:40 +01:00
erogol
228ada04b5
update glow-tts ljspeech config
2021-01-06 13:19:40 +01:00
erogol
f352b3534c
make noise augmentation optional
2021-01-06 13:19:40 +01:00
erogol
71c382be14
copy model scale stats file with config.json to the trianing folder, fixed for model inits
2021-01-06 13:19:40 +01:00
erogol
aa40fe1aa0
SS model refacotring for multi speaker
2021-01-06 13:19:40 +01:00
erogol
eb555855e4
small fixes
2021-01-06 13:19:40 +01:00
erogol
5901a00576
argument rename
2021-01-06 13:19:40 +01:00
erogol
4ef083f0f1
select decoder type for SS
2021-01-06 13:19:40 +01:00
erogol
d5a0190c4b
update copy_config_file to copy_model_files
2021-01-06 13:19:40 +01:00
erogol
8971c59b2d
plot eval alignment score right
2021-01-06 13:19:40 +01:00
erogol
3fa408a5ea
change order BN + ReLU to ReLU + BN for SS
2021-01-06 13:19:40 +01:00
erogol
ac5c9217d1
positional encoding masking for SS
2021-01-06 13:19:40 +01:00
erogol
fede46e96e
pylint and test fixes
2021-01-06 13:19:40 +01:00
erogol
2abe3df153
compute_attention_masks.py
2021-01-06 13:19:40 +01:00
erogol
cf869e8922
add SS files
2021-01-06 13:19:40 +01:00
erogol
e4680e1b99
plot float16 alignments
2021-01-06 13:19:40 +01:00
erogol
13c6665c92
inference for SS
2021-01-06 13:19:40 +01:00
erogol
30788960a8
check SS model parameters
2021-01-06 13:19:40 +01:00
erogol
5cae2c5742
make optional position encoding for speedyspeech
2021-01-06 13:19:40 +01:00
erogol
dc4a16d62e
speedy speehc losses
2021-01-06 13:19:40 +01:00
erogol
d62cac7252
fix glow-tts prenet bug fix
2021-01-06 13:19:40 +01:00
erogol
a1d5a9ddda
config update tyo use noise for augmentation
2021-01-06 13:19:40 +01:00
erogol
022af74d74
update prompt msg
2021-01-06 13:19:40 +01:00
erogol
57ef53bef3
update argumnet check for non tacotron models
2021-01-06 13:19:40 +01:00
erogol
27a75de15f
update processors for loading attention maps
2021-01-06 13:19:40 +01:00
erogol
fa6907fa0e
update glow-tts parameters and fix rel-attn-win size
2021-01-06 13:19:40 +01:00
erogol
7b20d8cbd3
implement residual BN convolution and add it as an alternative encoder for glow-tts. also generic layers to layers/generic
2021-01-06 13:19:40 +01:00
erogol
973754d893
fix for init glow-tts
2021-01-06 13:19:40 +01:00
erogol
f81af4eb0d
config update disable guided attention for dynamic conv attention
2021-01-06 13:19:40 +01:00
erogol
29b17c0808
bug fix for gradual training
2021-01-06 13:19:40 +01:00
erogol
5c50e104d6
config update
2021-01-06 13:19:40 +01:00
erogol
6478d552dc
tacotron training bug fix
2021-01-06 13:19:40 +01:00
erogol
1dd086577a
tacotron training bug fix
2021-01-06 13:18:41 +01:00
erogol
fa20638083
config for ljspeech dynamic conv attention
2021-01-06 13:18:41 +01:00
erogol
070146e143
add monotonic dynamic convolution attention
2021-01-06 13:18:41 +01:00
erogol
18392bc13a
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
2021-01-06 13:18:08 +01:00
Thorsten Mueller
f673f8f74d
Added support for npy output from tune-wavegrad
2020-12-19 22:51:22 +01:00
Thorsten Mueller
2aa0354b44
Fix for 'NoneType' object has no attribute 'to'
2020-12-19 22:37:03 +01:00
Thorsten Mueller
28a64221ea
Improve robostness on cpu / gpu model mix
2020-12-19 22:23:28 +01:00
erogol
8293751a38
remove mozilla from server page
2020-12-17 12:28:28 +01:00
erogol
639fa29261
update speaker id casting for glow-tts
2020-12-14 16:58:47 +01:00
erogol
999120ecdf
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
2020-12-12 18:50:14 +01:00
erogol
f611e6ac01
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
2020-12-12 18:47:59 +01:00
Jörg Thalheim
62fd4ca70d
inflect negative numbers correctly
2020-12-10 16:47:51 +01:00
Jörg Thalheim
6646682650
cleaners: expand english time
2020-12-10 14:53:20 +01:00
Jörg Thalheim
76138687d3
expand more currencies
2020-12-10 14:53:20 +01:00
erogol
a2859b7ddc
update config args checks
2020-12-10 13:52:57 +01:00
erogol
788cd6f902
fix multi-speaker glow-tts inference
2020-12-10 02:05:48 +01:00
erogol
3d5066e2b8
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
2020-12-10 00:31:03 +01:00
erogol
92cc9630d7
fix glow-tts synthesis for DPP
2020-12-10 00:30:34 +01:00
Eren Gölge
2473b2dc62
Merge pull request #559 from krzim/patch-1
...
Fix import to grab the encoder model save function
2020-12-10 00:19:32 +01:00
erogol
53679b706d
glow-tts distributed fix
2020-12-09 23:39:09 +01:00
erogol
62bc171db5
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
2020-12-09 15:46:57 +01:00
erogol
df180148e9
use noise augmentation in TTSDataset
2020-12-09 15:46:25 +01:00
Thorsten Mueller
e39628ce2f
Limit filenames to 10 chars
2020-12-08 18:44:19 +01:00
erogol
06612ce305
test fixes
2020-12-07 15:57:34 +01:00
erogol
0252a07fa6
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
2020-12-07 11:31:55 +01:00
erogol
482e725752
sync torch calls before logging training results
2020-12-07 11:30:19 +01:00
erogol
7505c0ba27
muliprocess phoneme computation
2020-12-07 11:29:41 +01:00
erogol
20c86489d7
make static methods for faster multiprocess call
2020-12-07 11:29:10 +01:00
erogol
affe1c1138
setup training scripts for computing phonemes before training optionally. And define data_loaders before starting training and re-use them instead of re-define for every train and eval calls. This is to enable better instance filtering based on input length.
2020-12-07 11:26:57 +01:00
Alexander Korolev
f42ca2b73f
Update wavegrad.py
...
This should fix the issue https://github.com/mozilla/TTS/issues/581
2020-12-04 16:43:39 +01:00
erogol
7c3cdced1a
make speaker_mapping a global variable to prevent reload. Fix glow-tts training
2020-12-01 03:23:25 +01:00
Thorsten Mueller
06a389bc08
Added option for saving raw spectograms
2020-11-27 15:49:55 +01:00
erogol
a757b203bc
fix longer phoneme seqs
2020-11-26 15:05:03 +01:00
erogol
7b0a93d2f8
fix
2020-11-26 11:44:52 +01:00
erogol
0c6f7e4c77
resample audio if flag set true
2020-11-26 11:30:48 +01:00
erogol
f6c96b0ac2
Merge branch 'dev'
2020-11-25 15:29:06 +01:00
erogol
e3b7157146
remove contextlib
2020-11-25 15:22:01 +01:00
erogol
e3eda159d1
wavegrad_dataset update
2020-11-25 14:50:50 +01:00
erogol
a1e4ee18f9
convert float16 to float32 for plotting spectrograms
2020-11-25 14:50:28 +01:00
erogol
7541d2ecaa
return eval split optional
2020-11-25 14:50:09 +01:00
erogol
4b92ac0f92
tune_wavegrad update
2020-11-25 14:49:48 +01:00
erogol
d8c1b5b73d
print max lengths in tacotron training
2020-11-25 14:49:07 +01:00
erogol
1229554c42
use native amp
2020-11-25 14:48:54 +01:00
erogol
8a820930c6
compute_embedding update
2020-11-25 14:46:08 +01:00
erogol
aa2b31a1b0
use 'enabled' argument to control autocast
2020-11-17 14:22:01 +01:00
erogol
d9d04d892b
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
2020-11-17 14:17:24 +01:00
erogol
8b0e0846a3
temporary travis check
2020-11-17 14:17:03 +01:00
Qingping Hou
b0b97d636f
speed up metafile build for voxceleb
2020-11-14 23:45:17 -08:00
erogol
a2a142dc39
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
2020-11-14 13:02:19 +01:00
erogol
c65712426a
change noise scheduling for wavegrad. Compute beta values externally to enable better flexibility
2020-11-14 13:01:10 +01:00
erogol
5a59467f34
scaler fix for wavegrad and wavernn. Save and load scaler
2020-11-14 13:00:35 +01:00
erogol
d8511efa8f
use native amp for tacotron training
2020-11-14 12:59:28 +01:00
Qingping Hou
0cc3650ef6
support loading config in yaml
2020-11-14 00:13:53 -08:00
erogol
6cc464ead6
fix ton of tesnting bugs
2020-11-12 16:33:29 +01:00
erogol
25551c4634
change wavernn generate to inference
2020-11-12 12:52:52 +01:00
erogol
9b0f441945
argument for returning no eval split
2020-11-12 12:52:27 +01:00
erogol
a7aefd5c50
use pytorch amp for mixed precision training for Tacotron
2020-11-12 12:51:56 +01:00
erogol
67e2b664e5
compute embeddings and create speakers.json
2020-11-12 12:51:17 +01:00
erogol
f8fd300b3e
bug fix
2020-11-10 12:53:39 +01:00
erogol
016d3503da
compute embeddings with speaker encoder
2020-11-10 12:51:02 +01:00
erogol
21364331d2
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
2020-11-09 13:31:12 +01:00
erogol
c76a617072
linter updates
2020-11-09 13:18:35 +01:00
erogol
ea976b0543
python compat update for contextlib
2020-11-06 13:34:11 +01:00
erogol
c80225544e
tune wavegrad to fine the best noise schedule for inferece
2020-11-06 13:04:46 +01:00
erogol
d94782a076
reset the way ga_loss is stored in return_dict
2020-11-02 13:18:56 +01:00
erogol
a108d0ee81
check nan loss in glow-tts loss
2020-11-02 13:12:19 +01:00
erogol
b8ac9aba9d
check against NaN loss in tacotron_loss
2020-11-02 12:44:41 +01:00
erogol
ef04d7fae7
bug fix for wavernn training
2020-10-30 14:08:41 +01:00
erogol
a44ef58aea
wavegrad weight norm refactoring
2020-10-30 13:23:24 +01:00
erogol
183fe56d95
Merge branch 'ssim_loss' into dev
2020-10-29 23:49:09 +01:00
krzim
2202e171c5
Fix import to grab the encoder model save function
...
I saw that this was recently changed but I'm not sure if it should have been. This is the correct function given the arguments provided to it in the train loop.
2020-10-29 18:03:11 -04:00
erogol
73581cd94c
renaming train scripts and updating tests
2020-10-29 16:50:07 +01:00
erogol
39c71ee8a9
wavegrad refactoring, fixing tests for glow-tts and wavegrad
2020-10-29 15:47:15 +01:00
erogol
946a0c0fb9
bug fixes for single speaker glow-tts, enable torch based amp. Make amp optional for wavegrad. Bug fixes for synthesis setup for glow-tts
2020-10-29 15:45:50 +01:00
erogol
14c2381207
weight norm and torch based amp training for wavegrad
2020-10-29 12:31:43 +01:00
erogol
b76a0be97a
wavegrad model and layers refactoring
2020-10-29 12:31:43 +01:00
erogol
dc2825dfb2
wavegrad dataset update
2020-10-29 12:31:43 +01:00
erogol
5b5b9fcfdd
wavegrad config updates
2020-10-29 12:31:43 +01:00
erogol
c8a4c771a8
train wavegrad updates
2020-10-29 12:31:43 +01:00
erogol
670f44aa18
enable compute stats by vocoder config
2020-10-29 12:31:43 +01:00
erogol
f79bbbbd00
use Adam for wavegras instead of RAdam
2020-10-29 12:31:43 +01:00
erogol
7bcdb7ac35
wavegrad updates
2020-10-29 12:31:43 +01:00
erogol
a1582a0e12
fix distributed training for train_* scripts
2020-10-29 12:31:43 +01:00
erogol
193b81b273
add universal_fullband_melgan config
2020-10-29 12:30:37 +01:00
erogol
e02cd6a220
initial wavegrad layers model and trainig script
2020-10-29 12:30:37 +01:00
erogol
ac57eea928
add wavegrad to vocoder generators
2020-10-29 12:30:37 +01:00
erogol
e723b99888
handle distributed model as saving
2020-10-29 12:30:37 +01:00
Eren Gölge
26c18b61c9
Merge pull request #553 from Edresson/dev
...
bug fix in the inference with GlowTTS
2020-10-28 18:49:31 +01:00
erogol
fdaed45f58
optional loss masking for stoptoken predictor
2020-10-28 18:40:54 +01:00
erogol
e49cc3bbcd
bug fix
2020-10-28 18:34:34 +01:00
erogol
59e1cf99d0
config update and ssim implementation
2020-10-28 18:30:00 +01:00
erogol
9cef923d99
ssim loss for tacotron models
2020-10-28 15:24:18 +01:00
erogol
9d0ae2bfb4
wavernn dataloader handling for short samples and mixed precision training
2020-10-28 12:31:01 +01:00
Edresson
f01502a9db
bug fix in glowTTS sythesize
2020-10-27 16:30:16 -03:00
Eren Gölge
f4b8170bd1
Merge pull request #545 from Edresson/dev
...
GlowTTS zeroshot TTS support
2020-10-27 15:23:41 +01:00
erogol
a6f564c8c8
pylint fixes
2020-10-27 12:35:10 +01:00
erogol
0becef4b58
small updates
2020-10-27 12:17:38 +01:00
sanjaesc
2ee47e9568
fix pylint once again
2020-10-27 12:17:38 +01:00
sanjaesc
1e646135ca
add model params to config
2020-10-27 12:17:38 +01:00
sanjaesc
bef3f2020b
compute audio feat on dataload
2020-10-27 12:17:38 +01:00
sanjaesc
7c72562fe7
fix travis + pylint tests
2020-10-27 12:17:38 +01:00
sanjaesc
91e5f8b63d
added to device cpu/gpu + formatting
2020-10-27 12:17:38 +01:00
sanjaesc
016a77fcf2
fix formatting + pylint
2020-10-27 12:17:38 +01:00
erogol
8de7c13708
fix no loss masking loss computation
2020-10-27 12:17:38 +01:00
sanjaesc
e8294cb9db
fixing pylint errors
2020-10-27 12:17:38 +01:00
sanjaesc
878b7c373e
added feature preprocessing if not set in config
2020-10-27 12:17:38 +01:00
sanjaesc
e495e03ea1
some minor changes to wavernn
2020-10-27 12:17:38 +01:00
Alex K
9c3c7ce2f8
wavernn stuff...
2020-10-27 12:17:38 +01:00
Alex K
6378fa2b07
add initial wavernn support
2020-10-27 12:17:38 +01:00
Edresson
89e9bfe3a2
add text processing blank token test
2020-10-26 17:41:23 -03:00
Edresson
d9540a5857
add blank token in sequence for encrease glowtts results
2020-10-25 15:08:28 -03:00
Edresson
fbea058c59
add parse speakers function
2020-10-24 16:10:05 -03:00
Edresson
07345099ee
GlowTTS zero-shot TTS Support
2020-10-24 15:58:39 -03:00
Alexander Korolev
47d74ced1c
Update losses.py
...
Seems like in the latest dev merge, this change was reverted. Any specific reason for this?
Without it the problem as stated here https://github.com/mozilla/TTS/issues/473 occurs.
2020-10-23 14:15:01 +02:00
ayush-1506
2a3559f02b
Fix readme and config file
2020-10-21 13:43:49 +05:30
Edresson
b7f9ebd32b
add check arguments for GlowTTS and multispeaker training bug fix
2020-10-19 17:17:58 -03:00
erogol
c2c4126a18
remove merge conflicts
2020-10-08 01:35:27 +02:00
erogol
c5074cfd8e
general purpose distribute.py
2020-10-08 01:30:42 +02:00
erogol
6f0654f9a8
differential spectral loss
2020-10-08 01:30:42 +02:00
erogol
e0d4b88877
config update
2020-10-08 01:29:30 +02:00
erogol
4e93f90108
bug fix
2020-10-08 01:29:30 +02:00
erogol
bb9b70ee27
differential spectral loss and loss weight settings
2020-10-08 01:29:30 +02:00
erogol
e1eab1ce4b
print model r value as loading it
2020-10-07 13:34:21 +02:00
erogol
48a40c4730
remove unused import
2020-10-06 11:32:24 +02:00
erogol
a2606fbc22
format utils
2020-10-06 11:02:54 +02:00
Eren Gölge
4873601694
Merge pull request #531 from WeberJulian/french-cleaners
...
Adding support for french cleaners
2020-09-30 15:30:50 +02:00
Edresson
99d5a0ac07
add Speaker Conditional GST support
2020-09-29 16:09:27 -03:00
Julian WEBER
ea7c2e15c0
Adding french abbreviations
2020-09-29 15:43:39 +02:00
Julian WEBER
54b4031391
Merge remote-tracking branch 'origin/dev' into french-cleaners
2020-09-29 14:24:51 +02:00
Julian WEBER
da134eeee4
Subjective improvements
2020-09-29 14:20:52 +02:00
Julian WEBER
b2817e9e93
Adding french cleaners
2020-09-29 14:20:24 +02:00
Eren Gölge
cf02ace5b7
Merge pull request #530 from mueller91/fix_split_dataset
...
fix: split_dataset
2020-09-28 12:42:40 +02:00
erogol
154f90bc44
format speaker encoder imports
2020-09-28 11:19:19 +02:00
erogol
e097bc6c5d
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
2020-09-28 11:15:32 +02:00
Eren Gölge
8e2dc79c3a
Merge pull request #526 from mueller91/dev
...
Fix: Check storage params only for speaker encoder
2020-09-28 11:15:23 +02:00
erogol
6a70c63f24
correct glow-tts loss
2020-09-27 03:28:42 +02:00
erogol
665f7ca714
linter fix
2020-09-24 12:57:54 +02:00
mueller91
227b9c8864
fix: split_dataset() runtime reduced from O(N * |items|) to O(N) where N is the size of the eval split (max 500)
...
I notice a significant speedup on the initial loading of large datasets such as common voice (from minutes to seconds)
2020-09-23 23:27:51 +02:00
mueller91
cfeeef7a7f
fix: broken imports and missing files after merging in latest commits from mozilla/dev into mueller91/dev.
...
speaker_encoder's config.json and visuals.py are missing in the current dev branch of MozillaTTS, and some imports are broken.
2020-09-22 20:10:41 +02:00
mueller91
1fe5eb054f
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
...
Conflicts:
TTS/bin/train_encoder.py
requirements.txt
2020-09-22 19:58:53 +02:00
mueller91
df4caec4b7
add: check_config for speaker_encoder
2020-09-22 19:52:09 +02:00
WeberJulian
3c212be5a8
fix: fixing the RenamingUnpickler fix
2020-09-22 17:36:05 +02:00
mueller91
0ea7f4e2bd
fix: make speaker encoder's storage parameters non-restriced
2020-09-22 10:39:40 +02:00
mueller91
7029452228
fix: make speaker encoder's storage parameters non-restriced
2020-09-22 10:31:42 +02:00
erogol
10258724d1
linter fixes
2020-09-22 03:54:16 +02:00
erogol
a6df617eb1
Merge branch 'glow-tts-amp-time_depth_conv' into dev
2020-09-21 14:23:45 +02:00
erogol
8150d5727e
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
2020-09-21 14:21:55 +02:00
erogol
e0b9fa887f
glow-tts modules added
2020-09-21 14:15:40 +02:00
erogol
e4c6386603
change import for normalization layer
2020-09-21 13:09:52 +02:00
mueller91
9b4aac94a8
fix: linter issues
2020-09-21 12:13:02 +02:00
erogol
c008003506
do not check sample rate as loading stats file for normalization to enable interpolation for different sample rate vocoder
2020-09-18 12:52:19 +02:00
mueller
6b0621c794
cleanup
2020-09-17 16:46:43 +02:00
mueller
a273b1a210
add: add random noise to dataset
2020-09-17 14:23:40 +02:00
mueller
e36a3067e4
add: save wavs instead feats to storage.
...
This is done in order to mitigate staleness when caching and loading from data storage
2020-09-17 14:14:30 +02:00
mueller
1511076fde
add: Configurable encoder dataset storage to reduce disk I/O
...
add: Averaged time for data loader to console and Tensorboard output
2020-09-17 12:29:38 +02:00
erogol
3660c57f1e
time seperable convolution encoder, huber loss for duration predictor
2020-09-17 03:10:58 +02:00
mueller
95d2906307
add: Mozilla Commonvoice, VoxCeleb1+2, LibriTTS to Speaker Encoder Training
2020-09-16 16:49:53 +02:00
mueller
c909ca3855
Improve runtime of __parse_items() from O(|speakers|*|items|) to O(|items|)
2020-09-16 15:55:55 +02:00
mueller
d733b90255
Improve runtime of __parse_items() from O(|speakers|*|items|) to O(|items|)
2020-09-16 15:09:02 +02:00
maxbachmann
60ce862113
use difflib for string matching
2020-09-14 23:55:34 +02:00
erogol
f1a75468c2
fix arguments
2020-09-12 04:00:25 +02:00
erogol
7c2c4d6f27
pass x_mask to layer norm
2020-09-12 03:41:37 +02:00
erogol
45fbc0d003
convolution encoder with GLU and res connections
2020-09-12 03:40:21 +02:00