Eren Gölge
|
21dd4d7960
|
fix load_config imports for Coqpit
|
2021-05-11 11:29:18 +02:00 |
Eren Gölge
|
c57f0b46bb
|
reintro use_gst for backwards compat
|
2021-05-11 11:29:18 +02:00 |
Eren Gölge
|
9ee70af9bb
|
code styling
|
2021-05-11 11:29:18 +02:00 |
Eren Gölge
|
7663bc63c1
|
add Coqpit configs for the TTS models
|
2021-05-11 11:29:17 +02:00 |
Eren Gölge
|
7227e8f1d2
|
update train_align_tts.py for coqpit
|
2021-05-11 11:29:17 +02:00 |
Eren Gölge
|
51a7e06945
|
glow_tts_config.py and train test on python
|
2021-05-11 11:29:17 +02:00 |
Eren Gölge
|
720fe13056
|
update glow_tts modules and training script for coqpit use
|
2021-05-11 11:29:17 +02:00 |
Eren Gölge
|
816e7ee698
|
remove default configs.json, replaced by Coqpit configs
|
2021-05-11 11:29:17 +02:00 |
Eren Gölge
|
647163397d
|
coqpit refactoring
|
2021-05-11 11:29:17 +02:00 |
Eren Gölge
|
eaa130e813
|
fix tacotron for coqpit
|
2021-05-11 11:29:17 +02:00 |
Eren Gölge
|
05d9543ed8
|
init GST module using gst config in Tacotron models
|
2021-05-11 11:29:17 +02:00 |
Eren Gölge
|
93a00373f6
|
move split_dataset
|
2021-05-11 11:29:17 +02:00 |
Eren Gölge
|
79d7215142
|
config refactor #5 WIP
|
2021-05-11 11:29:17 +02:00 |
Eren Gölge
|
dc50f5f0b0
|
config refactor #4 WIP
|
2021-05-11 11:28:35 +02:00 |
Eren Gölge
|
97bd5f9734
|
[ci skip] config update #3 WIP
|
2021-05-11 11:28:35 +02:00 |
Eren Gölge
|
a21c0b5585
|
config update 2 WIP
|
2021-05-11 11:28:35 +02:00 |
Eren Gölge
|
e092ae40dc
|
config update WIP
|
2021-05-11 11:28:35 +02:00 |
Adam Froghyar
|
7ddc885f37
|
deleted a line that broke GravesAttention
|
2021-05-10 15:42:59 +02:00 |
Eren Gölge
|
f7582107da
|
Merge pull request #453 from Edresson/dev
Script for spectrogram extraction using teacher forcing and Glow-TTS inference with MAS.
|
2021-05-06 17:53:28 +02:00 |
Eren Gölge
|
8cb27267a4
|
formatting
|
2021-05-03 14:26:35 +02:00 |
Eren Gölge
|
2f0716073e
|
enable multi-speaker CoquiTTS models for synthesize.py
|
2021-04-26 19:36:53 +02:00 |
Eren Gölge
|
b531fa699c
|
remove conflict noise
|
2021-04-26 15:27:52 +02:00 |
Eren Gölge
|
f37b488876
|
Merge branch 'speaker-manager' of https://github.com/coqui-ai/TTS into speaker-manager
|
2021-04-26 15:25:25 +02:00 |
Edresson
|
8228091f92
|
add script for extraction of tts spectrograms
|
2021-04-23 14:17:46 -03:00 |
Eren Gölge
|
4cf211348d
|
styling and linting
|
2021-04-23 18:04:37 +02:00 |
Eren Gölge
|
f69195739e
|
let speaker manager compute mean x_vector from multiple wav files
|
2021-04-23 18:04:37 +02:00 |
Eren Gölge
|
c80d21f311
|
load speaker_encoder_ap and compute x_vector directly from the input file in speaker manager
|
2021-04-23 18:04:37 +02:00 |
Eren Gölge
|
e97126314c
|
add ```unique``` argument to make_symbols to fix the incompat. issue of the
SC-Glow models
|
2021-04-23 18:04:37 +02:00 |
Eren Gölge
|
d08888e603
|
formatting speakers.py
|
2021-04-23 18:04:37 +02:00 |
Eren Gölge
|
df422223a3
|
initial SpeakerManager implementation
|
2021-04-23 18:04:37 +02:00 |
Eren Gölge
|
7a7aeb35f5
|
fix the glow-tts in setup_model
|
2021-04-23 18:04:37 +02:00 |
Eren Gölge
|
d42748082a
|
update argument name external_speaker_embedding_dim -> speaker_embedding_dim
add inference_noise_scale argument to glow-tts
|
2021-04-23 18:04:37 +02:00 |
Eren Gölge
|
99dc07a7dd
|
add ```unique``` param to keep scglow models compatible (there are duplicate symbols in the character set)
|
2021-04-23 18:04:37 +02:00 |
Eren Gölge
|
c955a12428
|
set the default layer size compatible with scglow
|
2021-04-23 18:04:37 +02:00 |
Eren Gölge
|
aadb2106ec
|
code styling
|
2021-04-23 18:04:37 +02:00 |
kirianguiller
|
7dccbfdcd5
|
handle multi speaker and gst in Synthesizer class
|
2021-04-23 18:04:37 +02:00 |
Eren Gölge
|
ef37633cb3
|
[ci skip] use prenet_dropout by default with Tacotron models
|
2021-04-22 12:38:55 +02:00 |
Eren Gölge
|
04b6881b66
|
add ```unique``` argument to make_symbols to fix the incompat. issue of the
SC-Glow models
|
2021-04-21 13:12:35 +02:00 |
Eren Gölge
|
790946faec
|
formatting speakers.py
|
2021-04-21 13:12:11 +02:00 |
Eren Gölge
|
ab313814de
|
initial SpeakerManager implementation
|
2021-04-21 13:11:46 +02:00 |
Eren Gölge
|
09890c7421
|
fix the glow-tts in setup_model
|
2021-04-21 13:10:40 +02:00 |
Eren Gölge
|
8764d02eb2
|
update argument name external_speaker_embedding_dim -> speaker_embedding_dim
add inference_noise_scale argument to glow-tts
|
2021-04-21 13:09:44 +02:00 |
Eren Gölge
|
d2fa8add1f
|
add ```unique``` param to keep scglow models compatible (there are duplicate symbols in the character set)
|
2021-04-16 19:40:13 +02:00 |
Eren Gölge
|
d9612a4351
|
set the default layer size compatible with scglow
|
2021-04-16 19:40:13 +02:00 |
Eren Gölge
|
47e356cb48
|
code styling
|
2021-04-16 16:01:40 +02:00 |
kirianguiller
|
48ae52a9a3
|
handle multi speaker and gst in Synthesizer class
|
2021-04-16 15:54:49 +02:00 |
Eren Gölge
|
9cc17be53a
|
formatting and a small bug fix in Tacotron model
|
2021-04-15 16:36:51 +02:00 |
Eren Gölge
|
3de5a89154
|
optionally enable prenet dropout at inference time for tacotron models
|
2021-04-13 13:24:56 +02:00 |
Eren Gölge
|
480e2f7888
|
docstring update and better handling make_symbols
|
2021-04-12 16:40:49 +02:00 |
Eren Gölge
|
b735076bb4
|
linter fixes
|
2021-04-12 13:14:11 +02:00 |