Edresson
77d85c6cc5
add softmaxproto loss and bug fix in data loader
2021-05-10 17:08:38 -03:00
Edresson
78bad25f2b
update voxceleb download link
2021-05-07 23:45:15 -03:00
Eren Gölge
f7582107da
Merge pull request #453 from Edresson/dev
Script for spectrogram extraction using teacher forcing and Glow-TTS inference with MAS.
2021-05-06 17:53:28 +02:00
Edresson
501c8e0302
remove unused vars on extract tts spectrograms script
2021-05-04 19:04:13 -03:00
Eren Gölge
0325c58862
Merge pull request #468 from shaun95/patch-1
Update losses.py
2021-05-03 14:45:24 +02:00
Eren Gölge
8cb27267a4
formatting
2021-05-03 14:26:35 +02:00
Eren Gölge
87d674a038
bump up librosa version to 0.8.0
2021-05-03 14:25:09 +02:00
shaun
7d0ec62bf1
Update losses.py
The block of code for use_l1_spec_loss is repeated, which doubles the amount of L1 loss when enabled.
The weight for L1 loss in the hifigan_ljspeech configuration will likely need to be doubled to compensate (l1_spec_loss_weight).
2021-05-02 14:14:24 +02:00
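The reasoning in the "Update losses.py" commit message can be illustrated with a minimal sketch. It is not the code from losses.py: `l1` and `generator_loss` are hypothetical stand-ins, only the L1 spectrogram term is modelled, and the weight value is arbitrary. It shows that applying the L1 term twice is equivalent to doubling `l1_spec_loss_weight`, which is why the configured weight would need to be doubled to reproduce the old behaviour once the repeated block is removed.

```python
def l1(pred, target):
    """Mean absolute error between two equal-length sequences."""
    return sum(abs(p - t) for p, t in zip(pred, target)) / len(pred)


def generator_loss(pred, target, l1_spec_loss_weight, duplicated=False):
    """Hypothetical generator loss that contains only the L1 spectrogram term."""
    total = l1_spec_loss_weight * l1(pred, target)
    if duplicated:
        # The repeated block adds the same weighted term a second time.
        total += l1_spec_loss_weight * l1(pred, target)
    return total


# Toy values, chosen only for illustration.
pred = [0.5, 1.0, 2.0]
target = [1.0, 1.0, 1.0]
weight = 10.0  # arbitrary, not taken from any released config

buggy = generator_loss(pred, target, weight, duplicated=True)
fixed = generator_loss(pred, target, weight)
compensated = generator_loss(pred, target, 2 * weight)

print(buggy, fixed, compensated)
```

With the duplicate removed, `fixed` is half of `buggy`; doubling the weight (`compensated`) restores the earlier magnitude.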
Edresson
3ecd556bbe
add unit test for extract tts spectrograms script
2021-05-01 13:41:56 -03:00
Edresson
446b1da936
create inference function
2021-04-29 18:18:37 -03:00
Eren Gölge
f02f0338c2
fix .models.json and add testing to check released models availability
2021-04-29 09:32:36 +02:00
Eren Gölge
fd95e9b8a4
[ci skip] Add sam models
2021-04-28 21:57:31 +02:00
Agrin Hilmkil
351d0ed6ae
Remove unnecessary fsspec usage
2021-04-28 11:21:08 +02:00
Agrin Hilmkil
167f86417e
Move dev, tf, notebook dependencies to extras
2021-04-28 11:20:06 +02:00
Eren Gölge
1235e54738
test for synthesize.py
2021-04-27 14:17:38 +02:00
Eren Gölge
4719414f2e
remove imports
2021-04-27 11:25:17 +02:00
Eren Gölge
add97cddc1
move function and remove import
2021-04-27 11:22:56 +02:00
Eren Gölge
734e6a515c
bug fix
2021-04-27 10:27:45 +02:00
Eren Gölge
6bdd81667e
placeholders for sc-glow and hifigan models
2021-04-26 19:53:12 +02:00
Eren Gölge
2f0716073e
enable multi-speaker CoquiTTS models for synthesize.py
2021-04-26 19:36:53 +02:00
Eren Gölge
b531fa699c
remove conflict noise
2021-04-26 15:27:52 +02:00
Eren Gölge
f37b488876
Merge branch 'speaker-manager' of https://github.com/coqui-ai/TTS into speaker-manager
2021-04-26 15:25:25 +02:00
Eren Gölge
b82daa5e86
style and linter fixes
2021-04-26 15:22:24 +02:00
Edresson
20e42a3381
add save audio option
2021-04-23 15:00:00 -03:00
Edresson
8228091f92
add script for extraction of tts spectrograms
2021-04-23 14:17:46 -03:00
Eren Gölge
4cf211348d
styling and linting
2021-04-23 18:04:37 +02:00
Eren Gölge
7eb0c60d2e
let synthesizer pass speaker encoder file paths to speaker manager
2021-04-23 18:04:37 +02:00
Eren Gölge
f69195739e
let speaker manager compute mean x_vector from multiple wav files
2021-04-23 18:04:37 +02:00
Eren Gölge
179722e3a7
new arguments to synthesize.py for loading speaker encoder and speaker wavs
2021-04-23 18:04:37 +02:00
Eren Gölge
dfa415a8b8
small refactor in server.py
2021-04-23 18:04:37 +02:00
Eren Gölge
c80d21f311
load speaker_encoder_ap and compute x_vector directly from the input file in speaker manager
2021-04-23 18:04:37 +02:00
Eren Gölge
ad047c8195
html formatting, enable multi-speaker model on the server with a dropdown menu to select the speaker
2021-04-23 18:04:37 +02:00
Eren Gölge
f9f3d04d14
remove moved function
2021-04-23 18:04:37 +02:00
Eren Gölge
10c988ac8c
update server.py
2021-04-23 18:04:37 +02:00
Eren Gölge
6d0f5e0459
use SpeakerManager in Synthesizer
2021-04-23 18:04:37 +02:00
Eren Gölge
e97126314c
add ```unique``` argument to make_symbols to fix the incompat. issue of the SC-Glow models
2021-04-23 18:04:37 +02:00
Eren Gölge
d08888e603
formatting speakers.py
2021-04-23 18:04:37 +02:00
Eren Gölge
df422223a3
initial SpeakerManager implementation
2021-04-23 18:04:37 +02:00
Eren Gölge
7a7aeb35f5
fix the glow-tts in setup_model
2021-04-23 18:04:37 +02:00
Eren Gölge
d42748082a
update argument name external_speaker_embedding_dim -> speaker_embedding_dim
add inference_noise_scale argument to glow-tts
2021-04-23 18:04:37 +02:00
Eren Gölge
2da81f5bb6
add load_checkpoint to speaker encoder
2021-04-23 18:04:37 +02:00
Eren Gölge
1229ccbf07
update argument name in server.py
2021-04-23 18:04:37 +02:00
Eren Gölge
af2d36faeb
update synthesize.py for multi-speaker setting
2021-04-23 18:04:37 +02:00
Eren Gölge
99dc07a7dd
add ```unique``` param to keep scglow models compatible (there are duplicate symbols in the character set)
2021-04-23 18:04:37 +02:00
Eren Gölge
c955a12428
set the default layer size compatible with scglow
2021-04-23 18:04:37 +02:00
Eren Gölge
3ace2440fa
fix a mistake from rebase
2021-04-23 18:04:37 +02:00
Eren Gölge
aadb2106ec
code styling
2021-04-23 18:04:37 +02:00
Eren Gölge
af7baa3387
refactoring to allow defining the speaker file externally
2021-04-23 18:04:37 +02:00
kirianguiller
7dccbfdcd5
handle multi speaker and gst in Synthesizer class
2021-04-23 18:04:37 +02:00
Edresson
d2b6326b8b
change optimizer initialization for compatibility with Hifi-GAN official implementation
2021-04-23 07:54:39 -03:00