Commit Graph

3197 Commits

Author SHA1 Message Date
Eren Gölge cbb52b3d83 fix speaker_manager init 2021-06-22 13:12:52 +02:00
Eren Gölge 3f96491168 reduce multiband melgan test model size 2021-06-22 13:12:52 +02:00
Eren Gölge d2fd6a34a1 use get_speaker_manager in Trainer and save speakers.json file when needed 2021-06-22 13:12:52 +02:00
Eren Gölge 147550c65f make style and linter fixes 2021-06-22 13:12:52 +02:00
Eren Gölge a605dd3d08 Compute d_vectors and speaker_ids separately in TTSDataset 2021-06-22 13:12:52 +02:00
Eren Gölge f00ef90ce6 rename external speaker embedding arguments as `d_vectors` 2021-06-22 13:12:52 +02:00
Eren Gölge e7b7268c43 use `to_cuda()` for moving data in `format_batch()` 2021-06-22 13:12:52 +02:00
Eren Gölge fdce9eb234 reduce size of the metadata.csv used at testing 2021-06-22 13:12:52 +02:00
Eren Gölge 26a3312f0d change `to(device)` to `type_as` in models 2021-06-22 13:12:52 +02:00
Eren Gölge c09622459e init `durations = None` 2021-06-22 13:12:52 +02:00
Eren Gölge 2e31659dd9 docstring fix 2021-06-22 13:12:52 +02:00
Eren Gölge 7a0750a4f5 make style 2021-06-22 13:12:52 +02:00
Eren Gölge 534401377d styling formatting.py 2021-06-22 13:12:52 +02:00
Eren Gölge 120ea679f9 add `test_all` to makefile 2021-06-22 13:12:52 +02:00
Eren Gölge e229f5c081 fix type annotations 2021-06-22 13:12:52 +02:00
Eren Gölge ab7f299d48 update test to be less demanding 2021-06-22 13:12:52 +02:00
Eren Gölge a58e986f68 reduce fullband-melgan test model size 2021-06-22 13:12:52 +02:00
Eren Gölge 9960c0c356 update test for the new input output API of the tts models 2021-06-22 13:12:52 +02:00
Eren Gölge 506189bdee update glow-tts output shapes to match [B, T, C] 2021-06-22 13:12:52 +02:00
Eren Gölge f568833d28 formatting `cond_input` with a function in Tacotron models 2021-06-22 13:12:52 +02:00
Eren Gölge 254707c610 update imports for `formatters` 2021-06-22 13:12:52 +02:00
Eren Gölge 223502d827 fix glow-tts inference and forward functions for handling `cond_input` and refactor its test 2021-06-22 13:12:52 +02:00
Eren Gölge d4b1acfa81 refactor `SpeakerManager` 2021-06-22 13:12:52 +02:00
Eren Gölge 26e7c0960c linter fixes 2021-06-22 13:12:52 +02:00
Eren Gölge 79f7c5da1e delete separate tts training scripts and pre-commit configuration 2021-06-22 13:12:52 +02:00
Eren Gölge ca787be193 make style 2021-06-22 13:12:52 +02:00
Eren Gölge d376647ca0 `logging/__init__.py` 2021-06-22 13:12:52 +02:00
Eren Gölge bb58a0588e fix logger imports 2021-06-22 13:12:52 +02:00
Eren Gölge 9bbc924377 import missings 2021-06-22 13:12:52 +02:00
Eren Gölge b4d4ce0d7e remove redundant imports 2021-06-22 13:12:52 +02:00
Eren Gölge aefa71155c make style 2021-06-22 13:12:52 +02:00
Eren Gölge 88d8a94a10 update extract_tts_spectrogram for `cond_input` API of the models 2021-06-22 13:12:52 +02:00
Eren Gölge 667bb708b6 update `extract_tts_spec...` using `SpeakerManager` 2021-06-22 13:12:52 +02:00
Eren Gölge 830306d2fd update `extract_tts_spectrograms` for the new model API 2021-06-22 13:12:52 +02:00
Eren Gölge c673eb8ef8 correct import of `load_meta_data`; remove redundant import 2021-06-22 13:12:52 +02:00
Eren Gölge f0a419546b fix `Synthesized` for the new `synthesis()` 2021-06-22 13:12:52 +02:00
Eren Gölge c7ff175592 revert logging.info to print statements for trainer 2021-06-22 13:12:52 +02:00
Eren Gölge fd6afe5ae5 update `setup_model.py` imports 2021-06-22 13:12:52 +02:00
Eren Gölge 0c05318e8d update tts training tests to use the trainer 2021-06-22 13:12:52 +02:00
Eren Gölge c82d91051d update align_tts.py model for the trainer 2021-06-22 13:12:52 +02:00
Eren Gölge 4f66e816d1 update align_tts_loss for trainer 2021-06-22 13:12:52 +02:00
Eren Gölge 8213ad8b5f update align_tts_config for the trainer 2021-06-22 13:12:52 +02:00
Eren Gölge 8dfd4c91ff update trainer.py for better logging handling, restoring models and rename init_ functions with get_ 2021-06-22 13:12:52 +02:00
Eren Gölge fb9289d365 update `synthesis.py` for being more generic 2021-06-22 13:12:52 +02:00
Eren Gölge f121b0ff5d update `speedy_speech.py` model for trainer 2021-06-22 13:12:52 +02:00
Eren Gölge 843b3ba960 update `speedy_speecy_config.py` for the trainer 2021-06-22 13:12:52 +02:00
Eren Gölge c9790bee2c update tacotron model to return `model_outputs` 2021-06-22 13:12:52 +02:00
Eren Gölge f09ec7e3a7 update glow-tts for the trainer 2021-06-22 13:12:52 +02:00
Eren Gölge 3346a6d9dc update `sequence_mask` import globally 2021-06-22 13:12:52 +02:00
Eren Gölge 9765b1aa6b update `glow_tts_config.py` for setting the optimizer and the scheduler 2021-06-22 13:12:52 +02:00