3212 Commits

Author SHA1 Message Date
Eren Gölge 76e590f33c Update model test configs 2021-06-22 16:21:11 +02:00
Eren Gölge acd96a4940 Implement unified IO utils 2021-06-22 16:21:10 +02:00
Eren Gölge 6b907554f8 Implement unified trainer 2021-06-22 16:21:10 +02:00
Eren Gölge 20c4a8c8e1 `tts` model abstraction with `TTSModel` 2021-06-22 16:21:10 +02:00
Eren Gölge b934665fc0 fix calculation of `loader_start_time` 2021-06-22 16:21:10 +02:00
Eren Gölge 64f0f57757 `TrainerAbstract` and related updates for `TrainerTTS` 2021-06-22 16:21:10 +02:00
Eren Gölge f077a356e0 rename to 2021-06-22 16:21:10 +02:00
Eren Gölge 4575b70826 merge if branches with the same implementation 2021-06-22 16:21:10 +02:00
Eren Gölge 59be1b9af1 adjust `distribute.py` for the `train_tts.py` 2021-06-22 16:21:10 +02:00
Eren Gölge 614738cc85 downsize melgan test model size 2021-06-22 13:12:52 +02:00
Eren Gölge 4f29725eb6 fix glow-tts `inference()` 2021-06-22 13:12:52 +02:00
Eren Gölge 7ea71c7586 use one testing sentence in tts tests 2021-06-22 13:12:52 +02:00
Eren Gölge a87c886497 refactor and fix multi-speaker training in Trainer and Tacotron models 2021-06-22 13:12:52 +02:00
Eren Gölge 0206bb847b add max_decoder_steps argument to tacotron models 2021-06-22 13:12:52 +02:00
Eren Gölge cbb52b3d83 fix speaker_manager init 2021-06-22 13:12:52 +02:00
Eren Gölge 3f96491168 reduce multiband melgan test model size 2021-06-22 13:12:52 +02:00
Eren Gölge d2fd6a34a1 use get_speaker_manager in Trainer and save speakers.json file when needed 2021-06-22 13:12:52 +02:00
Eren Gölge 147550c65f make style and linter fixes 2021-06-22 13:12:52 +02:00
Eren Gölge a605dd3d08 Compute d_vectors and speaker_ids separately in TTSDataset 2021-06-22 13:12:52 +02:00
Eren Gölge f00ef90ce6 rename external speaker embedding arguments as `d_vectors` 2021-06-22 13:12:52 +02:00
Eren Gölge e7b7268c43 use `to_cuda()` for moving data in `format_batch()` 2021-06-22 13:12:52 +02:00
Eren Gölge fdce9eb234 reduce size of the metadata.csv used at testing 2021-06-22 13:12:52 +02:00
Eren Gölge 26a3312f0d change `to(device)` to `type_as` in models 2021-06-22 13:12:52 +02:00
Eren Gölge c09622459e init `durations = None` 2021-06-22 13:12:52 +02:00
Eren Gölge 2e31659dd9 docstring fix 2021-06-22 13:12:52 +02:00
Eren Gölge 7a0750a4f5 make style 2021-06-22 13:12:52 +02:00
Eren Gölge 534401377d styling formatting.py 2021-06-22 13:12:52 +02:00
Eren Gölge 120ea679f9 add `test_all` to makefile 2021-06-22 13:12:52 +02:00
Eren Gölge e229f5c081 fix type annotations 2021-06-22 13:12:52 +02:00
Eren Gölge ab7f299d48 update test to be less demanding 2021-06-22 13:12:52 +02:00
Eren Gölge a58e986f68 reduce fullband-melgan test model size 2021-06-22 13:12:52 +02:00
Eren Gölge 9960c0c356 update test for the new input output API of the tts models 2021-06-22 13:12:52 +02:00
Eren Gölge 506189bdee update glow-tts output shapes to match [B, T, C] 2021-06-22 13:12:52 +02:00
Eren Gölge f568833d28 formatting `cond_input` with a function in Tacotron models 2021-06-22 13:12:52 +02:00
Eren Gölge 254707c610 update imports for `formatters` 2021-06-22 13:12:52 +02:00
Eren Gölge 223502d827 fix glow-tts inference and forward functions for handling `cond_input` and refactor its test 2021-06-22 13:12:52 +02:00
Eren Gölge d4b1acfa81 refactor `SpeakerManager` 2021-06-22 13:12:52 +02:00
Eren Gölge 26e7c0960c linter fixes 2021-06-22 13:12:52 +02:00
Eren Gölge 79f7c5da1e delete separate tts training scripts and pre-commit configuration 2021-06-22 13:12:52 +02:00
Eren Gölge ca787be193 make style 2021-06-22 13:12:52 +02:00
Eren Gölge d376647ca0 `logging/__init__.py` 2021-06-22 13:12:52 +02:00
Eren Gölge bb58a0588e fix logger imports 2021-06-22 13:12:52 +02:00
Eren Gölge 9bbc924377 import missings 2021-06-22 13:12:52 +02:00
Eren Gölge b4d4ce0d7e remove redundant imports 2021-06-22 13:12:52 +02:00
Eren Gölge aefa71155c make style 2021-06-22 13:12:52 +02:00
Eren Gölge 88d8a94a10 update extract_tts_spectrogram for `cond_input` API of the models 2021-06-22 13:12:52 +02:00
Eren Gölge 667bb708b6 update `extract_tts_spec...` using `SpeakerManager` 2021-06-22 13:12:52 +02:00
Eren Gölge 830306d2fd update `extract_tts_spectrograms` for the new model API 2021-06-22 13:12:52 +02:00
Eren Gölge c673eb8ef8 correct import of `load_meta_data`; remove redundant import 2021-06-22 13:12:52 +02:00
Eren Gölge f0a419546b fix `Synthesized` for the new `synthesis()` 2021-06-22 13:12:52 +02:00