Commit Graph

1019 Commits

Author SHA1 Message Date
Eren Gölge 8e52a69230 delete separate tts training scripts and pre-commit configuration 2021-06-28 17:03:19 +02:00
Eren Gölge d96ebcd6d3 make style 2021-06-28 17:03:19 +02:00
Eren Gölge b643e8b37c `logging/__init__.py` 2021-06-28 17:03:19 +02:00
Eren Gölge 0cee5042a9 fix logger imports 2021-06-28 17:03:19 +02:00
Eren Gölge 72dceca52c import missings 2021-06-28 17:03:19 +02:00
Eren Gölge 0eec238429 remove redundant imports 2021-06-28 17:03:19 +02:00
Eren Gölge b500338faa make style 2021-06-28 17:03:19 +02:00
Eren Gölge 469d2e620a update extract_tts_spectrogram for `cond_input` API of the models 2021-06-28 17:03:19 +02:00
Eren Gölge 5ab28fa618 update `extract_tts_spec...` using `SpeakerManager` 2021-06-28 17:03:19 +02:00
Eren Gölge c392fa4288 update `extract_tts_spectrograms` for the new model API 2021-06-28 17:03:19 +02:00
Eren Gölge 8f47f95998 correct import of `load_meta_data`; remove redundant import 2021-06-28 17:03:19 +02:00
Eren Gölge c680a07a20 fix `Synthesized` for the new `synthesis()` 2021-06-28 17:03:19 +02:00
Eren Gölge 73bf9673ed revert logging.info to print statements for trainer 2021-06-28 17:03:19 +02:00
Eren Gölge d25f017b42 update `setup_model.py` imports 2021-06-28 17:03:19 +02:00
Eren Gölge bb355b7441 update align_tts.py model for the trainer 2021-06-28 17:03:19 +02:00
Eren Gölge 9203b863d9 update align_tts_loss for trainer 2021-06-28 17:03:19 +02:00
Eren Gölge fc9a0fb8ce update aling_tts_config for the trainer 2021-06-28 17:03:19 +02:00
Eren Gölge e298b8e364 update trainer.py for better logging handling, restoring models and rename init_ functions with get_ 2021-06-28 17:03:19 +02:00
Eren Gölge b8a4af4010 update `synthesis.py` for being more generic 2021-06-28 17:03:19 +02:00
Eren Gölge c70d0c9dae update `speedy_speech.py` model for trainer 2021-06-28 17:03:19 +02:00
Eren Gölge 06ee57d816 update `speedy_speecy_config.py` for the trainer 2021-06-28 17:03:19 +02:00
Eren Gölge 4e910993f1 update tacotron model to return `model_outputs` 2021-06-28 17:03:19 +02:00
Eren Gölge bb4deee64c update glow-tts for the trainer 2021-06-28 17:03:19 +02:00
Eren Gölge 9134c7dfb6 update `sequence_mask` import globally 2021-06-28 17:03:19 +02:00
Eren Gölge b2218e882a update `glow_tts_config.py` for setting the optimizer and the scheduler 2021-06-28 17:03:19 +02:00
Eren Gölge 891631ab47 typing annotation for the trainer 2021-06-28 17:03:19 +02:00
Eren Gölge 5f07315722 add trainer and train_tts 2021-06-28 17:03:19 +02:00
Eren Gölge 34f8a74e4d remove `truncated` from synthesizer 2021-06-28 17:03:19 +02:00
Eren Gölge 178eccbc16 update console logger 2021-06-28 17:03:19 +02:00
Eren Gölge f4f83b6379 update `synthesis.py` for the trainer 2021-06-28 17:03:19 +02:00
Eren Gölge 130781dab6 remove `tts.generic_utils` as all the functions are moved to other files 2021-06-28 17:03:19 +02:00
Eren Gölge 535a458f40 update Tacotron models for the trainer 2021-06-28 17:03:19 +02:00
Eren Gölge bdbfc95618 add `gradual_training` argument to tacotron.py 2021-06-28 17:03:19 +02:00
Eren Gölge 5a2e75f0ee import missings for tacotron.py 2021-06-28 17:03:19 +02:00
Eren Gölge da7d10e53c mode `setup_model()` to `models/__init__.py` 2021-06-28 17:03:19 +02:00
Eren Gölge ca302db7b0 add sequence_mask to `utils.data` 2021-06-28 17:03:19 +02:00
Eren Gölge 844abb3b1d `setup_loss()` in `layer/__init__.py` 2021-06-28 17:03:19 +02:00
Eren Gölge a20a1c7d06 rename preprocess.py -> formatters.py 2021-06-28 17:03:19 +02:00
Eren Gölge b9bccbb243 move load_meta_data and related functions to `datasets/__init__.py` 2021-06-28 17:03:19 +02:00
Eren Gölge d09385808a set test_sentences in config 2021-06-28 17:03:19 +02:00
Eren Gölge 8def3c87af trainer-API updates 2021-06-28 17:03:19 +02:00
Eren Gölge 42554cc711 rename MyDataset -> TTSDataset 2021-06-28 17:03:19 +02:00
Edresson 1c4e806f54 use speaker manager on compute embeddings script 2021-06-27 03:35:34 -03:00
Edresson Casanova eb84bb2bc8 Merge branch 'dev' into dev 2021-06-26 15:32:19 -03:00
Eren Gölge 987cf1178b Bump up to v0.0.16 2021-06-25 14:44:56 +02:00
Michael Hansen 3f172b84d8 Fix linting issues 2021-06-25 14:41:31 +02:00
Michael Hansen 4d8426fa0a Use eSpeak IPA lexicons by default for phoneme models 2021-06-25 14:41:05 +02:00
Michael Hansen 618b509204 Use combined characters available in TTS phonemes (like ç) 2021-06-25 14:41:05 +02:00
Michael Hansen da6f6a4a01 Update docstring for clean_gruut_phonemes 2021-06-25 14:41:05 +02:00
Michael Hansen 47191f3ecc Add tests for gruut phonemization 2021-06-25 14:41:05 +02:00
Michael Hansen 67869e77f9 Use gruut for phonemization 2021-06-25 14:41:05 +02:00
Eren Gölge 788992093d Add UnivNet vocoder 🚀 2021-06-23 13:51:04 +02:00
Eren Gölge 64fd59204c Use `torch.linalg.qr` for pytorch > `v1.9.0` 2021-06-23 13:49:42 +02:00
Eren Gölge aba840b4e6 Fix loading the `amp` scaler from a checkpoint 🛠️ 2021-06-23 13:49:42 +02:00
Eren Gölge 18e5393f16 Add 🐍 python 3.9 to CI 2021-06-23 13:49:36 +02:00
Eren Gölge 0ff2d2336a Fix wrong argument name 🛠️ 2021-06-22 16:21:11 +02:00
Eren Gölge 61c3cb871f Docstring edit in `TTSDataset.py` ✍️ 2021-06-22 16:21:11 +02:00
Eren Gölge 6f739ea07a Fix `eval_log` for `gan.py` 🛠️ 2021-06-22 16:21:11 +02:00
Eren Gölge ebb91c0fbb Move `TorchSTFT` to `utils.audio` 2021-06-22 16:21:11 +02:00
Eren Gölge 01c4b22a2f Fixup `trainer.py` 🛠️ 2021-06-22 16:21:11 +02:00
Eren Gölge 7de2756fc4 Enable support for 🐍 python 3.10; Bump up versions numpy 1.19.5 and TF 2.5.0 2021-06-22 16:21:11 +02:00
Eren Gölge 220e184f66 Apply small fixes for API compatibility 2021-06-22 16:21:11 +02:00
Eren Gölge 77d57dd301 Print `max_decoder_steps` when model reaches the limit 2021-06-22 16:21:11 +02:00
Eren Gölge 7dc2177df4 Update `synthesizer` for speaker and model init 2021-06-22 16:21:11 +02:00
Eren Gölge c3a0bc702e fixup configs 2021-06-22 16:21:11 +02:00
Eren Gölge 0e01c2594f Update `speaker_manager` 2021-06-22 16:21:11 +02:00
Eren Gölge 8182f5168f Fixup `utils` for the trainer 2021-06-22 16:21:11 +02:00
Eren Gölge b4bb567e04 Update `vocoder` utils 2021-06-22 16:21:11 +02:00
Eren Gölge f3ff5b1971 Update `TTS.bin` scripts for the new API 2021-06-22 16:21:11 +02:00
Eren Gölge aed919cf1c Update `vocoder` datasets and `setup_dataset` 2021-06-22 16:21:11 +02:00
Eren Gölge 59abf490a1 Implement `setup_model` for vocoder models 2021-06-22 16:21:11 +02:00
Eren Gölge 420820caf4 Update vocoder models 2021-06-22 16:21:11 +02:00
Eren Gölge d10f9c5676 Update `tts.models.setup_model` 2021-06-22 16:21:11 +02:00
Eren Gölge cae702980f Create base 🐸TTS model abstraction for tts models 2021-06-22 16:21:11 +02:00
Eren Gölge 70d968b169 Update vocoder model configs 2021-06-22 16:21:11 +02:00
Eren Gölge f8a3460818 Update tts model configs 2021-06-22 16:21:11 +02:00
Eren Gölge acd96a4940 Implement unified IO utils 2021-06-22 16:21:10 +02:00
Eren Gölge 6b907554f8 Implement unified trainer 2021-06-22 16:21:10 +02:00
Eren Gölge 20c4a8c8e1 `tts` model abstraction with `TTSModel` 2021-06-22 16:21:10 +02:00
Eren Gölge b934665fc0 fix calculation of `loader_start_time` 2021-06-22 16:21:10 +02:00
Eren Gölge 64f0f57757 `TrainerAbstract` and related updates for `TrainerTTS` 2021-06-22 16:21:10 +02:00
Eren Gölge f077a356e0 rename to 2021-06-22 16:21:10 +02:00
Eren Gölge 4575b70826 merge if branches with the same implementation 2021-06-22 16:21:10 +02:00
Eren Gölge 59be1b9af1 adjust `distribute.py` for the `train_tts.py` 2021-06-22 16:21:10 +02:00
Eren Gölge 614738cc85 downsize melgan test model size 2021-06-22 13:12:52 +02:00
Eren Gölge 4f29725eb6 fix glow-tts `inference()` 2021-06-22 13:12:52 +02:00
Eren Gölge a87c886497 refactor and fix multi-speaker training in Trainer and Tacotron models 2021-06-22 13:12:52 +02:00
Eren Gölge 0206bb847b add max_decoder_steps argument to tacotron models 2021-06-22 13:12:52 +02:00
Eren Gölge cbb52b3d83 fix speaker_manager init 2021-06-22 13:12:52 +02:00
Eren Gölge d2fd6a34a1 use get_speaker_manager in Trainer and save speakers.json file when needed 2021-06-22 13:12:52 +02:00
Eren Gölge 147550c65f make style and linter fixes 2021-06-22 13:12:52 +02:00
Eren Gölge a605dd3d08 Compute d_vectors and speaker_ids separately in TTSDataset 2021-06-22 13:12:52 +02:00
Eren Gölge f00ef90ce6 rename external speaker embedding arguments as `d_vectors` 2021-06-22 13:12:52 +02:00
Eren Gölge e7b7268c43 use `to_cuda()` for moving data in `format_batch()` 2021-06-22 13:12:52 +02:00
Eren Gölge 26a3312f0d change `to(device)` to `type_as` in models 2021-06-22 13:12:52 +02:00
Eren Gölge c09622459e init `durations = None` 2021-06-22 13:12:52 +02:00
Eren Gölge 2e31659dd9 docstring fix 2021-06-22 13:12:52 +02:00
Eren Gölge 7a0750a4f5 make style 2021-06-22 13:12:52 +02:00
Eren Gölge 534401377d styling formatting.py 2021-06-22 13:12:52 +02:00
Eren Gölge e229f5c081 fix type annotations 2021-06-22 13:12:52 +02:00
Eren Gölge 506189bdee update glow-tts output shapes to match [B, T, C] 2021-06-22 13:12:52 +02:00
Eren Gölge f568833d28 formating `cond_input` with a function in Tacotron models 2021-06-22 13:12:52 +02:00
Eren Gölge 254707c610 update imports for `formatters` 2021-06-22 13:12:52 +02:00
Eren Gölge 223502d827 fix glow-tts inference and forward functions for handling `cond_input` and refactor its test 2021-06-22 13:12:52 +02:00
Eren Gölge d4b1acfa81 refactor `SpeakerManager` 2021-06-22 13:12:52 +02:00
Eren Gölge 26e7c0960c linter fixes 2021-06-22 13:12:52 +02:00
Eren Gölge 79f7c5da1e delete separate tts training scripts and pre-commit configuration 2021-06-22 13:12:52 +02:00
Eren Gölge ca787be193 make style 2021-06-22 13:12:52 +02:00
Eren Gölge d376647ca0 `logging/__init__.py` 2021-06-22 13:12:52 +02:00
Eren Gölge bb58a0588e fix logger imports 2021-06-22 13:12:52 +02:00
Eren Gölge 9bbc924377 import missings 2021-06-22 13:12:52 +02:00
Eren Gölge b4d4ce0d7e remove redundant imports 2021-06-22 13:12:52 +02:00
Eren Gölge aefa71155c make style 2021-06-22 13:12:52 +02:00
Eren Gölge 88d8a94a10 update extract_tts_spectrogram for `cond_input` API of the models 2021-06-22 13:12:52 +02:00
Eren Gölge 667bb708b6 update `extract_tts_spec...` using `SpeakerManager` 2021-06-22 13:12:52 +02:00
Eren Gölge 830306d2fd update `extract_tts_spectrograms` for the new model API 2021-06-22 13:12:52 +02:00
Eren Gölge c673eb8ef8 correct import of `load_meta_data`; remove redundant import 2021-06-22 13:12:52 +02:00
Eren Gölge f0a419546b fix `Synthesized` for the new `synthesis()` 2021-06-22 13:12:52 +02:00
Eren Gölge c7ff175592 revert logging.info to print statements for trainer 2021-06-22 13:12:52 +02:00
Eren Gölge fd6afe5ae5 update `setup_model.py` imports 2021-06-22 13:12:52 +02:00
Eren Gölge c82d91051d update align_tts.py model for the trainer 2021-06-22 13:12:52 +02:00
Eren Gölge 4f66e816d1 update align_tts_loss for trainer 2021-06-22 13:12:52 +02:00
Eren Gölge 8213ad8b5f update aling_tts_config for the trainer 2021-06-22 13:12:52 +02:00
Eren Gölge 8dfd4c91ff update trainer.py for better logging handling, restoring models and rename init_ functions with get_ 2021-06-22 13:12:52 +02:00
Eren Gölge fb9289d365 update `synthesis.py` for being more generic 2021-06-22 13:12:52 +02:00
Eren Gölge f121b0ff5d update `speedy_speech.py` model for trainer 2021-06-22 13:12:52 +02:00
Eren Gölge 843b3ba960 update `speedy_speecy_config.py` for the trainer 2021-06-22 13:12:52 +02:00
Eren Gölge c9790bee2c update tacotron model to return `model_outputs` 2021-06-22 13:12:52 +02:00
Eren Gölge f09ec7e3a7 update glow-tts for the trainer 2021-06-22 13:12:52 +02:00
Eren Gölge 3346a6d9dc update `sequence_mask` import globally 2021-06-22 13:12:52 +02:00
Eren Gölge 9765b1aa6b update `glow_tts_config.py` for setting the optimizer and the scheduler 2021-06-22 13:12:52 +02:00
Eren Gölge 6bf6543df8 typing annotation for the trainer 2021-06-22 13:12:52 +02:00
Eren Gölge 57cdddef16 add trainer and train_tts 2021-06-22 13:12:52 +02:00
Eren Gölge d769af9e3b remove `truncated` from synthesizer 2021-06-22 13:12:52 +02:00
Eren Gölge 570633ab80 update console logger 2021-06-22 13:12:52 +02:00
Eren Gölge 2ac6b824ca update `synthesis.py` for the trainer 2021-06-22 13:12:52 +02:00
Eren Gölge c9e5527070 remove `tts.generic_utils` as all the functions are moved to other files 2021-06-22 13:12:52 +02:00
Eren Gölge 2ab723cd10 update Tacotron models for the trainer 2021-06-22 13:12:52 +02:00
Eren Gölge d6b6a15b5c add `gradual_training` argument to tacotron.py 2021-06-22 13:12:52 +02:00
Eren Gölge 118a7f2b43 import missings for tacotron.py 2021-06-22 13:12:52 +02:00
Eren Gölge c98149d488 mode `setup_model()` to `models/__init__.py` 2021-06-22 13:12:52 +02:00
Eren Gölge 86edf6ab0e add sequence_mask to `utils.data` 2021-06-22 13:12:52 +02:00
Eren Gölge c61486b1e3 `setup_loss()` in `layer/__init__.py` 2021-06-22 13:12:52 +02:00
Eren Gölge f07209d2e0 rename preprocess.py -> formatters.py 2021-06-22 13:12:52 +02:00
Eren Gölge facb782851 move load_meta_data and related functions to `datasets/__init__.py` 2021-06-22 13:12:52 +02:00
Eren Gölge b9d4355d20 set test_sentences in config 2021-06-22 13:12:52 +02:00
Eren Gölge 7bdd0eb72f trainer-API updates 2021-06-22 13:12:52 +02:00
Eren Gölge 0f284841d1 rename MyDataset -> TTSDataset 2021-06-22 13:12:52 +02:00
Edresson 99d40e98d9 fix Lint checks 2021-06-18 14:59:01 -03:00
Edresson 28bec238ca fix Lint checks 2021-06-18 14:33:50 -03:00