Commit Graph

1123 Commits

Author SHA1 Message Date
Eren Gölge ef4ea9e527 update imports for `formatters` 2021-06-28 17:03:19 +02:00
Eren Gölge 6c495c6a6e fix glow-tts inference and forward functions for handling `cond_input`
and refactor its test
2021-06-28 17:03:19 +02:00
Eren Gölge f840268181 refactor `SpeakerManager` 2021-06-28 17:03:19 +02:00
Eren Gölge 421194880d linter fixes 2021-06-28 17:03:19 +02:00
Eren Gölge 8e52a69230 delete separate tts training scripts and pre-commit configuration 2021-06-28 17:03:19 +02:00
Eren Gölge d96ebcd6d3 make style 2021-06-28 17:03:19 +02:00
Eren Gölge b643e8b37c `logging/__init__.py` 2021-06-28 17:03:19 +02:00
Eren Gölge 0cee5042a9 fix logger imports 2021-06-28 17:03:19 +02:00
Eren Gölge 72dceca52c import missings 2021-06-28 17:03:19 +02:00
Eren Gölge 0eec238429 remove redundant imports 2021-06-28 17:03:19 +02:00
Eren Gölge b500338faa make style 2021-06-28 17:03:19 +02:00
Eren Gölge 469d2e620a update extract_tts_spectrogram for `cond_input` API of the models 2021-06-28 17:03:19 +02:00
Eren Gölge 5ab28fa618 update `extract_tts_spec...` using `SpeakerManager` 2021-06-28 17:03:19 +02:00
Eren Gölge c392fa4288 update `extract_tts_spectrograms` for the new model API 2021-06-28 17:03:19 +02:00
Eren Gölge 8f47f95998 correct import of `load_meta_data`
remove redundant import
2021-06-28 17:03:19 +02:00
Eren Gölge c680a07a20 fix `Synthesized` for the new `synthesis()` 2021-06-28 17:03:19 +02:00
Eren Gölge 73bf9673ed revert logging.info to print statements for trainer 2021-06-28 17:03:19 +02:00
Eren Gölge d25f017b42 update `setup_model.py` imports 2021-06-28 17:03:19 +02:00
Eren Gölge bb355b7441 update align_tts.py model for the trainer 2021-06-28 17:03:19 +02:00
Eren Gölge 9203b863d9 update align_tts_loss for trainer 2021-06-28 17:03:19 +02:00
Eren Gölge fc9a0fb8ce update aling_tts_config for the trainer 2021-06-28 17:03:19 +02:00
Eren Gölge e298b8e364 update trainer.py for better logging handling, restoring models and
rename init_ functions with get_
2021-06-28 17:03:19 +02:00
Eren Gölge b8a4af4010 update `synthesis.py` for being more generic 2021-06-28 17:03:19 +02:00
Eren Gölge c70d0c9dae update `speedy_speech.py` model for trainer 2021-06-28 17:03:19 +02:00
Eren Gölge 06ee57d816 update `speedy_speecy_config.py` for the trainer 2021-06-28 17:03:19 +02:00
Eren Gölge 4e910993f1 update tacotron model to return `model_outputs` 2021-06-28 17:03:19 +02:00
Eren Gölge bb4deee64c update glow-tts for the trainer 2021-06-28 17:03:19 +02:00
Eren Gölge 9134c7dfb6 update `sequence_mask` import globally 2021-06-28 17:03:19 +02:00
Eren Gölge b2218e882a update `glow_tts_config.py` for setting the optimizer and the scheduler 2021-06-28 17:03:19 +02:00
Eren Gölge 891631ab47 typing annotation for the trainer 2021-06-28 17:03:19 +02:00
Eren Gölge 5f07315722 add trainer and train_tts 2021-06-28 17:03:19 +02:00
Eren Gölge 34f8a74e4d remove `truncated` from synthesizer 2021-06-28 17:03:19 +02:00
Eren Gölge 178eccbc16 update console logger 2021-06-28 17:03:19 +02:00
Eren Gölge f4f83b6379 update `synthesis.py` for the trainer 2021-06-28 17:03:19 +02:00
Eren Gölge 130781dab6 remove `tts.generic_utils` as all the functions are moved to other files 2021-06-28 17:03:19 +02:00
Eren Gölge 535a458f40 update Tacotron models for the trainer 2021-06-28 17:03:19 +02:00
Eren Gölge bdbfc95618 add `gradual_training` argument to tacotron.py 2021-06-28 17:03:19 +02:00
Eren Gölge 5a2e75f0ee import missings for tacotron.py 2021-06-28 17:03:19 +02:00
Eren Gölge da7d10e53c mode `setup_model()` to `models/__init__.py` 2021-06-28 17:03:19 +02:00
Eren Gölge ca302db7b0 add sequence_mask to `utils.data` 2021-06-28 17:03:19 +02:00
Eren Gölge 844abb3b1d `setup_loss()` in `layer/__init__.py` 2021-06-28 17:03:19 +02:00
Eren Gölge a20a1c7d06 rename preprocess.py -> formatters.py 2021-06-28 17:03:19 +02:00
Eren Gölge b9bccbb243 move load_meta_data and related functions to `datasets/__init__.py` 2021-06-28 17:03:19 +02:00
Eren Gölge d09385808a set test_sentences in config 2021-06-28 17:03:19 +02:00
Eren Gölge 8def3c87af trainer-API updates 2021-06-28 17:03:19 +02:00
Eren Gölge 42554cc711 rename MyDataset -> TTSDataset 2021-06-28 17:03:19 +02:00
Edresson 1c4e806f54 use speaker manager on compute embeddings script 2021-06-27 03:35:34 -03:00
Edresson Casanova eb84bb2bc8
Merge branch 'dev' into dev 2021-06-26 15:32:19 -03:00
Eren Gölge 987cf1178b Bump up to v0.0.16 2021-06-25 14:44:56 +02:00
Michael Hansen 3f172b84d8 Fix linting issues 2021-06-25 14:41:31 +02:00
Michael Hansen 4d8426fa0a Use eSpeak IPA lexicons by default for phoneme models 2021-06-25 14:41:05 +02:00
Michael Hansen 618b509204 Use combined characters available in TTS phonemes (like ç) 2021-06-25 14:41:05 +02:00
Michael Hansen da6f6a4a01 Update docstring for clean_gruut_phonemes 2021-06-25 14:41:05 +02:00
Michael Hansen 47191f3ecc Add tests for gruut phonemization 2021-06-25 14:41:05 +02:00
Michael Hansen 67869e77f9 Use gruut for phonemization 2021-06-25 14:41:05 +02:00
Eren Gölge 788992093d Add UnivNet vocoder 🚀 2021-06-23 13:51:04 +02:00
Eren Gölge 64fd59204c Use `torch.linalg.qr` for pytorch > `v1.9.0` 2021-06-23 13:49:42 +02:00
Eren Gölge aba840b4e6 Fix loading the `amp` scaler from a checkpoint 🛠️ 2021-06-23 13:49:42 +02:00
Eren Gölge 18e5393f16 Add 🐍 python 3.9 to CI 2021-06-23 13:49:36 +02:00
Eren Gölge 0ff2d2336a Fix wrong argument name 🛠️ 2021-06-22 16:21:11 +02:00
Eren Gölge 61c3cb871f Docstring edit in `TTSDataset.py` ✍️ 2021-06-22 16:21:11 +02:00
Eren Gölge 6f739ea07a Fix `eval_log` for `gan.py` 🛠️ 2021-06-22 16:21:11 +02:00
Eren Gölge ebb91c0fbb Move `TorchSTFT` to `utils.audio` 2021-06-22 16:21:11 +02:00
Eren Gölge 01c4b22a2f Fixup `trainer.py` 🛠️ 2021-06-22 16:21:11 +02:00
Eren Gölge 7de2756fc4 Enable support for 🐍 python 3.10
Bump up versions numpy 1.19.5 and TF 2.5.0
2021-06-22 16:21:11 +02:00
Eren Gölge 220e184f66 Apply small fixes for API compatibility 2021-06-22 16:21:11 +02:00
Eren Gölge 77d57dd301 Print `max_decoder_steps` when model reaches the limit 2021-06-22 16:21:11 +02:00
Eren Gölge 7dc2177df4 Update `synthesizer` for speaker and model init 2021-06-22 16:21:11 +02:00
Eren Gölge c3a0bc702e fixup configs 2021-06-22 16:21:11 +02:00
Eren Gölge 0e01c2594f Update `speaker_manager` 2021-06-22 16:21:11 +02:00
Eren Gölge 8182f5168f Fixup `utils` for the trainer 2021-06-22 16:21:11 +02:00
Eren Gölge b4bb567e04 Update `vocoder` utils 2021-06-22 16:21:11 +02:00
Eren Gölge f3ff5b1971 Update `TTS.bin` scripts for the new API 2021-06-22 16:21:11 +02:00
Eren Gölge aed919cf1c Update `vocoder` datasets and `setup_dataset` 2021-06-22 16:21:11 +02:00
Eren Gölge 59abf490a1 Implement `setup_model` for vocoder models 2021-06-22 16:21:11 +02:00
Eren Gölge 420820caf4 Update vocoder models 2021-06-22 16:21:11 +02:00
Eren Gölge d10f9c5676 Update `tts.models.setup_model` 2021-06-22 16:21:11 +02:00
Eren Gölge cae702980f Create base 🐸TTS model abstraction for tts models 2021-06-22 16:21:11 +02:00
Eren Gölge 70d968b169 Update vocoder model configs 2021-06-22 16:21:11 +02:00
Eren Gölge f8a3460818 Update tts model configs 2021-06-22 16:21:11 +02:00
Eren Gölge acd96a4940 Implement unified IO utils 2021-06-22 16:21:10 +02:00
Eren Gölge 6b907554f8 Implement unified trainer 2021-06-22 16:21:10 +02:00
Eren Gölge 20c4a8c8e1 `tts` model abstraction with `TTSModel` 2021-06-22 16:21:10 +02:00
Eren Gölge b934665fc0 fix calculation of `loader_start_time` 2021-06-22 16:21:10 +02:00
Eren Gölge 64f0f57757 `TrainerAbstract` and related updates for `TrainerTTS` 2021-06-22 16:21:10 +02:00
Eren Gölge f077a356e0 rename to 2021-06-22 16:21:10 +02:00
Eren Gölge 4575b70826 merge if branches with the same implementation 2021-06-22 16:21:10 +02:00
Eren Gölge 59be1b9af1 adjust `distribute.py` for the `train_tts.py` 2021-06-22 16:21:10 +02:00
Eren Gölge 614738cc85 downsize melgan test model size 2021-06-22 13:12:52 +02:00
Eren Gölge 4f29725eb6 fix glow-tts `inference()` 2021-06-22 13:12:52 +02:00
Eren Gölge a87c886497 refactor and fix multi-speaker training in Trainer and Tacotron models 2021-06-22 13:12:52 +02:00
Eren Gölge 0206bb847b add max_decoder_steps argument to tacotron models 2021-06-22 13:12:52 +02:00
Eren Gölge cbb52b3d83 fix speaker_manager init 2021-06-22 13:12:52 +02:00
Eren Gölge d2fd6a34a1 use get_speaker_manager in Trainer and save speakers.json file when
needed
2021-06-22 13:12:52 +02:00
Eren Gölge 147550c65f make style and linter fixes 2021-06-22 13:12:52 +02:00
Eren Gölge a605dd3d08 Compute d_vectors and speaker_ids separately in TTSDataset 2021-06-22 13:12:52 +02:00
Eren Gölge f00ef90ce6 rename external speaker embedding arguments as `d_vectors` 2021-06-22 13:12:52 +02:00
Eren Gölge e7b7268c43 use `to_cuda()` for moving data in `format_batch()` 2021-06-22 13:12:52 +02:00
Eren Gölge 26a3312f0d change `to(device)` to `type_as` in models 2021-06-22 13:12:52 +02:00
Eren Gölge c09622459e init `durations = None` 2021-06-22 13:12:52 +02:00
Eren Gölge 2e31659dd9 docstring fix 2021-06-22 13:12:52 +02:00
Eren Gölge 7a0750a4f5 make style 2021-06-22 13:12:52 +02:00
Eren Gölge 534401377d styling formatting.py 2021-06-22 13:12:52 +02:00
Eren Gölge e229f5c081 fix type annotations 2021-06-22 13:12:52 +02:00
Eren Gölge 506189bdee update glow-tts output shapes to match [B, T, C] 2021-06-22 13:12:52 +02:00
Eren Gölge f568833d28 formating `cond_input` with a function in Tacotron models 2021-06-22 13:12:52 +02:00
Eren Gölge 254707c610 update imports for `formatters` 2021-06-22 13:12:52 +02:00
Eren Gölge 223502d827 fix glow-tts inference and forward functions for handling `cond_input`
and refactor its test
2021-06-22 13:12:52 +02:00
Eren Gölge d4b1acfa81 refactor `SpeakerManager` 2021-06-22 13:12:52 +02:00
Eren Gölge 26e7c0960c linter fixes 2021-06-22 13:12:52 +02:00
Eren Gölge 79f7c5da1e delete separate tts training scripts and pre-commit configuration 2021-06-22 13:12:52 +02:00
Eren Gölge ca787be193 make style 2021-06-22 13:12:52 +02:00
Eren Gölge d376647ca0 `logging/__init__.py` 2021-06-22 13:12:52 +02:00
Eren Gölge bb58a0588e fix logger imports 2021-06-22 13:12:52 +02:00
Eren Gölge 9bbc924377 import missings 2021-06-22 13:12:52 +02:00
Eren Gölge b4d4ce0d7e remove redundant imports 2021-06-22 13:12:52 +02:00
Eren Gölge aefa71155c make style 2021-06-22 13:12:52 +02:00
Eren Gölge 88d8a94a10 update extract_tts_spectrogram for `cond_input` API of the models 2021-06-22 13:12:52 +02:00
Eren Gölge 667bb708b6 update `extract_tts_spec...` using `SpeakerManager` 2021-06-22 13:12:52 +02:00
Eren Gölge 830306d2fd update `extract_tts_spectrograms` for the new model API 2021-06-22 13:12:52 +02:00
Eren Gölge c673eb8ef8 correct import of `load_meta_data`
remove redundant import
2021-06-22 13:12:52 +02:00
Eren Gölge f0a419546b fix `Synthesized` for the new `synthesis()` 2021-06-22 13:12:52 +02:00
Eren Gölge c7ff175592 revert logging.info to print statements for trainer 2021-06-22 13:12:52 +02:00
Eren Gölge fd6afe5ae5 update `setup_model.py` imports 2021-06-22 13:12:52 +02:00
Eren Gölge c82d91051d update align_tts.py model for the trainer 2021-06-22 13:12:52 +02:00
Eren Gölge 4f66e816d1 update align_tts_loss for trainer 2021-06-22 13:12:52 +02:00
Eren Gölge 8213ad8b5f update aling_tts_config for the trainer 2021-06-22 13:12:52 +02:00
Eren Gölge 8dfd4c91ff update trainer.py for better logging handling, restoring models and
rename init_ functions with get_
2021-06-22 13:12:52 +02:00
Eren Gölge fb9289d365 update `synthesis.py` for being more generic 2021-06-22 13:12:52 +02:00
Eren Gölge f121b0ff5d update `speedy_speech.py` model for trainer 2021-06-22 13:12:52 +02:00
Eren Gölge 843b3ba960 update `speedy_speecy_config.py` for the trainer 2021-06-22 13:12:52 +02:00
Eren Gölge c9790bee2c update tacotron model to return `model_outputs` 2021-06-22 13:12:52 +02:00
Eren Gölge f09ec7e3a7 update glow-tts for the trainer 2021-06-22 13:12:52 +02:00
Eren Gölge 3346a6d9dc update `sequence_mask` import globally 2021-06-22 13:12:52 +02:00
Eren Gölge 9765b1aa6b update `glow_tts_config.py` for setting the optimizer and the scheduler 2021-06-22 13:12:52 +02:00
Eren Gölge 6bf6543df8 typing annotation for the trainer 2021-06-22 13:12:52 +02:00
Eren Gölge 57cdddef16 add trainer and train_tts 2021-06-22 13:12:52 +02:00
Eren Gölge d769af9e3b remove `truncated` from synthesizer 2021-06-22 13:12:52 +02:00
Eren Gölge 570633ab80 update console logger 2021-06-22 13:12:52 +02:00
Eren Gölge 2ac6b824ca update `synthesis.py` for the trainer 2021-06-22 13:12:52 +02:00
Eren Gölge c9e5527070 remove `tts.generic_utils` as all the functions are moved to other files 2021-06-22 13:12:52 +02:00
Eren Gölge 2ab723cd10 update Tacotron models for the trainer 2021-06-22 13:12:52 +02:00
Eren Gölge d6b6a15b5c add `gradual_training` argument to tacotron.py 2021-06-22 13:12:52 +02:00
Eren Gölge 118a7f2b43 import missings for tacotron.py 2021-06-22 13:12:52 +02:00
Eren Gölge c98149d488 mode `setup_model()` to `models/__init__.py` 2021-06-22 13:12:52 +02:00
Eren Gölge 86edf6ab0e add sequence_mask to `utils.data` 2021-06-22 13:12:52 +02:00
Eren Gölge c61486b1e3 `setup_loss()` in `layer/__init__.py` 2021-06-22 13:12:52 +02:00
Eren Gölge f07209d2e0 rename preprocess.py -> formatters.py 2021-06-22 13:12:52 +02:00
Eren Gölge facb782851 move load_meta_data and related functions to `datasets/__init__.py` 2021-06-22 13:12:52 +02:00
Eren Gölge b9d4355d20 set test_sentences in config 2021-06-22 13:12:52 +02:00
Eren Gölge 7bdd0eb72f trainer-API updates 2021-06-22 13:12:52 +02:00
Eren Gölge 0f284841d1 rename MyDataset -> TTSDataset 2021-06-22 13:12:52 +02:00
Edresson 99d40e98d9 fix Lint checks 2021-06-18 14:59:01 -03:00
Edresson 28bec238ca fix Lint checks 2021-06-18 14:33:50 -03:00
Edresson 83644056e3 fix Lint checks 2021-06-18 14:32:28 -03:00
Edresson Casanova e78e3cd81e
Merge branch 'dev' into dev 2021-06-18 14:10:03 -03:00
Edresson b74b510d3c Compute embeddings and find characters using config file 2021-06-18 14:04:49 -03:00
Adam Froghyar b0aa189348 Forcing do_trim_silence to False in the extract TTS script 2021-06-14 10:44:00 +02:00
Eren Gölge d245b5d48f bump up v0.0.15.1 2021-06-08 09:21:01 +02:00
Edresson 14b209c7e9 Create a batch for more fast inference on LSTM Speaker Encoder 2021-06-05 03:12:17 -03:00
Eren Gölge b8b79a5e5a fix `use_cuda` bug in `server.py` 2021-06-04 14:02:53 +02:00
Eren Gölge 203ab855c3 bump up to v0.0.15 2021-06-04 13:52:54 +02:00
Eren Gölge ba9bcf7c6b auto upload to pypi on release 2021-06-04 12:20:06 +02:00
Eren Gölge e66753bd0d fixup! new japanese model placeholder in `.models.json` 2021-06-03 18:04:28 +02:00
Eren Gölge bd434636a9 new japanese model placeholder in `.models.json` 2021-06-02 15:54:37 +02:00
Eren Gölge 401fbd8978 bump up to v0.0.15 2021-06-02 11:48:17 +02:00
Eren Gölge 49c5e5d820 maket style japanese PR 2021-06-02 11:44:46 +02:00
Eren Gölge 73b4083c6c
Merge pull request #502 from kaiidams/kaiidams/kokoro
Japanese Tacotron 2 model
2021-06-02 10:20:08 +02:00
Katsuya Iida 6d8310d2a9 Set the version to the same with the dev branch. 2021-06-02 07:48:28 +09:00
Alexander Korolev c1eb9bdcca
fix speaker dim inference 2021-06-01 15:15:26 +02:00
Katsuya Iida 1cc18d1972 Move unittest of Japanese phonemizer. 2021-06-01 18:51:34 +09:00
Alexander Korolev 5b89ef2c6e
fix speaker-embeddings dimension during inference 2021-06-01 11:06:35 +02:00
Eren Gölge d0ab0382fc linter fixes 2021-06-01 09:15:32 +02:00
Eren Gölge bec85ac58d make style 2021-05-31 16:37:15 +02:00
Eren Gölge d9f1268f99 init tb_logger None for rank > 0 processes 2021-05-31 15:47:07 +02:00
Eren Gölge 301c516abd Merge branch 'dev' of https://github.com/coqui-ai/TTS into dev 2021-05-31 15:46:25 +02:00
Edresson 7448177b72 use SpeakerManager on compute embeddings script 2021-05-29 21:11:53 -03:00
Katsuya Iida c4a5a73f18 update Kokoro config 2021-05-29 19:17:27 +09:00
Katsuya Iida 3a9ac2de4a Merge remote-tracking branch 'coqui-ai/main' into kaiidams/kokoro 2021-05-29 09:39:23 +09:00
Katsuya Iida d0c9c1ca5c Move TTS/tts/utils/japanese 2021-05-29 09:21:47 +09:00
Edresson 099142d4dd bug fix 2021-05-27 21:50:56 -03:00
Edresson 208bb0f0ee add batched speaker encoder inference 2021-05-27 20:01:00 -03:00
Edresson 825734a3a9 remove unused embeddings export 2021-05-27 19:10:24 -03:00
Katsuya Iida c4987e9d4e Move import at the head of the file. 2021-05-28 00:22:57 +09:00
Eren Gölge 925c08cf95 replace unidecode with anyascii 2021-05-27 14:02:44 +02:00
Eren Gölge e08c58db3b bump up version to v0.14.1 2021-05-27 13:11:01 +02:00
Eren Gölge c6f22aaa67 fix #509 2021-05-27 13:09:15 +02:00
Edresson 1496f271dc update Compute embeddings script 2021-05-27 00:45:18 -03:00
Edresson bc5307caa0 add unit tests for SoftmaxAngleProtoLoss and ResnetSpeakerEncoder and bugfix 2021-05-26 20:35:58 -03:00
Edresson c90037c2e9 solve merge problems 2021-05-26 16:01:30 -03:00
Katsuya Iida f921a05bdb Fixed lint errors 2021-05-26 19:02:16 +09:00
Edresson Casanova f89cb6aec2
Merge branch 'dev' into dev 2021-05-25 17:30:25 -03:00
Edresson d570c2d790 pylint fix and data loader bug fix 2021-05-26 01:11:37 -03:00
Katsuya Iida 0536aa6d0f Japanese Tacotron 2 model 2021-05-22 17:12:19 +09:00
Eren Gölge 5482a0f62d type def for gradual_training 2021-05-19 14:03:26 +02:00
Eren Gölge df6a98d0c3 type def for gradual_training 2021-05-19 14:00:44 +02:00
Eren Gölge 16576d6408 bump version number 2021-05-19 12:35:10 +02:00
Eren Gölge 8a7c40736c set use_phonemes false 2021-05-19 01:27:26 +02:00
Eren Gölge ccfaa6b1d5 add `needs_phonemizer` field to models.json. If set true these models
are only compatible with v0.0.13 or below.
2021-05-18 17:57:28 +02:00
Eren Gölge a14fcf2a13 remove text_processing test 2021-05-18 17:57:28 +02:00
Eren Gölge d7fae3f515 remove all espeaker and phonemizer deps 2021-05-18 17:57:28 +02:00
Eren Gölge ced05e812a move chinese phonemizer 2021-05-18 17:57:28 +02:00
Eren Gölge 218af1d9a2 change `list` to `List` in config 2021-05-18 17:30:27 +02:00
Eren Gölge 4df31f7fbd unused_speakers argument for ignoring speaker ids in multi-speaker
training
2021-05-18 14:50:03 +02:00
Eren Gölge c2c7dff805 use relaxted coqpit parser 2021-05-18 14:49:47 +02:00
Edresson 856ea19758 bug fix in dataloader and update inference 2021-05-18 03:43:16 -03:00
Eren Gölge d1b469935d tacotron DDC LJSpeech recipe 2021-05-17 11:42:14 +02:00
Eren Gölge 34a42d379f update tacotron_config.py for checking `r` and the docstring 2021-05-17 11:35:30 +02:00
Eren Gölge 12722501bb styling 2021-05-15 23:48:31 +02:00
Eren Gölge 8b1014d188 add docstrings with default value fixes 2021-05-15 23:45:10 +02:00
Eren Gölge da49089a72 update melgan training test batch size 2021-05-12 10:12:11 +02:00
Edresson 3433c2f348 add compute embedding for the new speaker encoder 2021-05-12 03:06:46 -03:00
Eren Gölge 0213e1cbf4 update configs for tts models to match the field typed with the expected
values
2021-05-12 00:57:38 +02:00
Eren Gölge 715b0a65a0 update main.yml for python x64
fix test
2021-05-12 00:57:29 +02:00
Edresson 3fcc748b2e implement the Speaker Encoder H/ASP 2021-05-11 16:27:05 -03:00
Eren Gölge 843d1b3d98 linter fixes 2021-05-11 11:30:00 +02:00
Eren Gölge 19fb1d743d style update 2021-05-11 11:30:00 +02:00
Eren Gölge 6e980b49c4 fix synthesizer.py for Coqpit 2021-05-11 11:29:18 +02:00
Eren Gölge db14dcd95a remove old load_config 2021-05-11 11:29:18 +02:00
Eren Gölge a21ac883dd add get_cuda() 2021-05-11 11:29:18 +02:00
Eren Gölge 21dd4d7960 fix load_config imports for Coqpit 2021-05-11 11:29:18 +02:00
Eren Gölge c57f0b46bb reintro use_gst for backwars compat 2021-05-11 11:29:18 +02:00
Eren Gölge 18e76a2309 fix speaker encoder model initialization 2021-05-11 11:29:18 +02:00
Eren Gölge 10de40bba1 make num_workers mandatory config field 2021-05-11 11:29:18 +02:00
Eren Gölge df1ddd3539 allow read_json_with_comments for backward compat 2021-05-11 11:29:18 +02:00
Eren Gölge 9f7599e3c3 fix train_encoder for coqpit 2021-05-11 11:29:18 +02:00
Eren Gölge f8e52965dd add speaker encoder coqpit 2021-05-11 11:29:18 +02:00
Eren Gölge ce2bba543e remove extra from utils and move funcs to io.py 2021-05-11 11:29:18 +02:00
Eren Gölge 812dbc2b06 rm config.json 2021-05-11 11:29:18 +02:00
Eren Gölge 3fde2001b1 train_encoder refactoring for coqpit 2021-05-11 11:29:18 +02:00
Eren Gölge 9ee70af9bb code styling 2021-05-11 11:29:18 +02:00
Eren Gölge 10db2baa06 global shared Coqpit configs 2021-05-11 11:29:18 +02:00
Eren Gölge 3dec62b183 add Coqpits for the vocoder models 2021-05-11 11:29:18 +02:00
Eren Gölge 6f4eed94f5 remove *.json vocoder configs 2021-05-11 11:29:18 +02:00
Eren Gölge 78b3825d0b update train scripts for coqpit 2021-05-11 11:29:18 +02:00
Eren Gölge 757e90b1cc load_config function to initialize the right Coqpit for the given model 2021-05-11 11:29:18 +02:00
Eren Gölge e6f45b9eb7 update train_vocoder_gan.py for coqpit 2021-05-11 11:29:18 +02:00
Eren Gölge bcebd69d09 remove bash tts training tests 2021-05-11 11:29:17 +02:00
Eren Gölge 7663bc63c1 add Coqpit configs for the TTS models 2021-05-11 11:29:17 +02:00
Eren Gölge 7227e8f1d2 update train_align_tts.py for coqpit 2021-05-11 11:29:17 +02:00
Eren Gölge 51a7e06945 glow_tts_config.py and train test on python 2021-05-11 11:29:17 +02:00
Eren Gölge 720fe13056 update glow_tts modules and training script for coqpit use 2021-05-11 11:29:17 +02:00
Eren Gölge 816e7ee698 remove default configs.json as replacing with Coqpit configs 2021-05-11 11:29:17 +02:00
Eren Gölge 35341d5482 move bash script based tests to python with coqpit 2021-05-11 11:29:17 +02:00
Eren Gölge 647163397d coqpit refactoring 2021-05-11 11:29:17 +02:00
Eren Gölge eaa130e813 fix tacotron for coqpit 2021-05-11 11:29:17 +02:00
Eren Gölge 65d7ad4250 refactor train_speedy_speech.py for coqpit 2021-05-11 11:29:17 +02:00
Eren Gölge 4a58fdfd59 comment out check-arguments before copying fields to the configs 2021-05-11 11:29:17 +02:00
Eren Gölge 05d9543ed8 init GST module using gst config in Tacotron models 2021-05-11 11:29:17 +02:00
Eren Gölge 93a00373f6 move split_dataset 2021-05-11 11:29:17 +02:00