coqui-tts/tests/tts_tests
a-froghyar 8be21ec387
Capacitron (#977)
* new CI config

* initial Capacitron implementation

* delete old unused file

* fix empty formatting changes

* update losses and training script

* fix previous commit

* fix commit

* Add Capacitron test and first round of test fixes

* revert formatter change

* add changes to the synthesizer

* add stepwise gradual lr scheduler and changes to the recipe

* add inference script for dev use

* feat: add posterior inference arguments to synth methods
- added reference wav and text args for posterior inference
- some formatting

* fix: add espeak flag to base_tts and dataset APIs
- use_espeak_phonemes flag was not implemented in those APIs
- espeak is now able to be utilised for phoneme generation
- necessary phonemizer for the Capacitron model

* chore: update training script and style
- training script includes the espeak flag and other hyperparams
- made style

* chore: fix linting

* feat: add Tacotron 2 support

* leftover from dev

* chore: rename parser args

* feat: extract optimizers
- created a separate optimizer class to merge the two optimizers

* chore: revert arbitrary trainer changes

* fmt: revert formatting bug

* formatting again

* formatting fixed

* fix: log func

* fix: update optimizer
- Implemented load_state_dict for continuing training

* fix: clean optimizer init for standard models

* improvement: purge espeak flags and add training scripts

* Delete capacitronT2.py

delete old training script, new one is pushed

* feat: capacitron trainer methods
- extracted capacitron specific training operations from the trainer into custom
methods in taco1 and taco2 models

* chore: renaming and merging capacitron and gst style args

* fix: bug fixes from the previous commit

* fix: implement state_dict method on CapacitronOptimizer

* fix: call method

* fix: inference naming

* Delete train_capacitron.py

* fix: synthesize

* feat: update tests

* chore: fix style

* Delete capacitron_inference.py

* fix: fix train tts t2 capacitron tests

* fix: double forward in T2 train step

* fix: double forward in T1 train step

* fix: run make style

* fix: remove unused import

* fix: test for T1 capacitron

* fix: make lint

* feat: add blizzard2013 recipes

* make style

* fix: update recipes

* chore: make style

* Plot test sentences in Tacotron

* chore: make style and fix import

* fix: call forward first before problematic floordiv op

* fix: update recipes

* feat: add min_audio_len to recipes

* aux_input["style_mel"]

* chore: make style

* Make capacitron T2 recipe more stable

* Remove T1 capacitron Ljspeech

* feat: implement new grad clipping routine and update configs

* make style

* Add pretrained checkpoints

* Add default vocoder

* Change trainer package

* Fix grad clip issue for tacotron

* Fix scheduler issue with tacotron

Co-authored-by: Eren Gölge <egolge@coqui.ai>
Co-authored-by: WeberJulian <julian.weber@hotmail.fr>
Co-authored-by: Eren Gölge <erogol@hotmail.com>
2022-05-20 16:17:11 +02:00
| File | Last commit | Date |
| --- | --- | --- |
| __init__.py | move bash script based tests to python with coqpit | 2021-05-11 11:29:17 +02:00 |
| test_align_tts_train.py | Enforce phonemizer definition for synthesis (#1441) | 2022-03-25 23:15:33 +01:00 |
| test_fast_pitch_speaker_emb_train.py | Enforce phonemizer definition for synthesis (#1441) | 2022-03-25 23:15:33 +01:00 |
| test_fast_pitch_train.py | Enforce phonemizer definition for synthesis (#1441) | 2022-03-25 23:15:33 +01:00 |
| test_feed_forward_layers.py | Fix imports | 2021-09-08 13:34:40 +00:00 |
| test_forward_tts.py | Remove SpeedySpeech from .models.json | 2021-09-10 17:47:27 +00:00 |
| test_glow_tts.py | Add EmbeddingManager and BaseIDManager (#1374) | 2022-03-31 13:41:16 +02:00 |
| test_glow_tts_d-vectors_train.py | Enforce phonemizer definition for synthesis (#1441) | 2022-03-25 23:15:33 +01:00 |
| test_glow_tts_speaker_emb_train.py | Enforce phonemizer definition for synthesis (#1441) | 2022-03-25 23:15:33 +01:00 |
| test_glow_tts_train.py | Enforce phonemizer definition for synthesis (#1441) | 2022-03-25 23:15:33 +01:00 |
| test_helpers.py | Make style | 2022-02-25 11:26:59 +01:00 |
| test_speedy_speech_train.py | Enforce phonemizer definition for synthesis (#1441) | 2022-03-25 23:15:33 +01:00 |
| test_tacotron2_d-vectors_train.py | Enforce phonemizer definition for synthesis (#1441) | 2022-03-25 23:15:33 +01:00 |
| test_tacotron2_model.py | Capacitron (#977) | 2022-05-20 16:17:11 +02:00 |
| test_tacotron2_speaker_emb_train.py | Enforce phonemizer definition for synthesis (#1441) | 2022-03-25 23:15:33 +01:00 |
| test_tacotron2_train.py | Enforce phonemizer definition for synthesis (#1441) | 2022-03-25 23:15:33 +01:00 |
| test_tacotron_layers.py | Fix imports | 2021-09-08 13:34:40 +00:00 |
| test_tacotron_model.py | Capacitron (#977) | 2022-05-20 16:17:11 +02:00 |
| test_tacotron_train.py | Make style and lint | 2022-03-02 13:25:35 +01:00 |
| test_vits.py | Fix voice conversion inference (#1583) | 2022-05-20 15:50:25 +02:00 |
| test_vits_d-vectors_train.py | Delete `use_espeak_phonemes` from tests | 2022-02-25 11:18:00 +01:00 |
| test_vits_multilingual_speaker_emb_train.py | Enforce phonemizer definition for synthesis (#1441) | 2022-03-25 23:15:33 +01:00 |
| test_vits_multilingual_train-d_vectors.py | Enforce phonemizer definition for synthesis (#1441) | 2022-03-25 23:15:33 +01:00 |
| test_vits_speaker_emb_train.py | Enforce phonemizer definition for synthesis (#1441) | 2022-03-25 23:15:33 +01:00 |
| test_vits_train.py | Enforce phonemizer definition for synthesis (#1441) | 2022-03-25 23:15:33 +01:00 |