coqui-tts

a-froghyar 8be21ec387 Capacitron (#977)		2022-05-20 16:17:11 +02:00

* new CI config
* initial Capacitron implementation
* delete old unused file
* fix empty formatting changes
* update losses and training script
* fix previous commit
* fix commit
* Add Capacitron test and first round of test fixes
* revert formatter change
* add changes to the synthesizer
* add stepwise gradual lr scheduler and changes to the recipe
* add inference script for dev use
* feat: add posterior inference arguments to synth methods
  - added reference wav and text args for posterior inference
  - some formatting
* fix: add espeak flag to base_tts and dataset APIs
  - use_espeak_phonemes flag was not implemented in those APIs
  - espeak is now able to be utilised for phoneme generation
  - necessary phonemizer for the Capacitron model
* chore: update training script and style
  - training script includes the espeak flag and other hyperparams
  - made style
* chore: fix linting
* feat: add Tacotron 2 support
* leftover from dev
* chore:rename parser args
* feat: extract optimizers
  - created a separate optimizer class to merge the two optimizers
* chore: revert arbitrary trainer changes
* fmt: revert formatting bug
* formatting again
* formatting fixed
* fix: log func
* fix: update optimizer
  - Implemented load_state_dict for continuing training
* fix: clean optimizer init for standard models
* improvement: purge espeak flags and add training scripts
* Delete capacitronT2.py
  delete old training script, new one is pushed
* feat: capacitron trainer methods
  - extracted capacitron specific training operations from the trainer into custom methods in taco1 and taco2 models
* chore: renaming and merging capacitron and gst style args
* fix: bug fixes from the previous commit
* fix: implement state_dict method on CapacitronOptimizer
* fix: call method
* fix: inference naming
* Delete train_capacitron.py
* fix: synthesize
* feat: update tests
* chore: fix style
* Delete capacitron_inference.py
* fix: fix train tts t2 capacitron tests
* fix: double forward in T2 train step
* fix: double forward in T1 train step
* fix: run make style
* fix: remove unused import
* fix: test for T1 capacitron
* fix: make lint
* feat: add blizzard2013 recipes
* make style
* fix: update recipes
* chore: make style
* Plot test sentences in Tacotron
* chore: make style and fix import
* fix: call forward first before problematic floordiv op
* fix: update recipes
* feat: add min_audio_len to recipes
* aux_input["style_mel"]
* chore: make style
* Make capacitron T2 recipe more stable
* Remove T1 capacitron Ljspeech
* feat: implement new grad clipping routine and update configs
* make style
* Add pretrained checkpoints
* Add default vocoder
* Change trainer package
* Fix grad clip issue for tacotron
* Fix scheduler issue with tacotron

Co-authored-by: Eren Gölge <egolge@coqui.ai>
Co-authored-by: WeberJulian <julian.weber@hotmail.fr>
Co-authored-by: Eren Gölge <erogol@hotmail.com>

__init__.py	move bash script based tests to python with coqpit	2021-05-11 11:29:17 +02:00
test_align_tts_train.py	Enforce phonemizer definition for synthesis (#1441)	2022-03-25 23:15:33 +01:00
test_fast_pitch_speaker_emb_train.py	Enforce phonemizer definition for synthesis (#1441)	2022-03-25 23:15:33 +01:00
test_fast_pitch_train.py	Enforce phonemizer definition for synthesis (#1441)	2022-03-25 23:15:33 +01:00
test_feed_forward_layers.py	Fix imports	2021-09-08 13:34:40 +00:00
test_forward_tts.py	Remove SpeedySpeech from .models.json	2021-09-10 17:47:27 +00:00
test_glow_tts.py	Add EmbeddingManager and BaseIDManager (#1374)	2022-03-31 13:41:16 +02:00
test_glow_tts_d-vectors_train.py	Enforce phonemizer definition for synthesis (#1441)	2022-03-25 23:15:33 +01:00
test_glow_tts_speaker_emb_train.py	Enforce phonemizer definition for synthesis (#1441)	2022-03-25 23:15:33 +01:00
test_glow_tts_train.py	Enforce phonemizer definition for synthesis (#1441)	2022-03-25 23:15:33 +01:00
test_helpers.py	Make style	2022-02-25 11:26:59 +01:00
test_speedy_speech_train.py	Enforce phonemizer definition for synthesis (#1441)	2022-03-25 23:15:33 +01:00
test_tacotron2_d-vectors_train.py	Enforce phonemizer definition for synthesis (#1441)	2022-03-25 23:15:33 +01:00
test_tacotron2_model.py	Capacitron (#977)	2022-05-20 16:17:11 +02:00
test_tacotron2_speaker_emb_train.py	Enforce phonemizer definition for synthesis (#1441)	2022-03-25 23:15:33 +01:00
test_tacotron2_train.py	Enforce phonemizer definition for synthesis (#1441)	2022-03-25 23:15:33 +01:00
test_tacotron_layers.py	Fix imports	2021-09-08 13:34:40 +00:00
test_tacotron_model.py	Capacitron (#977)	2022-05-20 16:17:11 +02:00
test_tacotron_train.py	Make style and lint	2022-03-02 13:25:35 +01:00
test_vits.py	Fix voice conversion inference (#1583)	2022-05-20 15:50:25 +02:00
test_vits_d-vectors_train.py	Delete `use_espeak_phonemes` from tests	2022-02-25 11:18:00 +01:00
test_vits_multilingual_speaker_emb_train.py	Enforce phonemizer definition for synthesis (#1441)	2022-03-25 23:15:33 +01:00
test_vits_multilingual_train-d_vectors.py	Enforce phonemizer definition for synthesis (#1441)	2022-03-25 23:15:33 +01:00
test_vits_speaker_emb_train.py	Enforce phonemizer definition for synthesis (#1441)	2022-03-25 23:15:33 +01:00
test_vits_train.py	Enforce phonemizer definition for synthesis (#1441)	2022-03-25 23:15:33 +01:00