coqui-tts/TTS/bin
logan hart 6fdb88f8e2
Add Delightful-TTS implementation (#2095)
* add configs

* Update config file

* Add model configs

* Add model layers

* Add layer files

* Add layer modules

* change config names

* Add emotion manager

* fIX missing ap bug

* Fix missing ap bug

* Add base TTS e2e class

* Fix wrong variable name in load_tts_samples

* Add training script

* Remove range predictor and gaussian upsampling

* Add helper function

* Add vctk recipe

* Add conformer docs

* Fix linting in conformer.py

* Add Docs

* remove duplicate import

* refactor args

* Fix bugs

* Removew emotion embedding

* remove unused arg

* Remove emotion embedding arg

* Remove emotion embedding arg

* fix style issues

* Fix bugs

* Fix bugs

* Add unittests

* make style

* fix formatter bug

* fix test

* Add pyworld compute pitch func

* Update requirments.txt

* Fix dataset Bug

* Chnge layer norm to instance norm

* Add missing import

* Remove emotions.py

* remove ssim loss

* Add init layers func to aligner

* refactor model layers

* remove audio_config arg

* Rename loss func

* Rename to delightful-tts

* Rename loss func

* Remove unused modules

* refactor imports

* replace audio config with audio processor

* Add change sample rate option

* remove broken resample func

* update recipe

* fix style, add config docs

* fix tests and multispeaker embd dim

* remove pyworld

* Make style and fix inference

* Split tts tests

* Fixup

* Fixup

* Fixup

* Add argument names

* Set "random" speaker in the model Tortoise/Bark

* Use a diff f0_cache path for delightfull tts

* Fix delightful speaker handling

* Fix lint

* Make style

---------

Co-authored-by: loganhart420 <loganartpersonal@gmail.com>
Co-authored-by: Eren Gölge <erogol@hotmail.com>
2023-07-24 13:41:26 +02:00
..
__init__.py fixing dome pypi issues 2021-01-25 13:06:12 +01:00
collect_env_info.py Make style and lint 2021-12-01 10:42:52 +00:00
compute_attention_masks.py Update model file extension (#1422) 2022-03-22 17:55:00 +01:00
compute_embeddings.py Update compute_embeddings.py (#2668) 2023-07-04 11:37:47 +02:00
compute_statistics.py Update TTS.tts formatters (#1228) 2022-02-11 23:03:43 +01:00
eval_encoder.py Bump up to v0.11.0 (#2329) 2023-02-08 13:58:49 +01:00
extract_tts_spectrograms.py Bump up to v0.11.0 (#2329) 2023-02-08 13:58:49 +01:00
find_unique_chars.py Open bible dataset formatter (#1365) 2022-03-11 10:43:31 +01:00
find_unique_phonemes.py Fix find unique phonemes script (#1928) 2022-09-08 10:17:35 +02:00
remove_silence_using_vad.py Update VAD for silence trimming. (#2604) 2023-05-11 11:09:23 +02:00
resample.py Bump up to v0.11.0 (#2329) 2023-02-08 13:58:49 +01:00
synthesize.py Add Delightful-TTS implementation (#2095) 2023-07-24 13:41:26 +02:00
train_encoder.py Implement bucketed weighted sampling for VITS (#1871) 2022-08-15 11:08:11 +02:00
train_tts.py Enforce phonemizer definition for synthesis (#1441) 2022-03-25 23:15:33 +01:00
train_vocoder.py Make stlye 2022-02-25 11:31:56 +01:00
tune_wavegrad.py Fix tune wavegrad (#1844) 2022-08-22 09:55:32 +02:00