coqui-tts/TTS/tts/utils
manmay nakhashi a3d5801c44
Tortoise TTS inference (#2547)
* initial commit

* Tortoise inference

* revert path change

* style fix

* remove accidental remove

* style fixes

* style fixes

* removed unwanted assets and deps

* remove changes

* remove cvvp

* style fix black

* added tortoise config and updated config and args, refactoring the code

* added tortoise to api

* Pull mel_norm from url

* Use TTS cleaners

* Let download model files

* add ability to pass tortoise presets through coqui api

* fix tests

* fix style and tests

* fix tts commandline for tortoise

* Add config.json to tortoise

* Use kwargs

* Use regular model api for loading tortoise

* Add load from dir to synthesizer

* Fix Tortoise floats

* Use model_dir when there are multiple urls

* Use `synthesize` when exists

* lint fixes and resolve preset bug

* resolve a download bug and update model link

* fix json

* do tortoise inference from voice dir

* fix

* fix test

* fix speaker id and remove assets

* update inference_tests.yml

* replace inference_test.yml

* fix extra dir as None

* fix tests

* remove space

* Reformat docstring

* Add docs

* Update docs

* lint fixes

---------

Co-authored-by: Eren Gölge <egolge@coqui.ai>
Co-authored-by: Eren Gölge <erogol@hotmail.com>
2023-05-16 00:58:21 +02:00
assets/tortoise  Tortoise TTS inference (#2547)                                 2023-05-16 00:58:21 +02:00
monotonic_align  Fix import statements                                          2021-10-25 19:29:16 +02:00
text             Make lint                                                      2023-04-17 15:02:56 +02:00
__init__.py      rename the project to old TTS                                  2020-09-09 12:27:23 +02:00
data.py          Add audio length sampler balancer (#1561)                      2022-05-12 19:59:19 +02:00
helpers.py       Fix rand_segment edge case (input_len == seg_len - 1)          2022-08-01 11:37:45 +02:00
languages.py     d-vector handling (#1945)                                      2022-09-13 14:10:33 +02:00
managers.py      Cache speaker encoder model (#2284)                            2023-01-23 11:49:51 +01:00
measures.py      Format style of the recipes                                    2021-09-06 15:16:58 +00:00
speakers.py      Fixed bug related to yourtts speaker embeddings issue (#2234)  2023-01-02 14:20:02 +01:00
ssim.py          Implement VitsAudioConfig (#1556)                              2022-07-12 18:49:58 +02:00
synthesis.py     Multilingual tokenizer (#2229)                                 2023-01-02 10:03:19 +01:00
visual.py        Fastspeech2 (#2073)                                            2023-01-15 22:39:22 +01:00