Gorkem
78a596618a
Fix for exception on streaming if last chunk empty ( #3160 )
2023-11-08 11:32:02 +01:00
Edresson Casanova
09fb317e6d
Remove unused code
2023-11-06 17:36:32 -03:00
Edresson Casanova
1b6f8d0e46
Update unit tests and recipes
2023-11-06 20:25:06 +01:00
Edresson Casanova
72b2bac0f8
Load reference in 24khz to avoid issued with multiple sr references
2023-11-06 20:25:06 +01:00
Edresson Casanova
00294ffdf6
Update XTTS docs
2023-11-06 20:24:06 +01:00
Edresson Casanova
459ad70dc8
Add support for multiples speaker references on XTTS inference
2023-11-06 20:22:35 +01:00
Eren Gölge
f0cb19ecca
Drop diffusion from XTTS ( #3150 )
...
* Drop diffusion for XTTS
* Make style
* Drop diffusion deps in code
* Restore thrashed
2023-11-06 20:15:49 +01:00
Eren G??lge
5d418bb84a
Update docs
2023-11-06 18:48:41 +01:00
Eren G??lge
9bbf6eb8dd
Drop use_ne_hifigan
2023-11-06 18:43:38 +01:00
Eren G??lge
9d54bd7655
Fixup XTTS
2023-11-06 18:13:58 +01:00
Edresson Casanova
e45227d9ff
XTTS v2.0 ( #3137 )
...
* Implement most similar ref training approach
* Use non-enhanced hifigan for test samples
* Add Perceiver
* Update GPT Trainer for perceiver support
* Update XTTS docs
* Bug fix masking with XTTS perceiver
* Bug fix on gpt forward
* Bug Fix on XTTS v2.0 training
* Add XTTS v2.0 unit tests
* Add XTTS v2.0 inference unit tests
* Bug Fix on diffusion inference
* Add XTTS v2.0 training recipe
* Placeholder model entry
* Add cloning params to config
* Make prompt embedding configurable
* Make cloning configurable
* Cheap fix for a cheaper fix
* Prevent resampling
* Update model entry
* Update docs
* Update requirements
* Code linting
* Add xtts v2 to sep tests
* Bug fix on XTTS get_gpt_cond_latents
* Bug fix on rebase
* Make style
* Bug fix in Japenese tokenizer
* Add num2words to deps
* Remove unused kwarg and added num_beams=1 as default
---------
Co-authored-by: Eren G??lge <egolge@coqui.ai>
2023-11-06 14:58:18 +01:00
Aarni Koskela
38f6f8f0bb
Run `make style` & re-enable it in CI ( #3127 )
2023-11-06 11:36:37 +01:00
WeberJulian
1c98821359
Remove unused load_audio function
2023-10-27 22:27:18 +02:00
WeberJulian
d4e08c8d6c
Add features to get_conditioning_latents
2023-10-26 14:57:33 +02:00
WeberJulian
c1133724a1
Move lang token add to tokenizer
2023-10-26 14:52:13 +02:00
WeberJulian
6fa46d197d
Fix get_conditioning_latents when using only ne
2023-10-26 14:51:35 +02:00
Edresson Casanova
0f96abb5ec
Add FT inference example on XTTS docs
2023-10-23 13:23:30 -03:00
Edresson Casanova
37b7945474
Update XTTS train not implemented error to point to the XTTS docs
2023-10-23 11:39:17 -03:00
Edresson Casanova
affaf11148
Add XTTS training unit test
2023-10-21 13:41:12 -03:00
Edresson Casanova
5f98dbeec9
Update Ljspeech XTTS recipe
2023-10-21 13:37:21 -03:00
Edresson Casanova
9e3598c3b7
Bug Fix on inference using XTTS trainer checkpoint
2023-10-21 13:37:21 -03:00
Edresson Casanova
c4ceaabe2c
Add test sentences during the training
2023-10-21 13:33:56 -03:00
Edresson Casanova
59576fc0ec
Bug fix on XTTS v1.1 inference ( #3093 )
...
* Bug fix on XTTS v1.1 inference
* Update .models.json
---------
Co-authored-by: Julian Weber <julian.weber@hotmail.fr>
2023-10-20 17:29:43 -03:00
Julian Weber
cf97116185
XTTS v1.1 ( #3089 )
...
* Add support for ne_hifigan
* Update model.json
* Update hash
* Fix model loading
* Enhance text_normalization
* Add xtts to zoo test exception
* Add model hash check
* Add get_number_tokens
2023-10-20 16:02:08 +02:00
Aya Jafari
ffddf10458
unit test fix
2023-10-13 10:56:47 -03:00
Aya Jafari
6eaecab0ca
fixed bugs in fastpitch tts synthesis
2023-10-10 23:02:31 -03:00
Julian Weber
e5e0cbffc9
Streaming inference for XTTS 🚀 ( #3035 )
2023-10-06 18:34:06 +02:00
Aarni Koskela
33a7c722f6
Merge duplicate on_train_step_start functions in delightful_tts
2023-09-27 01:10:44 +03:00
Aarni Koskela
861c68b0b8
Rename misnamed setter
2023-09-27 01:09:59 +03:00
loupzeur
da8b6bbce1
fix: xtts not taking into account device flag ( #2951 )
...
* fix: xtts not taking into account device flag
* Style changes
---------
Co-authored-by: Julian Weber <julian.weber@hotmail.fr>
2023-09-20 09:57:02 +02:00
Eren Gölge
4033db5f4b
🔥 XTTS implementation
2023-09-13 17:51:24 +02:00
Eren Gölge
a7a96d08dd
Fix loading Bark ( #2893 )
...
* Fixup hubert path
* Make style
2023-08-26 11:59:00 +02:00
Eren Gölge
3a104d5c49
Update Studio API for XTTS ( #2861 )
...
* Update Studio API for XTTS
* Update the docs
* Update README.md
* Update README.md
Update README
2023-08-13 12:04:12 +02:00
Eren G??lge
37b558ccb9
Make style
2023-08-11 12:55:23 +02:00
Javier
4e7f8cd021
Add fairseq onnx support and strict configuration, fixes some onnx errors ( #2831 )
2023-08-04 11:02:59 +02:00
Eren Gölge
69f080eb47
Fix DelightfulTTS ( #2823 )
...
* Fix tests
* Make style
2023-07-31 13:52:45 +02:00
Eren Gölge
483888b9d8
Add kwargs to ignore extra arguments w/o error ( #2822 )
2023-07-31 11:37:35 +02:00
Javier
c140df5a58
Adds multi-language support for VITS onnx, fixes onnx inference error when speaker_id is None or not passed, fixes onnx exporting for models with init_discriminator=false ( #2816 )
2023-07-31 10:19:49 +02:00
Eren Gölge
8aacb81849
Fix Tortoise load ( #2791 )
...
* Remove key prunning in tortoise
* Make lint
2023-07-24 13:42:47 +02:00
logan hart
6fdb88f8e2
Add Delightful-TTS implementation ( #2095 )
...
* add configs
* Update config file
* Add model configs
* Add model layers
* Add layer files
* Add layer modules
* change config names
* Add emotion manager
* fIX missing ap bug
* Fix missing ap bug
* Add base TTS e2e class
* Fix wrong variable name in load_tts_samples
* Add training script
* Remove range predictor and gaussian upsampling
* Add helper function
* Add vctk recipe
* Add conformer docs
* Fix linting in conformer.py
* Add Docs
* remove duplicate import
* refactor args
* Fix bugs
* Removew emotion embedding
* remove unused arg
* Remove emotion embedding arg
* Remove emotion embedding arg
* fix style issues
* Fix bugs
* Fix bugs
* Add unittests
* make style
* fix formatter bug
* fix test
* Add pyworld compute pitch func
* Update requirments.txt
* Fix dataset Bug
* Chnge layer norm to instance norm
* Add missing import
* Remove emotions.py
* remove ssim loss
* Add init layers func to aligner
* refactor model layers
* remove audio_config arg
* Rename loss func
* Rename to delightful-tts
* Rename loss func
* Remove unused modules
* refactor imports
* replace audio config with audio processor
* Add change sample rate option
* remove broken resample func
* update recipe
* fix style, add config docs
* fix tests and multispeaker embd dim
* remove pyworld
* Make style and fix inference
* Split tts tests
* Fixup
* Fixup
* Fixup
* Add argument names
* Set "random" speaker in the model Tortoise/Bark
* Use a diff f0_cache path for delightfull tts
* Fix delightful speaker handling
* Fix lint
* Make style
---------
Co-authored-by: loganhart420 <loganartpersonal@gmail.com>
Co-authored-by: Eren Gölge <erogol@hotmail.com>
2023-07-24 13:41:26 +02:00
Eren Gölge
a2984fb435
Fix #2745 ( #2748 )
2023-07-07 20:23:27 +02:00
Eren Gölge
7b5c8422c8
Export multispeaker onnx ( #2743 )
2023-07-06 13:36:50 +02:00
ZhouGongZaiShi
d5f16d77c2
delete meaningless print() ( #2662 )
2023-07-04 11:38:17 +02:00
Eren G??lge
cb9c320691
Fixup
2023-06-30 14:13:11 +02:00
Eren Gölge
4cf8652392
Fix Tortoise load ( #2697 )
...
* Handle missing gpt weights
* Make style
* Fix lint
2023-06-21 15:42:01 +02:00
Eren Gölge
e785d101a1
Port Fairseq TTS models ( #2628 )
...
* Load fairseq models
* Add docs and missing files
* Managing fairseq models and docs for API
* Make style
* Use scarf URL
* Add tests
* Fix URL
* Pass cpu
* Make lint
* Fixup
* Make lint
* fixup
* Fixup
* Change tokenization order
* Update README
* Fixup
* Fixup
2023-06-05 11:15:13 +02:00
Eren Gölge
9e99e0f42d
Disable reduction
2023-05-18 11:12:51 +02:00
Eren Gölge
4de797bb11
Draft ONNX export for VITS ( #2563 )
...
* Draft ONNX export for VITS
Could not get it work to output variable length sequence
* Fixup for onnx constant output
* Make style
* Remove commented code
2023-05-16 01:07:56 +02:00
manmay nakhashi
a3d5801c44
Tortoise TTS inference ( #2547 )
...
* initial commit
* Tortoise inference
* revert path change
* style fix
* remove accidental remove
* style fixes
* style fixes
* removed unwanted assests and deps
* remove changes
* remove cvvp
* style fix black
* added tortoise config and updated config and args, refactoring the code
* added tortoise to api
* Pull mel_norm from url
* Use TTS cleaners
* Let download model files
* add ability to pass tortoise presets through coqui api
* fix tests
* fix style and tests
* fix tts commandline for tortoise
* Add config.json to tortoise
* Use kwargs
* Use regular model api for loading tortoise
* Add load from dir to synthesizer
* Fix Tortoise floats
* Use model_dir when there are multiple urls
* Use `synthesize` when exists
* lint fixes and resolve preset bug
* resolve a download bug and update model link
* fix json
* do tortoise inference from voice dir
* fix
* fix test
* fix speaker id and remove assests
* update inference_tests.yml
* replace inference_test.yml
* fix extra dir as None
* fix tests
* remove space
* Reformat docstring
* Add docs
* Update docs
* lint fixes
---------
Co-authored-by: Eren Gölge <egolge@coqui.ai>
Co-authored-by: Eren Gölge <erogol@hotmail.com>
2023-05-16 00:58:21 +02:00
Matthew Boakes
4c829e74a1
Update Librosa Version To V0.10.0
2023-04-05 00:59:20 +01:00