Commit Graph

4675 Commits

Author SHA1 Message Date
Eren Gölge f0cb19ecca
Drop diffusion from XTTS (#3150)
* Drop diffusion for XTTS

* Make style

* Drop diffusion deps in code

* Restore thrashed
2023-11-06 20:15:49 +01:00
Eren G??lge 5d418bb84a Update docs 2023-11-06 18:48:41 +01:00
Eren G??lge 9bbf6eb8dd Drop use_ne_hifigan 2023-11-06 18:43:38 +01:00
Eren G??lge 9d54bd7655 Fixup XTTS 2023-11-06 18:13:58 +01:00
Eren Gölge c713a839da
Update VERSION 2023-11-06 15:51:56 +01:00
Eren Gölge 7eedfc67da
Update README.md 2023-11-06 15:37:32 +01:00
Edresson Casanova e45227d9ff
XTTS v2.0 (#3137)
* Implement most similar ref training approach

* Use non-enhanced hifigan for test samples

* Add Perceiver

* Update GPT Trainer for perceiver support

* Update XTTS docs

* Bug fix masking with XTTS perceiver

* Bug fix on gpt forward

* Bug Fix on XTTS v2.0 training

* Add XTTS v2.0 unit tests

* Add XTTS v2.0 inference unit tests

* Bug Fix on diffusion inference

* Add XTTS v2.0 training recipe

* Placeholder model entry

* Add cloning params to config

* Make prompt embedding configurable

* Make cloning configurable

* Cheap fix for a cheaper fix

* Prevent resampling

* Update model entry

* Update docs

* Update requirements

* Code linting

* Add xtts v2 to sep tests

* Bug fix on XTTS get_gpt_cond_latents

* Bug fix on rebase

* Make style

* Bug fix in Japenese tokenizer

* Add num2words to deps

* Remove unused kwarg and added num_beams=1 as default

---------

Co-authored-by: Eren G??lge <egolge@coqui.ai>
2023-11-06 14:58:18 +01:00
Aarni Koskela 38f6f8f0bb
Run `make style` & re-enable it in CI (#3127) 2023-11-06 11:36:37 +01:00
Aarni Koskela 5ae369d629 Move FreeVCConfig to TTS.vc.configs (like all other config classes) 2023-10-31 16:56:25 +02:00
Eren Gölge 6fef4f9067
Bump up to v0.19.1 2023-10-30 10:37:28 +01:00
Eren Gölge eccc94be9b
Merge pull request #2983 from vltmedia/dev
Bug: self.model_name needed to be initialized.
2023-10-28 10:39:25 +02:00
Eren Gölge 2d6bd716ef
Merge pull request #3109 from coqui-ai/tts_3067
fix for issue 3067
2023-10-28 10:37:52 +02:00
Eren Gölge 788959d720
Merge pull request #3103 from coqui-ai/fix_xttsv1.1_again
Second round of issue fixing for XTTS v1.1
2023-10-28 10:33:19 +02:00
WeberJulian 1c98821359 Remove unused load_audio function 2023-10-27 22:27:18 +02:00
Aya Jafari 041b4b6723 fix for issue 3067 2023-10-26 13:06:01 -03:00
WeberJulian d4e08c8d6c Add features to get_conditioning_latents 2023-10-26 14:57:33 +02:00
WeberJulian c1133724a1 Move lang token add to tokenizer 2023-10-26 14:52:13 +02:00
WeberJulian 6fa46d197d Fix get_conditioning_latents when using only ne 2023-10-26 14:51:35 +02:00
Eren Gölge edd3a28723
Bump up to v0.19.0 2023-10-25 13:29:38 +02:00
Eren Gölge 16ba377f61
Merge pull request #3086 from coqui-ai/xtts_trainer
XTTS v1.1 GPT Trainer
2023-10-25 13:28:47 +02:00
Edresson Casanova 01839af926 Bug fix on XTTS masking training 2023-10-24 18:30:14 -03:00
Edresson Casanova 8af3d2dbcd Add a dedicated workflow for XTTS tests 2023-10-24 09:52:44 -03:00
VLT Media 818aa0eb7e
Merge branch 'coqui-ai:dev' into dev 2023-10-23 23:36:33 -04:00
Edresson Casanova de1d521c8a Update XTTS docs 2023-10-23 13:35:15 -03:00
Edresson Casanova 0f96abb5ec Add FT inference example on XTTS docs 2023-10-23 13:23:30 -03:00
Edresson Casanova 67ca70aff4 Fix Delightful TTS layers unit test 2023-10-23 11:47:10 -03:00
Edresson Casanova 37b7945474 Update XTTS train not implemented error to point to the XTTS docs 2023-10-23 11:39:17 -03:00
Edresson Casanova 1ee8096799 Update XTTS docs 2023-10-23 11:13:09 -03:00
Edresson Casanova 6fefc36e5a Update XTTS docs 2023-10-23 11:03:57 -03:00
Edresson Casanova 8853e1c3ec Update XTTS recipe to only download checkpoint if it is needed 2023-10-23 10:45:41 -03:00
Edresson Casanova 653f2e75ef Update xtts trainer recipe 2023-10-23 09:58:16 -03:00
Edresson Casanova e8a1a50273 Remove unused vars in Delightful TTS layers tests 2023-10-23 09:26:36 -03:00
Edresson Casanova ec7f54768a Rebase bug fix and update recipe 2023-10-21 17:37:51 -03:00
Edresson Casanova affaf11148 Add XTTS training unit test 2023-10-21 13:41:12 -03:00
Edresson Casanova 1f92741d6a Fix issue #2971 2023-10-21 13:37:21 -03:00
Edresson Casanova 94dcf84979 Rename XTTS recipe 2023-10-21 13:37:21 -03:00
Edresson Casanova 5f98dbeec9 Update Ljspeech XTTS recipe 2023-10-21 13:37:21 -03:00
Edresson Casanova 469d624615 Update LJspeech XTTS recipe 2023-10-21 13:37:21 -03:00
Edresson Casanova 9e3598c3b7 Bug Fix on inference using XTTS trainer checkpoint 2023-10-21 13:37:21 -03:00
Edresson Casanova c4ceaabe2c Add test sentences during the training 2023-10-21 13:33:56 -03:00
Edresson Casanova 2f868dd5c2 Bug fix on reproducible evaluation 2023-10-21 13:33:56 -03:00
Edresson Casanova bafab049c2 Add prompting masking 2023-10-21 13:33:56 -03:00
Edresson Casanova 47d613df3a Add reproducible evaluation 2023-10-21 13:33:56 -03:00
Edresson Casanova 40a4e631ea Update mel spectrogram for the style encoder 2023-10-21 13:33:56 -03:00
Edresson Casanova a32961bcb4 Add XTTS base training code 2023-10-21 13:33:56 -03:00
Eren Gölge 1e152692ed
Bump up to v0.18.2 2023-10-21 17:29:53 +02:00
Eren Gölge 420a90ed63
Merge pull request #3096 from coqui-ai/fix-xtts-v1.1
Fix xtts v1.1
2023-10-21 17:28:58 +02:00
Julian Weber dad6a7b0b6
Preserve [ja] token of the text processing 2023-10-21 11:26:03 +02:00
Julian Weber c7a16042e3
Remove global cutlet import 2023-10-21 11:18:58 +02:00
Edresson Casanova 414f0de0a1
Bump up to v0.18.1 2023-10-20 17:30:58 -03:00