Eren Gölge
8c1a8b522b
Merge pull request #3405 from coqui-ai/studio_speakers
...
Add studio speakers to open source XTTS!
2023-12-12 16:10:09 +01:00
WeberJulian
5cd750ac7e
Fix API and CI
2023-12-11 20:21:53 +01:00
WeberJulian
a5c0d9780f
rename manager
2023-12-11 18:48:31 +01:00
WeberJulian
36143fee26
Add basic speaker manager
2023-12-11 15:25:46 +01:00
Aaron-Li
b6e929696a
support multiple GPU training
2023-12-08 16:55:32 +08:00
Eren Gölge
e49c512d99
Merge pull request #3351 from aaron-lii/chinese-puncs
...
fix pause problem of Chinese speech
2023-12-04 15:57:42 +01:00
Edresson Casanova
5f900f156a
Add XTTS Fine tuning gradio demo ( #3296 )
...
* Add XTTS FT demo data processing pipeline
* Add training and inference columns
* Uses tabs instead of columns
* Fix demo freezing issue
* Update demo
* Convert stereo to mono
* Bug fix on XTTS inference
* Update gradio demo
* Update gradio demo
* Update gradio demo
* Update gradio demo
* Add parameters to be able to set them on Colab demo
* Add error messages
* Add intuitive error messages
* Update
* Add max_audio_length parameter
* Add XTTS fine-tuner docs
* Update XTTS finetuner docs
* Delete trainer to free memory
* Delete unused variables
* Add gc.collect()
* Update xtts.md
---------
Co-authored-by: Eren Gölge <erogol@hotmail.com>
2023-12-01 23:52:23 +01:00
Aaron-Li
7b8808186a
fix pause problem of Chinese speech
2023-12-01 23:30:03 +08:00
Eren Gölge
3b8894a3dd
Make style
2023-11-27 14:15:50 +01:00
Eren Gölge
32065139e7
Simple text cleaner for "hi"
2023-11-24 15:14:34 +01:00
Edresson Casanova
11283fce07
Ensures that only GPT model is in training mode during XTTS GPT training ( #3241 )
...
* Ensures that only GPT model is in training mode during training
* Fix parallel wavegan unit test
2023-11-17 15:13:46 +01:00
Eren Gölge
44880f09ed
Make style
2023-11-17 13:43:34 +01:00
Julian Weber
fbc18b8c34
Fix zh bug ( #3238 )
2023-11-16 17:51:37 +01:00
Julian Weber
675f983550
Add sentence splitting ( #3227 )
...
* Add sentence splitting
* update requirements
* update default args v2
* Add spanish
* Fix return gpt_latents
* Update requirements
* Fix requirements
2023-11-16 11:01:11 +01:00
Edresson Casanova
73a5bd08c0
Fix XTTS GPT padding and inference issues ( #3216 )
...
* Fix end artifact for fine tuning models
* Bug fix on zh-cn inference
* Remove unused code
2023-11-15 14:02:05 +01:00
Eren Gölge
ac3df409a6
Merge pull request #3208 from coqui-ai/fix_max_mel_len
...
fix max generation length for XTTS
2023-11-13 14:32:56 +01:00
WeberJulian
b85536b23f
fix max generation length
2023-11-13 13:18:45 +01:00
Eren Gölge
b2682d39c5
Make style
2023-11-13 13:01:01 +01:00
Aarni Koskela
a8e9163fb3
xtts/tokenizer: merge duplicate implementations of preprocess_text ( #3170 )
...
This was found via ruff:
> F811 Redefinition of unused `preprocess_text` from line 570
2023-11-09 16:32:12 +01:00
Matthew Boakes
1b9c400bca
PyTorch 2.1 Updates (Weight Norm and TorchAudio I/O) ( #3176 )
...
* Replaced PyTorch weight_norm With parametrizations.weight_norm
* TorchAudio: Migrating The I/O Functions To Use The Dispatcher Mechanism
* Corrected Code Style
---------
Co-authored-by: Eren Gölge <erogol@hotmail.com>
2023-11-09 16:31:03 +01:00
Julian Weber
ce1a39a9a4
Add char limit warn ( #3130 )
...
* Add char limit warning
* Adding v2 langs
* cached_property for cutlet
* Fix import
2023-11-08 10:24:23 +01:00
Edresson Casanova
5f9ab6cfaa
Fix style
...
Co-authored-by: Aarni Koskela <akx@iki.fi>
2023-11-06 19:22:34 -03:00
Edresson Casanova
b146de4ce8
Bug fix on XTTS v2.0 Trainer
2023-11-06 20:26:01 +01:00
Edresson Casanova
72b2bac0f8
Load reference in 24 kHz to avoid issues with multiple sr references
2023-11-06 20:25:06 +01:00
Eren Gölge
f0cb19ecca
Drop diffusion from XTTS ( #3150 )
...
* Drop diffusion for XTTS
* Make style
* Drop diffusion deps in code
* Restore thrashed
2023-11-06 20:15:49 +01:00
Edresson Casanova
e45227d9ff
XTTS v2.0 ( #3137 )
...
* Implement most similar ref training approach
* Use non-enhanced hifigan for test samples
* Add Perceiver
* Update GPT Trainer for perceiver support
* Update XTTS docs
* Bug fix masking with XTTS perceiver
* Bug fix on gpt forward
* Bug Fix on XTTS v2.0 training
* Add XTTS v2.0 unit tests
* Add XTTS v2.0 inference unit tests
* Bug Fix on diffusion inference
* Add XTTS v2.0 training recipe
* Placeholder model entry
* Add cloning params to config
* Make prompt embedding configurable
* Make cloning configurable
* Cheap fix for a cheaper fix
* Prevent resampling
* Update model entry
* Update docs
* Update requirements
* Code linting
* Add xtts v2 to sep tests
* Bug fix on XTTS get_gpt_cond_latents
* Bug fix on rebase
* Make style
* Bug fix in Japanese tokenizer
* Add num2words to deps
* Remove unused kwarg and added num_beams=1 as default
---------
Co-authored-by: Eren Gölge <egolge@coqui.ai>
2023-11-06 14:58:18 +01:00
Aarni Koskela
38f6f8f0bb
Run `make style` & re-enable it in CI ( #3127 )
2023-11-06 11:36:37 +01:00
WeberJulian
c1133724a1
Move lang token add to tokenizer
2023-10-26 14:52:13 +02:00
Edresson Casanova
01839af926
Bug fix on XTTS masking training
2023-10-24 18:30:14 -03:00
Edresson Casanova
ec7f54768a
Rebase bug fix and update recipe
2023-10-21 17:37:51 -03:00
Edresson Casanova
affaf11148
Add XTTS training unit test
2023-10-21 13:41:12 -03:00
Edresson Casanova
1f92741d6a
Fix issue #2971
2023-10-21 13:37:21 -03:00
Edresson Casanova
9e3598c3b7
Bug Fix on inference using XTTS trainer checkpoint
2023-10-21 13:37:21 -03:00
Edresson Casanova
c4ceaabe2c
Add test sentences during the training
2023-10-21 13:33:56 -03:00
Edresson Casanova
2f868dd5c2
Bug fix on reproducible evaluation
2023-10-21 13:33:56 -03:00
Edresson Casanova
bafab049c2
Add prompting masking
2023-10-21 13:33:56 -03:00
Edresson Casanova
47d613df3a
Add reproducible evaluation
2023-10-21 13:33:56 -03:00
Edresson Casanova
40a4e631ea
Update mel spectrogram for the style encoder
2023-10-21 13:33:56 -03:00
Edresson Casanova
a32961bcb4
Add XTTS base training code
2023-10-21 13:33:56 -03:00
Julian Weber
dad6a7b0b6
Preserve [ja] token of the text processing
2023-10-21 11:26:03 +02:00
Julian Weber
c7a16042e3
Remove global cutlet import
2023-10-21 11:18:58 +02:00
Julian Weber
cf97116185
XTTS v1.1 ( #3089 )
...
* Add support for ne_hifigan
* Update model.json
* Update hash
* Fix model loading
* Enhance text_normalization
* Add xtts to zoo test exception
* Add model hash check
* Add get_number_tokens
2023-10-20 16:02:08 +02:00
Julian Weber
e5e0cbffc9
Streaming inference for XTTS 🚀 ( #3035 )
2023-10-06 18:34:06 +02:00
Edresson Casanova
4c3c11c958
Tortoise inference fix and fix zoo unit tests ( #3010 )
2023-09-29 13:40:57 +02:00
Aarni Koskela
09e14e68db
Remove duplicate get_named_beta_schedules
2023-09-27 01:09:59 +03:00
Aarni Koskela
59f85a7122
Remove duplicate code from xtts.tokenizer
2023-09-27 01:09:59 +03:00
Eren Gölge
4033db5f4b
🔥 XTTS implementation
2023-09-13 17:51:24 +02:00