Eren G??lge
b1b6876489
Make cloning configurable
2023-11-06 11:37:49 +01:00
Eren G??lge
c182535e2a
Make prompt embedding configurable
2023-11-06 11:37:08 +01:00
Eren G??lge
aa16da9194
Add cloning params to config
2023-11-06 11:37:08 +01:00
Eren G??lge
d2a2b7a82e
Placeholder model entry
2023-11-06 11:37:08 +01:00
Edresson Casanova
0664c843d8
Bug Fix on diffusion inference
2023-11-06 11:37:08 +01:00
Edresson Casanova
cff8542012
Bug Fix on XTTS v2.0 training
2023-11-06 11:37:08 +01:00
Edresson Casanova
32796fdfc1
Bug fix on gpt forward
2023-11-06 11:37:08 +01:00
Edresson Casanova
a032d9877b
Bug fix masking with XTTS perceiver
2023-11-06 11:37:08 +01:00
Edresson Casanova
5df8f76b0c
Update XTTS docs
2023-11-06 11:37:08 +01:00
Edresson Casanova
8479a3702c
Update GPT Trainer for perceiver support
2023-11-06 11:37:08 +01:00
Edresson Casanova
dff3902ca8
Add Perceiver
2023-11-06 11:37:08 +01:00
Edresson Casanova
1fb6c203ab
Use non-enhanced hifigan for test samples
2023-11-06 11:37:08 +01:00
Edresson Casanova
077a849b3b
Implement most similar ref training approach
2023-11-06 11:37:08 +01:00
Aarni Koskela
38f6f8f0bb
Run `make style` & re-enable it in CI ( #3127 )
2023-11-06 11:36:37 +01:00
Eren Gölge
6fef4f9067
Bump up to v0.19.1
2023-10-30 10:37:28 +01:00
Eren Gölge
eccc94be9b
Merge pull request #2983 from vltmedia/dev
...
Bug: self.model_name needed to be initialized.
2023-10-28 10:39:25 +02:00
Eren Gölge
2d6bd716ef
Merge pull request #3109 from coqui-ai/tts_3067
...
fix for issue 3067
2023-10-28 10:37:52 +02:00
WeberJulian
1c98821359
Remove unused load_audio function
2023-10-27 22:27:18 +02:00
Aya Jafari
041b4b6723
fix for issue 3067
2023-10-26 13:06:01 -03:00
WeberJulian
d4e08c8d6c
Add features to get_conditioning_latents
2023-10-26 14:57:33 +02:00
WeberJulian
c1133724a1
Move lang token add to tokenizer
2023-10-26 14:52:13 +02:00
WeberJulian
6fa46d197d
Fix get_conditioning_latents when using only ne
2023-10-26 14:51:35 +02:00
Eren Gölge
edd3a28723
Bump up to v0.19.0
2023-10-25 13:29:38 +02:00
Edresson Casanova
01839af926
Bug fix on XTTS masking training
2023-10-24 18:30:14 -03:00
VLT Media
818aa0eb7e
Merge branch 'coqui-ai:dev' into dev
2023-10-23 23:36:33 -04:00
Edresson Casanova
0f96abb5ec
Add FT inference example on XTTS docs
2023-10-23 13:23:30 -03:00
Edresson Casanova
37b7945474
Update XTTS train not implemented error to point to the XTTS docs
2023-10-23 11:39:17 -03:00
Edresson Casanova
ec7f54768a
Rebase bug fix and update recipe
2023-10-21 17:37:51 -03:00
Edresson Casanova
affaf11148
Add XTTS training unit test
2023-10-21 13:41:12 -03:00
Edresson Casanova
1f92741d6a
Fix issue #2971
2023-10-21 13:37:21 -03:00
Edresson Casanova
5f98dbeec9
Update Ljspeech XTTS recipe
2023-10-21 13:37:21 -03:00
Edresson Casanova
9e3598c3b7
Bug Fix on inference using XTTS trainer checkpoint
2023-10-21 13:37:21 -03:00
Edresson Casanova
c4ceaabe2c
Add test sentences during the training
2023-10-21 13:33:56 -03:00
Edresson Casanova
2f868dd5c2
Bug fix on reproducible evaluation
2023-10-21 13:33:56 -03:00
Edresson Casanova
bafab049c2
Add prompting masking
2023-10-21 13:33:56 -03:00
Edresson Casanova
47d613df3a
Add reproducible evaluation
2023-10-21 13:33:56 -03:00
Edresson Casanova
40a4e631ea
Update mel spectrogram for the style encoder
2023-10-21 13:33:56 -03:00
Edresson Casanova
a32961bcb4
Add XTTS base training code
2023-10-21 13:33:56 -03:00
Eren Gölge
1e152692ed
Bump up to v0.18.2
2023-10-21 17:29:53 +02:00
Julian Weber
dad6a7b0b6
Preserve [ja] token of the text processing
2023-10-21 11:26:03 +02:00
Julian Weber
c7a16042e3
Remove global cutlet import
2023-10-21 11:18:58 +02:00
Edresson Casanova
414f0de0a1
Bump up to v0.18.1
2023-10-20 17:30:58 -03:00
Edresson Casanova
59576fc0ec
Bug fix on XTTS v1.1 inference ( #3093 )
...
* Bug fix on XTTS v1.1 inference
* Update .models.json
---------
Co-authored-by: Julian Weber <julian.weber@hotmail.fr>
2023-10-20 17:29:43 -03:00
Eren Gölge
85e7323739
Bump up to v0.18.0
2023-10-20 16:03:24 +02:00
Julian Weber
cf97116185
XTTS v1.1 ( #3089 )
...
* Add support for ne_hifigan
* Update model.json
* Update hash
* Fix model loading
* Enhance text_normalization
* Add xtts to zoo test exception
* Add model hash check
* Add get_number_tokens
2023-10-20 16:02:08 +02:00
Eren Gölge
747f688dc3
Bump up to v0.17.10
2023-10-19 12:00:15 +02:00
Eren Gölge
93e6961bb5
Update .models.json
2023-10-19 11:59:49 +02:00
Eren Gölge
bf68848f38
Bump up to v0.17.9
2023-10-19 11:22:42 +02:00
Eren Gölge
c3b011217d
Update .models.json
2023-10-19 11:21:21 +02:00
David Garvey
a151d70242
Add stdout option ( #3027 )
...
* add add cli options for play and speed
--play argument uses simpleaudio to play the tts wav
--speed <float 0.0-2.0> passes speed argument to Coqui Studio models
* remove simpleaudio not referenced in file
* fix simpleaudio dependency version
* add ALSA headers for simpleaudio compilation
* Dockerfile ALSA headers for simpleaudio
* base changes to use stdout instead of play audio
Considering conversion to pipe wav data for audio playback with ohter program
like aplay.
This is incomplete code. Using to get feedback before proceeding with
implementation.
* remove play for pipe_out arg that suppresses stdout
removed play and simpleaudio dependency in place of pipe
fuctionality to allow passing wav file data to a program
dedicated to playing audio.
* scipy.io.wavfile.write fails with /dev/null target
* Streaming inference for XTTS 🚀 (#3035 )
* v0.17.7
* Redownload XTTS with the local and remote config do not match
* Remove unused method
* Print a message when it is already donwloaded
* Try-except to present error when the user dont have connection
* Fix style
* 0.17.8
* v0.17.8
---------
Co-authored-by: Julian Weber <julian.weber@hotmail.fr>
Co-authored-by: Eren Gölge <erogol@hotmail.com>
Co-authored-by: Edresson Casanova <edresson1@gmail.com>
Co-authored-by: ggoknar <ggoknar@coqui.ai>
2023-10-16 12:07:21 +02:00