Edresson Casanova
2470599d18
Drop XTTS v1
2023-11-06 19:12:04 -03:00
Edresson Casanova
13243df526
Update XTTS v1.1 files
2023-11-06 19:10:21 -03:00
Edresson Casanova
09fb317e6d
Remove unused code
2023-11-06 17:36:32 -03:00
Edresson Casanova
b146de4ce8
Bug fix on XTTS v2.0 Trainer
2023-11-06 20:26:01 +01:00
Edresson Casanova
1b6f8d0e46
Update unit tests and recipes
2023-11-06 20:25:06 +01:00
Edresson Casanova
72b2bac0f8
Load reference in 24khz to avoid issued with multiple sr references
2023-11-06 20:25:06 +01:00
Edresson Casanova
00294ffdf6
Update XTTS docs
2023-11-06 20:24:06 +01:00
Edresson Casanova
459ad70dc8
Add support for multiples speaker references on XTTS inference
2023-11-06 20:22:35 +01:00
Eren Gölge
f0cb19ecca
Drop diffusion from XTTS ( #3150 )
...
* Drop diffusion for XTTS
* Make style
* Drop diffusion deps in code
* Restore thrashed
2023-11-06 20:15:49 +01:00
Eren G??lge
5d418bb84a
Update docs
2023-11-06 18:48:41 +01:00
Eren G??lge
9bbf6eb8dd
Drop use_ne_hifigan
2023-11-06 18:43:38 +01:00
Eren G??lge
9d54bd7655
Fixup XTTS
2023-11-06 18:13:58 +01:00
Eren Gölge
c713a839da
Update VERSION
2023-11-06 15:51:56 +01:00
Edresson Casanova
e45227d9ff
XTTS v2.0 ( #3137 )
...
* Implement most similar ref training approach
* Use non-enhanced hifigan for test samples
* Add Perceiver
* Update GPT Trainer for perceiver support
* Update XTTS docs
* Bug fix masking with XTTS perceiver
* Bug fix on gpt forward
* Bug Fix on XTTS v2.0 training
* Add XTTS v2.0 unit tests
* Add XTTS v2.0 inference unit tests
* Bug Fix on diffusion inference
* Add XTTS v2.0 training recipe
* Placeholder model entry
* Add cloning params to config
* Make prompt embedding configurable
* Make cloning configurable
* Cheap fix for a cheaper fix
* Prevent resampling
* Update model entry
* Update docs
* Update requirements
* Code linting
* Add xtts v2 to sep tests
* Bug fix on XTTS get_gpt_cond_latents
* Bug fix on rebase
* Make style
* Bug fix in Japenese tokenizer
* Add num2words to deps
* Remove unused kwarg and added num_beams=1 as default
---------
Co-authored-by: Eren G??lge <egolge@coqui.ai>
2023-11-06 14:58:18 +01:00
Aarni Koskela
38f6f8f0bb
Run `make style` & re-enable it in CI ( #3127 )
2023-11-06 11:36:37 +01:00
Aarni Koskela
5ae369d629
Move FreeVCConfig to TTS.vc.configs (like all other config classes)
2023-10-31 16:56:25 +02:00
Eren Gölge
6fef4f9067
Bump up to v0.19.1
2023-10-30 10:37:28 +01:00
Eren Gölge
eccc94be9b
Merge pull request #2983 from vltmedia/dev
...
Bug: self.model_name needed to be initialized.
2023-10-28 10:39:25 +02:00
Eren Gölge
2d6bd716ef
Merge pull request #3109 from coqui-ai/tts_3067
...
fix for issue 3067
2023-10-28 10:37:52 +02:00
WeberJulian
1c98821359
Remove unused load_audio function
2023-10-27 22:27:18 +02:00
Aya Jafari
041b4b6723
fix for issue 3067
2023-10-26 13:06:01 -03:00
WeberJulian
d4e08c8d6c
Add features to get_conditioning_latents
2023-10-26 14:57:33 +02:00
WeberJulian
c1133724a1
Move lang token add to tokenizer
2023-10-26 14:52:13 +02:00
WeberJulian
6fa46d197d
Fix get_conditioning_latents when using only ne
2023-10-26 14:51:35 +02:00
Eren Gölge
edd3a28723
Bump up to v0.19.0
2023-10-25 13:29:38 +02:00
Edresson Casanova
01839af926
Bug fix on XTTS masking training
2023-10-24 18:30:14 -03:00
VLT Media
818aa0eb7e
Merge branch 'coqui-ai:dev' into dev
2023-10-23 23:36:33 -04:00
Edresson Casanova
0f96abb5ec
Add FT inference example on XTTS docs
2023-10-23 13:23:30 -03:00
Edresson Casanova
37b7945474
Update XTTS train not implemented error to point to the XTTS docs
2023-10-23 11:39:17 -03:00
Edresson Casanova
ec7f54768a
Rebase bug fix and update recipe
2023-10-21 17:37:51 -03:00
Edresson Casanova
affaf11148
Add XTTS training unit test
2023-10-21 13:41:12 -03:00
Edresson Casanova
1f92741d6a
Fix issue #2971
2023-10-21 13:37:21 -03:00
Edresson Casanova
5f98dbeec9
Update Ljspeech XTTS recipe
2023-10-21 13:37:21 -03:00
Edresson Casanova
9e3598c3b7
Bug Fix on inference using XTTS trainer checkpoint
2023-10-21 13:37:21 -03:00
Edresson Casanova
c4ceaabe2c
Add test sentences during the training
2023-10-21 13:33:56 -03:00
Edresson Casanova
2f868dd5c2
Bug fix on reproducible evaluation
2023-10-21 13:33:56 -03:00
Edresson Casanova
bafab049c2
Add prompting masking
2023-10-21 13:33:56 -03:00
Edresson Casanova
47d613df3a
Add reproducible evaluation
2023-10-21 13:33:56 -03:00
Edresson Casanova
40a4e631ea
Update mel spectrogram for the style encoder
2023-10-21 13:33:56 -03:00
Edresson Casanova
a32961bcb4
Add XTTS base training code
2023-10-21 13:33:56 -03:00
Eren Gölge
1e152692ed
Bump up to v0.18.2
2023-10-21 17:29:53 +02:00
Julian Weber
dad6a7b0b6
Preserve [ja] token of the text processing
2023-10-21 11:26:03 +02:00
Julian Weber
c7a16042e3
Remove global cutlet import
2023-10-21 11:18:58 +02:00
Edresson Casanova
414f0de0a1
Bump up to v0.18.1
2023-10-20 17:30:58 -03:00
Edresson Casanova
59576fc0ec
Bug fix on XTTS v1.1 inference ( #3093 )
...
* Bug fix on XTTS v1.1 inference
* Update .models.json
---------
Co-authored-by: Julian Weber <julian.weber@hotmail.fr>
2023-10-20 17:29:43 -03:00
Eren Gölge
85e7323739
Bump up to v0.18.0
2023-10-20 16:03:24 +02:00
Julian Weber
cf97116185
XTTS v1.1 ( #3089 )
...
* Add support for ne_hifigan
* Update model.json
* Update hash
* Fix model loading
* Enhance text_normalization
* Add xtts to zoo test exception
* Add model hash check
* Add get_number_tokens
2023-10-20 16:02:08 +02:00
Eren Gölge
747f688dc3
Bump up to v0.17.10
2023-10-19 12:00:15 +02:00
Eren Gölge
93e6961bb5
Update .models.json
2023-10-19 11:59:49 +02:00
Eren Gölge
bf68848f38
Bump up to v0.17.9
2023-10-19 11:22:42 +02:00
Eren Gölge
c3b011217d
Update .models.json
2023-10-19 11:21:21 +02:00
David Garvey
a151d70242
Add stdout option ( #3027 )
...
* add add cli options for play and speed
--play argument uses simpleaudio to play the tts wav
--speed <float 0.0-2.0> passes speed argument to Coqui Studio models
* remove simpleaudio not referenced in file
* fix simpleaudio dependency version
* add ALSA headers for simpleaudio compilation
* Dockerfile ALSA headers for simpleaudio
* base changes to use stdout instead of play audio
Considering conversion to pipe wav data for audio playback with ohter program
like aplay.
This is incomplete code. Using to get feedback before proceeding with
implementation.
* remove play for pipe_out arg that suppresses stdout
removed play and simpleaudio dependency in place of pipe
fuctionality to allow passing wav file data to a program
dedicated to playing audio.
* scipy.io.wavfile.write fails with /dev/null target
* Streaming inference for XTTS 🚀 (#3035 )
* v0.17.7
* Redownload XTTS with the local and remote config do not match
* Remove unused method
* Print a message when it is already donwloaded
* Try-except to present error when the user dont have connection
* Fix style
* 0.17.8
* v0.17.8
---------
Co-authored-by: Julian Weber <julian.weber@hotmail.fr>
Co-authored-by: Eren Gölge <erogol@hotmail.com>
Co-authored-by: Edresson Casanova <edresson1@gmail.com>
Co-authored-by: ggoknar <ggoknar@coqui.ai>
2023-10-16 12:07:21 +02:00
Dusty Hagstrom
13cd076a7f
Synthesizer skips over embeddings file if model only has one speaker ( #2587 )
...
* It looks like the Neon model is special in that t does not have a speaker_name and it wants to get the only item available. This was blocking a valid model with one speaker and a d_vector_file from being executed to get the embedding.
* Update synthesizer.py
oh my how embarrassing
2023-10-16 11:55:45 +02:00
Aya Jafari
ffddf10458
unit test fix
2023-10-13 10:56:47 -03:00
Aya Jafari
6eaecab0ca
fixed bugs in fastpitch tts synthesis
2023-10-10 23:02:31 -03:00
ggoknar
99635193f5
v0.17.8
2023-10-07 01:14:05 +03:00
ggoknar
3bb51b1276
0.17.8
2023-10-07 01:13:02 +03:00
Edresson Casanova
2852404bdf
Fix style
2023-10-06 17:42:46 -03:00
Edresson Casanova
99650044a4
Try-except to present error when the user dont have connection
2023-10-06 17:37:05 -03:00
Edresson Casanova
529ea3f67f
Print a message when it is already donwloaded
2023-10-06 17:26:40 -03:00
Edresson Casanova
ee1ef1c51e
Remove unused method
2023-10-06 17:21:22 -03:00
Edresson Casanova
4a6103fec9
Redownload XTTS with the local and remote config do not match
2023-10-06 17:16:30 -03:00
Eren Gölge
0520697b5f
v0.17.7
2023-10-06 18:35:26 +02:00
Julian Weber
e5e0cbffc9
Streaming inference for XTTS 🚀 ( #3035 )
2023-10-06 18:34:06 +02:00
OPERATOR
2150136210
None is not able to be read for "XTTS", fixes crash if its set to None. ( #3009 )
2023-10-02 12:53:36 +02:00
Eren Gölge
155c5fc0bd
v0.17.6
2023-09-29 23:44:09 +02:00
Edresson Casanova
4c3c11c958
Tortoise inference fix and fix zoo unit tests ( #3010 )
2023-09-29 13:40:57 +02:00
Eren Gölge
bb05dcb9b4
Merge pull request #2922 from coqui-ai/be_tts
...
Adding Belarusian TTS model
2023-09-27 09:48:28 +02:00
Eren Gölge
8cba47191f
Merge pull request #2993 from akx/tts-readme
...
Ensure `tts` CLI tool readme and usage is in sync
2023-09-27 09:46:54 +02:00
Eren Gölge
ea51a7ffcc
Merge pull request #3003 from akx/duplicate-code-removal
...
Duplicate code removal
2023-09-27 09:41:35 +02:00
Aarni Koskela
0dbe7cbcc4
Remove duplicate convert_pad_shape
2023-09-27 01:10:48 +03:00
Aarni Koskela
33a7c722f6
Merge duplicate on_train_step_start functions in delightful_tts
2023-09-27 01:10:44 +03:00
Aarni Koskela
861c68b0b8
Rename misnamed setter
2023-09-27 01:09:59 +03:00
Aarni Koskela
09e14e68db
Remove duplicate get_named_beta_schedules
2023-09-27 01:09:59 +03:00
Aarni Koskela
59f85a7122
Remove duplicate code from xtts.tokenizer
2023-09-27 01:09:59 +03:00
Aarni Koskela
0a82f063cc
Late-import main TTS libraries in `tts` CLI
2023-09-26 15:38:56 +03:00
Aarni Koskela
5c047cf304
Ensure `tts` CLI tool readme and usage help is in sync
2023-09-26 15:38:56 +03:00
Eren Gölge
0b95b88f13
Bum up to v0.17.5
2023-09-25 18:16:45 +02:00
VLT Media
dd73910651
Bug: self.model_name needed to be initialized.
...
Bug: self.model_name needed to be initialize to get around a bug that automatically crashes when the user provides the model paths but no model_name when initializing the TTS object.
2023-09-23 01:41:35 -04:00
loupzeur
da8b6bbce1
fix: xtts not taking into account device flag ( #2951 )
...
* fix: xtts not taking into account device flag
* Style changes
---------
Co-authored-by: Julian Weber <julian.weber@hotmail.fr>
2023-09-20 09:57:02 +02:00
Reuben Morais
f829bf50f8
Bump version to v0.17.4 (really)
2023-09-15 16:40:34 +02:00
Eren G??lge
aa8fa4756e
Bump up to v0.17.4
2023-09-14 17:52:44 +02:00
Eren G??lge
9d0b76ce23
Check env var for COQUI_TOS_AGREED
2023-09-14 17:51:40 +02:00
Eren G??lge
13dd7c4c9e
Bump up to v0.17.2
2023-09-14 15:24:05 +02:00
Eren G??lge
ded7fd4fb2
Make style
2023-09-14 15:23:37 +02:00
Eren G??lge
44b61d2b92
Fixup
2023-09-14 15:22:54 +02:00
Eren Gölge
623ea41634
Fix model tests ( #2943 )
2023-09-14 15:21:48 +02:00
Eren G??lge
af62613c86
Bump up to v0.17.1
2023-09-13 18:23:39 +02:00
Eren G??lge
ee7cee0e35
Fixup
2023-09-13 18:21:44 +02:00
Eren G??lge
5dcf9ae311
Bump up v0.17.0
2023-09-13 18:04:26 +02:00
Eren Gölge
4033db5f4b
🔥 XTTS implementation
2023-09-13 17:51:24 +02:00
Edresson Casanova
4d3f23b5d3
Add CML-TTS dataset YourTTS training recipe ( #2934 )
2023-09-12 11:49:14 +02:00
Eren Gölge
9533f8656c
Make style
2023-09-04 13:58:37 +02:00
Eren Gölge
562a9509f2
Add BE model
2023-09-04 13:57:03 +02:00
Eren Gölge
b4c82685a7
Add model entries
2023-09-04 13:04:58 +02:00
Cohee
b3b1555d82
Fix exception handling in manage.py ( #2912 )
2023-09-04 12:54:30 +02:00
Eren G??lge
40b527345f
Bump up to v0.16.6
2023-09-04 12:51:53 +02:00
Aleś Bułojčyk
fead04f779
Add phonemizer for Belarusian language ( #2856 )
2023-08-28 11:20:45 +02:00
Jake Tae
b79b6f0762
feature: add device flag to tts cli ( #2875 )
2023-08-28 11:20:12 +02:00
Eren Gölge
c0b5e61749
Bump up to v0.16.5
2023-08-26 12:00:25 +02:00