coqui-tts

Commit Graph

Author	SHA1	Message	Date
Eren Gölge	b1b6876489	Make cloning configurable	2023-11-06 11:37:49 +01:00
Eren Gölge	c182535e2a	Make prompt embedding configurable	2023-11-06 11:37:08 +01:00
Eren Gölge	aa16da9194	Add cloning params to config	2023-11-06 11:37:08 +01:00
Eren Gölge	d2a2b7a82e	Placeholder model entry	2023-11-06 11:37:08 +01:00
Edresson Casanova	0664c843d8	Bug Fix on diffusion inference	2023-11-06 11:37:08 +01:00
Edresson Casanova	cff8542012	Bug Fix on XTTS v2.0 training	2023-11-06 11:37:08 +01:00
Edresson Casanova	32796fdfc1	Bug fix on gpt forward	2023-11-06 11:37:08 +01:00
Edresson Casanova	a032d9877b	Bug fix masking with XTTS perceiver	2023-11-06 11:37:08 +01:00
Edresson Casanova	5df8f76b0c	Update XTTS docs	2023-11-06 11:37:08 +01:00
Edresson Casanova	8479a3702c	Update GPT Trainer for perceiver support	2023-11-06 11:37:08 +01:00
Edresson Casanova	dff3902ca8	Add Perceiver	2023-11-06 11:37:08 +01:00
Edresson Casanova	1fb6c203ab	Use non-enhanced hifigan for test samples	2023-11-06 11:37:08 +01:00
Edresson Casanova	077a849b3b	Implement most similar ref training approach	2023-11-06 11:37:08 +01:00
Aarni Koskela	38f6f8f0bb	Run `make style` & re-enable it in CI (#3127)	2023-11-06 11:36:37 +01:00
Eren Gölge	6fef4f9067	Bump up to v0.19.1	2023-10-30 10:37:28 +01:00
Eren Gölge	eccc94be9b	Merge pull request #2983 from vltmedia/dev Bug: self.model_name needed to be initialized.	2023-10-28 10:39:25 +02:00
Eren Gölge	2d6bd716ef	Merge pull request #3109 from coqui-ai/tts_3067 fix for issue 3067	2023-10-28 10:37:52 +02:00
WeberJulian	1c98821359	Remove unused load_audio function	2023-10-27 22:27:18 +02:00
Aya Jafari	041b4b6723	fix for issue 3067	2023-10-26 13:06:01 -03:00
WeberJulian	d4e08c8d6c	Add features to get_conditioning_latents	2023-10-26 14:57:33 +02:00
WeberJulian	c1133724a1	Move lang token add to tokenizer	2023-10-26 14:52:13 +02:00
WeberJulian	6fa46d197d	Fix get_conditioning_latents when using only ne	2023-10-26 14:51:35 +02:00
Eren Gölge	edd3a28723	Bump up to v0.19.0	2023-10-25 13:29:38 +02:00
Edresson Casanova	01839af926	Bug fix on XTTS masking training	2023-10-24 18:30:14 -03:00
VLT Media	818aa0eb7e	Merge branch 'coqui-ai:dev' into dev	2023-10-23 23:36:33 -04:00
Edresson Casanova	0f96abb5ec	Add FT inference example on XTTS docs	2023-10-23 13:23:30 -03:00
Edresson Casanova	37b7945474	Update XTTS train not implemented error to point to the XTTS docs	2023-10-23 11:39:17 -03:00
Edresson Casanova	ec7f54768a	Rebase bug fix and update recipe	2023-10-21 17:37:51 -03:00
Edresson Casanova	affaf11148	Add XTTS training unit test	2023-10-21 13:41:12 -03:00
Edresson Casanova	1f92741d6a	Fix issue #2971	2023-10-21 13:37:21 -03:00
Edresson Casanova	5f98dbeec9	Update Ljspeech XTTS recipe	2023-10-21 13:37:21 -03:00
Edresson Casanova	9e3598c3b7	Bug Fix on inference using XTTS trainer checkpoint	2023-10-21 13:37:21 -03:00
Edresson Casanova	c4ceaabe2c	Add test sentences during the training	2023-10-21 13:33:56 -03:00
Edresson Casanova	2f868dd5c2	Bug fix on reproducible evaluation	2023-10-21 13:33:56 -03:00
Edresson Casanova	bafab049c2	Add prompting masking	2023-10-21 13:33:56 -03:00
Edresson Casanova	47d613df3a	Add reproducible evaluation	2023-10-21 13:33:56 -03:00
Edresson Casanova	40a4e631ea	Update mel spectrogram for the style encoder	2023-10-21 13:33:56 -03:00
Edresson Casanova	a32961bcb4	Add XTTS base training code	2023-10-21 13:33:56 -03:00
Eren Gölge	1e152692ed	Bump up to v0.18.2	2023-10-21 17:29:53 +02:00
Julian Weber	dad6a7b0b6	Preserve [ja] token of the text processing	2023-10-21 11:26:03 +02:00
Julian Weber	c7a16042e3	Remove global cutlet import	2023-10-21 11:18:58 +02:00
Edresson Casanova	414f0de0a1	Bump up to v0.18.1	2023-10-20 17:30:58 -03:00
Edresson Casanova	59576fc0ec	Bug fix on XTTS v1.1 inference (#3093) * Bug fix on XTTS v1.1 inference * Update .models.json --------- Co-authored-by: Julian Weber <julian.weber@hotmail.fr>	2023-10-20 17:29:43 -03:00
Eren Gölge	85e7323739	Bump up to v0.18.0	2023-10-20 16:03:24 +02:00
Julian Weber	cf97116185	XTTS v1.1 (#3089) * Add support for ne_hifigan * Update model.json * Update hash * Fix model loading * Enhance text_normalization * Add xtts to zoo test exception * Add model hash check * Add get_number_tokens	2023-10-20 16:02:08 +02:00
Eren Gölge	747f688dc3	Bump up to v0.17.10	2023-10-19 12:00:15 +02:00
Eren Gölge	93e6961bb5	Update .models.json	2023-10-19 11:59:49 +02:00
Eren Gölge	bf68848f38	Bump up to v0.17.9	2023-10-19 11:22:42 +02:00
Eren Gölge	c3b011217d	Update .models.json	2023-10-19 11:21:21 +02:00
David Garvey	a151d70242	Add stdout option (#3027) * add add cli options for play and speed --play argument uses simpleaudio to play the tts wav --speed <float 0.0-2.0> passes speed argument to Coqui Studio models * remove simpleaudio not referenced in file * fix simpleaudio dependency version * add ALSA headers for simpleaudio compilation * Dockerfile ALSA headers for simpleaudio * base changes to use stdout instead of play audio Considering conversion to pipe wav data for audio playback with ohter program like aplay. This is incomplete code. Using to get feedback before proceeding with implementation. * remove play for pipe_out arg that suppresses stdout removed play and simpleaudio dependency in place of pipe fuctionality to allow passing wav file data to a program dedicated to playing audio. * scipy.io.wavfile.write fails with /dev/null target * Streaming inference for XTTS 🚀 (#3035) * v0.17.7 * Redownload XTTS with the local and remote config do not match * Remove unused method * Print a message when it is already donwloaded * Try-except to present error when the user dont have connection * Fix style * 0.17.8 * v0.17.8 --------- Co-authored-by: Julian Weber <julian.weber@hotmail.fr> Co-authored-by: Eren Gölge <erogol@hotmail.com> Co-authored-by: Edresson Casanova <edresson1@gmail.com> Co-authored-by: ggoknar <ggoknar@coqui.ai>	2023-10-16 12:07:21 +02:00

1883 Commits