coqui-tts

Commit Graph

Author	SHA1	Message	Date
Edresson	3cd889a9d4	Add support to use the speaker encoder as loss function in VITS model	2021-12-09 13:29:58 +00:00
Edresson	a3901032f4	Add H/ASP original checkpoint support	2021-12-09 13:28:16 +00:00
Edresson	256197b6aa	Fix the optimizer parameters bug in multilingual and multispeaker training	2021-12-09 13:27:21 +00:00
Edresson	f4abb19515	Fix bug after merge	2021-12-09 13:26:33 +00:00
Edresson	d7042ecfd8	Fix d-vector multispeaker training bug	2021-12-09 13:26:33 +00:00
Edresson	82611cfcd3	Fix unit tests	2021-12-09 13:18:36 +00:00
Edresson	cfa9910f9d	Fix pylint issues	2021-12-09 13:16:32 +00:00
Edresson	9071bf326f	Implement vocoder Fine Tuning like SC-GlowTTS paper	2021-12-09 13:16:32 +00:00
Edresson	3df5d9a619	Fix the bug in M-AILABS formatter	2021-12-09 13:11:06 +00:00
Edresson	d653227e59	Add voice conversion support for the model VITS trained with external speaker embedding	2021-12-09 13:11:06 +00:00
Edresson	56b548835d	Fix bug in VITS multilingual inference	2021-12-09 13:11:06 +00:00
Edresson	240356cd53	Fix bugs in the non-multilingual VITS inference	2021-12-09 13:11:06 +00:00
Edresson	32ece5d5ad	Fix pylint issues	2021-12-09 13:11:06 +00:00
Edresson	8e83a212fa	Add multilingual inference support	2021-12-09 13:10:09 +00:00
Edresson	d0e3647db6	Add multilingual training support to the VITS model	2021-12-09 13:07:00 +00:00
Edresson	829ee55b04	Implement multilingual dataloader support	2021-12-09 12:50:03 +00:00
Edresson	c9f5838bb4	Fix pylint issues	2021-12-09 12:38:58 +00:00
Edresson	1efcccd5c9	Implement training support with d_vecs in the VITS model	2021-12-09 12:37:37 +00:00
Edresson	c9c1960040	Allow ignore speakers for all multispeaker datasets	2021-12-09 12:35:12 +00:00
Edresson	234a4aacb3	Select randomly a speaker from the speaker manager for the test setences	2021-12-09 12:32:14 +00:00
Edresson	8310d19da8	Save speakers embeddings/ids before starting training	2021-12-09 12:23:02 +00:00
Eren Gölge	babdd84f91	Fix GST inference commit d3e477875a7e46a101fcf95a1794442823750fe2 Author: George Rousssos <25833833+george-roussos@users.noreply.github.com> Date: Wed Nov 3 10:16:12 2021 +0000 Read .wav for GST conditioning from CL commit 074e6d0874d3b34fb6a4991fc17d66dccd413fbb Author: George Rousssos <25833833+george-roussos@users.noreply.github.com> Date: Fri Oct 29 14:43:47 2021 +0100 Fix GST during inference in Tacotron2 commit fdece14585ab5a36eed1061a9a838d8e48aa6882 Author: George Rousssos <25833833+george-roussos@users.noreply.github.com> Date: Wed Nov 3 10:16:12 2021 +0000 Read .wav for GST conditioning from CL commit cd29e21b8d0a541ee298d2bf5f67223ad60be38f Author: George Rousssos <25833833+george-roussos@users.noreply.github.com> Date: Fri Oct 29 14:43:47 2021 +0100 Fix GST during inference in Tacotron2 commit 908ce39370eadcc9fa8510cdb26c9ead87305427 Author: George Rousssos <25833833+george-roussos@users.noreply.github.com> Date: Fri Oct 29 12:49:37 2021 +0100 Make trim_db value negative commit 1008a2e0f72fa7ca7f0307424f570386f2f16d42 Author: George Rousssos <25833833+george-roussos@users.noreply.github.com> Date: Fri Oct 29 12:22:24 2021 +0100 Set find_endpoint db threshold in config.json	2021-12-07 13:28:49 +00:00
Eren Gölge	2ed9e3c241	Fix constant use of noise augment	2021-11-08 09:20:34 +01:00
Eren Gölge	b6b14a76af	Fix VITS stochastic duration predictor	2021-11-08 09:20:11 +01:00
Eren Gölge	faafea4cf2	Fix style	2021-11-04 17:04:40 +01:00
Eren Gölge	c5077c6c3f	Merge branch 'dev' of https://github.com/coqui-ai/TTS into dev	2021-11-01 16:42:27 +01:00
Eren Gölge	20cebde1c9	Add docstring to MAI labs formatter	2021-11-01 16:41:55 +01:00
Eren Gölge	608f437545	Add a function to find unique chars	2021-11-01 16:41:33 +01:00
Eren Gölge	d6d780e758	Fix FastSpeech config	2021-11-01 16:41:15 +01:00
Michael Hansen	3bc043faeb	Upgrade to gruut 2.0 (#882 )	2021-10-31 11:41:55 +01:00
Eren Gölge	2df0752e73	Model zoo tests (#900 ) * Fix VITS model multi-speaker init * Remove gdrive support in model manager * Add model zoo tests	2021-10-29 17:54:16 +02:00
Eren Gölge	035ed432bc	Doc update (#889 ) * Link source files from the docs * Update glowTTS recipes for docs * Add dataset downloaders	2021-10-26 17:41:33 +02:00
Eren Gölge	0cac3f330a	Enable custom formatter in load_tts_samples	2021-10-26 13:07:11 +02:00
Eren Gölge	00becf2671	Fix import statements	2021-10-25 19:29:16 +02:00
Eren Gölge	2b7d159383	Update BaseTTS for multi-speaker training	2021-10-21 16:29:06 +00:00
Eren Gölge	e62d3c5cf7	Use absolute imports for tts configs and models	2021-10-21 16:29:06 +00:00
Eren Gölge	82fed4add2	Make style	2021-10-21 16:05:51 +00:00
Eren Gölge	3cb07fb6b5	Fix SpeakerManager init with data items	2021-10-21 13:54:39 +00:00
Eren Gölge	aea90e2501	Comment synthesis.py	2021-10-21 13:53:45 +00:00
Eren Gölge	3ab009ca8d	Edit model configs for multi-speaker	2021-10-21 13:51:37 +00:00
Eren Gölge	cea8e1739b	Update AlignTTS to use SpeakerManager	2021-10-20 18:22:41 +00:00
Eren Gölge	0e768dd4c5	Update comments	2021-10-20 18:21:26 +00:00
Eren Gölge	7c2cb7cc30	Update BaseTTS	2021-10-20 18:18:22 +00:00
Eren Gölge	330ee7d208	Comment BaseTacotron and remove unused funcs	2021-10-20 18:17:25 +00:00
Eren Gölge	aa25f70b95	Update ForwardTTS for multi-speaker	2021-10-20 18:16:41 +00:00
Eren Gölge	0ebc2a400e	Implement `_set_speaker_embedding` in GlowTTS	2021-10-20 18:15:20 +00:00
Eren Gölge	3da79a4de4	Comment Tacotron2 model	2021-10-20 18:14:04 +00:00
Eren Gölge	c514351c0e	Refactor multi-speaker init in BaseTTS-Tacotron1-2	2021-10-18 08:55:45 +00:00
Eren Gölge	127571423c	Update multi-speaker init in BaseTTS	2021-10-18 08:54:41 +00:00
Eren Gölge	a0a5d580e9	Approximate audio length from file size	2021-10-18 08:54:02 +00:00

1 2 3 4 5 ...

546 Commits