coqui-tts

Commit Graph

Author	SHA1	Message	Date
Edresson Casanova	b692c77e6a	Fix emotion unit test	2022-03-31 08:34:08 -03:00
Edresson Casanova	047cebd7b8	Fix Style tests	2022-03-30 16:51:39 -03:00
Edresson Casanova	aebbdfc62b	Merge branch 'dev-managers' into dev-emotion	2022-03-30 16:25:47 -03:00
Edresson Casanova	34a92f1b1b	Fix the Bug in Synthesizer	2022-03-30 15:32:35 -03:00
Edresson Casanova	397b3e9baf	Fix style tests	2022-03-23 15:31:33 -03:00
Edresson Casanova	ab20a34170	Fix bug in get_speaker_manager	2022-03-23 15:27:01 -03:00
Edresson Casanova	cb941530df	Fix docs of set_language_ids_from_config	2022-03-23 15:27:01 -03:00
Edresson Casanova	2bc2685ff9	Add parse_key in set_ids_from_data	2022-03-23 15:27:01 -03:00
Edresson Casanova	88e0cfa5a0	Rename set_embeddings_from_file to load_embeddings_from_file	2022-03-23 15:27:01 -03:00
Edresson Casanova	b7eefac47d	Rename set_ids_from_file to load_ids_from_file	2022-03-23 15:27:01 -03:00
Edresson Casanova	24274c58f8	Fix unit tests	2022-03-23 15:27:01 -03:00
Edresson Casanova	c7af7c6474	Implement LanguageManager inherit BaseIDManager	2022-03-23 15:26:59 -03:00
Edresson Casanova	4fdc864f74	Add EmbeddingManager and BaseIDManager	2022-03-23 15:26:59 -03:00
Edresson Casanova	40df2cfdd1	Change the speaker manager to a generic manager	2022-03-23 15:26:06 -03:00
Eren Gölge	3af01cfe3b	Update base model wrt 👟 (#1406)	2022-03-23 17:24:20 +01:00
WeberJulian	3c7c14607b	Add formatting tests (#1437) * Add style checks to `make lint` * Bump target-version in black config	2022-03-23 17:23:36 +01:00
Eren Gölge	1c3623af33	Fix model manager (#1436) * Fix manager * Make style	2022-03-23 12:57:14 +01:00
Eren Gölge	72d85e53c9	Update model file extension (#1422) * Update model file ext to ```.pth``` * Update docs * Rename more * Find model files	2022-03-22 17:55:00 +01:00
Edresson Casanova	ccdc2300dc	Add eval_split and eval_split_size in the call of load_tts_samples for all recipes (#1424)	2022-03-22 12:54:41 +01:00
Eren Gölge	2e6e8f651d	Update CheckSpectrograms notebook (#1418)	2022-03-18 16:48:24 +01:00
Eren Gölge	c7f9ec07c8	Hinge Gruut version to 2.2.3 (#1419)	2022-03-18 16:47:50 +01:00
Edresson Casanova	10dee54ac3	Bug fix in single speaker emotion embedding training	2022-03-16 20:57:14 +00:00
Eren Gölge	fd56fabb21	Fix #1380 (#1409)	2022-03-16 12:38:27 +01:00
Eren Gölge	0870a4faa2	Make style (#1405)	2022-03-16 12:13:55 +01:00
WeberJulian	690c96ed28	Fix default phonemizer for ja and zh (#1399)	2022-03-16 12:13:22 +01:00
Eren Gölge	f40b833659	Add CITATION.cff (#1404)	2022-03-16 12:05:17 +01:00
WeberJulian	24b57f6a0e	Fix typo workflow text (#1403)	2022-03-16 11:51:37 +01:00
Edresson Casanova	38027b15c2	Fix unit tests	2022-03-15 19:40:07 +00:00
Edresson Casanova	4f03784b1f	Add emotion external embeddings training unit test	2022-03-15 13:09:58 +00:00
Edresson Casanova	5090034fd1	Add emotion consistency loss	2022-03-15 12:35:00 +00:00
Edresson Casanova	cc3821332b	Fix the bug in sythesizer	2022-03-15 12:33:36 +00:00
Edresson Casanova	e3520e9e9f	Add Emotion Support for the VITS model	2022-03-15 01:16:48 +00:00
Edresson Casanova	18d3565d37	Add emotion manager	2022-03-14 14:26:40 +00:00
Edresson Casanova	e52b40aca4	Fix bug in get_speaker_manager	2022-03-14 14:15:18 +00:00
Edresson Casanova	8040b930a8	Fix docs of set_language_ids_from_config	2022-03-14 14:14:37 +00:00
Edresson Casanova	0e258d1784	Add parse_key in set_ids_from_data	2022-03-14 13:53:46 +00:00
Edresson Casanova	464775dbaf	Rename set_embeddings_from_file to load_embeddings_from_file	2022-03-14 13:34:16 +00:00
Edresson Casanova	7e59755d63	Rename set_ids_from_file to load_ids_from_file	2022-03-14 13:31:01 +00:00
Edresson Casanova	25da4d9b74	Fix unit tests	2022-03-11 19:55:29 -03:00
Edresson Casanova	e33819b7de	Implement LanguageManager inherit BaseIDManager	2022-03-11 19:25:18 -03:00
Edresson Casanova	eac06a5e87	Add EmbeddingManager and BaseIDManager	2022-03-11 19:01:51 -03:00
Edresson Casanova	12e0b6f39e	Change the speaker manager to a generic manager	2022-03-11 17:09:58 -03:00
Edresson Casanova	f81892483d	REBASED: Transform Speaker Encoder in a Generic Encoder and Implement Emotion Encoder training support (#1349) * Rename Speaker encoder module to encoder * Add a generic emotion dataset formatter * Transform the Speaker Encoder dataset to a generic dataset and create emotion encoder config * Add class map in emotion config * Add Base encoder config * Add evaluation encoder script * Fix the bug in plot_embeddings * Enable Weight decay for encoder training * Add argumnet to disable storage * Add Perfect Sampler and remove storage * Add evaluation during encoder training * Fix lint checks * Remove useless config parameter * Active evaluation in speaker encoder test and use multispeaker dataset for this test * Unit tests fixs * Remove useless tests for speedup the aux_tests * Use get_optimizer in Encoder * Add BaseEncoder Class * Fix the unitests * Add Perfect Batch Sampler unit test * Add compute encoder accuracy in a function	2022-03-11 14:43:40 +01:00
Edresson Casanova	36e9ea2f97	Open bible dataset formatter (#1365) * Add support for voice conversion inference * Cache d_vectors_by_speaker for fast inference using a bigger speakers.json * Rebase bug fix * Use the average d-vector for inference * Fix the bug in find unique chars script * Add OpenBible formatter Co-authored-by: Eren Gölge <erogol@hotmail.com>	2022-03-11 10:43:31 +01:00
Eren Gölge	b0be825d92	Update issue template (#1370) * Add bug_report template * Fix typos	2022-03-11 10:40:20 +01:00
Edresson Casanova	dbe9da7f15	Add Voice conversion inference support (#1337) * Add support for voice conversion inference * Cache d_vectors_by_speaker for fast inference using a bigger speakers.json * Rebase bug fix * Use the average d-vector for inference	2022-03-10 14:57:12 +01:00
Edresson Casanova	917f417ac4	Add alphas to control language and speaker balancer (#1216) * Add alphas to control language and speaker balancer * Add docs for speaker and language samplers * Change the Samplers weights to float for save memory * Change the test_samplers to unittest format * Add get_sampler method in BaseTTS * Fix rebase issues * Add language and speaker samplers support for DDP training * Rename distributed sampler wrapper * Remove the DistributedSamplerWrapper and use the one from Trainer * Bugfix after rebase * Move the samplers config to tts config	2022-03-10 14:56:09 +01:00
Edresson Casanova	f381e29b91	REBASED: Add support for the speaker encoder training using torch spectrograms (#1348) * Add support for the speaker encoder training using torch spectrograms * Remove useless function in speaker encoder dataset class	2022-03-10 14:54:51 +01:00
Eren Gölge	07d96f7991	Fix DocQA title	2022-03-10 12:17:06 +01:00
Yanlong Wang	8a007c8834	feat: add docsqa to docs website (#1363)	2022-03-10 11:40:06 +01:00

4054 Commits

All Branches