coqui-tts

Commit Graph

Author	SHA1	Message	Date
Edresson Casanova	bdefc43d96	Bug fix on pre-compute F0	2022-05-19 13:48:02 +00:00
Edresson Casanova	d94b8bac02	Add pitch predictor	2022-05-16 21:53:49 +00:00
Edresson Casanova	3a524b0597	Add prosody encoder params on config	2022-05-16 09:45:28 -03:00
Edresson Casanova	5271846d9c	Add Speech style balancer	2022-04-19 15:51:15 -03:00
Edresson Casanova	8a3396d9c1	Add prosody encoder training support	2022-04-18 17:01:44 -03:00
Edresson Casanova	7be9056b3d	Remove useless encoder weights reload	2022-03-31 11:05:58 -03:00
Edresson Casanova	b692c77e6a	Fix emotion unit test	2022-03-31 08:34:08 -03:00
Edresson Casanova	aebbdfc62b	Merge branch 'dev-managers' into dev-emotion	2022-03-30 16:25:47 -03:00
Edresson Casanova	40df2cfdd1	Change the speaker manager to a generic manager	2022-03-23 15:26:06 -03:00
Eren Gölge	72d85e53c9	Update model file extension (#1422) * Update model file ext to ```.pth``` * Update docs * Rename more * Find model files	2022-03-22 17:55:00 +01:00
Eren Gölge	0870a4faa2	Make style (#1405)	2022-03-16 12:13:55 +01:00
Edresson Casanova	4f03784b1f	Add emotion external embeddings training unit test	2022-03-15 13:09:58 +00:00
Edresson Casanova	5090034fd1	Add emotion consistency loss	2022-03-15 12:35:00 +00:00
Edresson Casanova	e3520e9e9f	Add Emotion Support for the VITS model	2022-03-15 01:16:48 +00:00
Edresson Casanova	12e0b6f39e	Change the speaker manager to a generic manager	2022-03-11 17:09:58 -03:00
Edresson Casanova	f81892483d	REBASED: Transform Speaker Encoder in a Generic Encoder and Implement Emotion Encoder training support (#1349) * Rename Speaker encoder module to encoder * Add a generic emotion dataset formatter * Transform the Speaker Encoder dataset to a generic dataset and create emotion encoder config * Add class map in emotion config * Add Base encoder config * Add evaluation encoder script * Fix the bug in plot_embeddings * Enable Weight decay for encoder training * Add argumnet to disable storage * Add Perfect Sampler and remove storage * Add evaluation during encoder training * Fix lint checks * Remove useless config parameter * Active evaluation in speaker encoder test and use multispeaker dataset for this test * Unit tests fixs * Remove useless tests for speedup the aux_tests * Use get_optimizer in Encoder * Add BaseEncoder Class * Fix the unitests * Add Perfect Batch Sampler unit test * Add compute encoder accuracy in a function	2022-03-11 14:43:40 +01:00
Edresson Casanova	917f417ac4	Add alphas to control language and speaker balancer (#1216) * Add alphas to control language and speaker balancer * Add docs for speaker and language samplers * Change the Samplers weights to float for save memory * Change the test_samplers to unittest format * Add get_sampler method in BaseTTS * Fix rebase issues * Add language and speaker samplers support for DDP training * Rename distributed sampler wrapper * Remove the DistributedSamplerWrapper and use the one from Trainer * Bugfix after rebase * Move the samplers config to tts config	2022-03-10 14:56:09 +01:00
Eren Gölge	1425a023fe	Make style and lint	2022-03-02 13:25:35 +01:00
Eren Gölge	27b67b7945	Fix import	2022-03-02 09:15:20 +01:00
Eren Gölge	690de1ab06	Update Characters and add more tests	2022-02-25 11:32:44 +01:00
Eren Gölge	14c117978d	Fix return outputs	2022-02-25 11:31:56 +01:00
Eren Gölge	424d04e4f6	Make stlye	2022-02-25 11:31:56 +01:00
Eren Gölge	c0b40a0cb7	Update VITS tests	2022-02-25 11:31:20 +01:00
Eren Gölge	b0cff949f5	Update tests	2022-02-25 11:28:14 +01:00
Eren Gölge	1f0c8179da	Make style	2022-02-25 11:26:59 +01:00
Eren Gölge	ef63c99524	Implement `start_by_longest` option for TTSDatase	2022-02-25 11:26:18 +01:00
Eren Gölge	c4c471d61d	Allow padding for shorter segments	2022-02-25 11:25:48 +01:00
Eren Gölge	bc2243bac4	Fix tests	2022-02-25 11:25:00 +01:00
Eren Gölge	21940952bf	Make lint	2022-02-25 11:25:00 +01:00
Eren Gölge	146fbfd7c9	Extend unittests	2022-02-25 11:25:00 +01:00
Eren Gölge	2fe16de8e3	Make lint	2022-02-25 11:25:00 +01:00
Eren Gölge	d0eb3e4ef2	Add get_tests_data_path	2022-02-25 11:24:13 +01:00
Eren Gölge	235f7d9b02	Extend glow_tts model tests	2022-02-25 11:24:13 +01:00
Eren Gölge	5176ae9e53	Fixes small compat. issues	2022-02-25 11:21:19 +01:00
Eren Gölge	edec27738b	Delete `use_espeak_phonemes` from tests	2022-02-25 11:18:00 +01:00
Eren Gölge	0a47a7eac0	Update tests	2022-02-25 11:12:44 +01:00
Eren Gölge	b341951b78	Update loader tests	2022-02-25 11:12:44 +01:00
Eren Gölge	196ae74273	Update data loader tests	2022-02-25 11:05:06 +01:00
Eren Gölge	75c507c36a	Update VITS LJspeech recipe	2022-02-25 10:57:35 +01:00
Eren Gölge	04202da1ac	Make style	2022-02-25 10:48:03 +01:00
Eren Gölge	961e98a461	Add OOV case to tokenizer tests	2022-02-25 10:48:03 +01:00
Eren Gölge	8c8093ce23	Make style	2022-02-25 10:48:03 +01:00
Eren Gölge	f1ea3ad182	Remove old text processing tests	2022-02-25 10:48:02 +01:00
Eren Gölge	ba3b60c90f	Test TTSTokenizer	2022-02-25 10:48:02 +01:00
Eren Gölge	79a84410f2	Test punctuations	2022-02-25 10:48:02 +01:00
Eren Gölge	99d9bb7a17	Test Phonemizers	2022-02-25 10:48:02 +01:00
Eren Gölge	a1df4f9887	Test character classes	2022-02-25 10:45:24 +01:00
Eren Gölge	a51b031bff	Merge branch 'dev' into dev-fix-glowtts-infer	2022-02-21 12:01:40 +03:00
Edresson Casanova	28a7464975	Fix the bug in split dataset function (#1251) * Fix the bug in split_dataset * Make eval_split_size configurable * Change test_loader to use load_tts_samples function * Change eval_split_portion to eval_split_size and permits to set the absolute number of samples in eval * Fix samplers unit test * Add data unit test on GitHub workflow	2022-02-21 11:59:36 +03:00
Edresson Casanova	531821545e	Fix inference test issue	2022-02-19 12:21:32 +00:00

437 Commits