coqui-tts

Commit Graph

Author	SHA1	Message	Date
Edresson Casanova	856e185641	Add Resnet prosody encoder support	2022-06-13 13:47:22 +00:00
Edresson Casanova	4b59f07946	Support the use of speaker embedding as emotion embedding	2022-06-08 09:52:39 -03:00
Edresson Casanova	360b969c23	Fix rebase issues	2022-06-08 09:52:39 -03:00
Edresson Casanova	f50819a5f6	Fix compute embeddings issue	2022-06-08 09:52:37 -03:00
Edresson Casanova	0fb1b200c6	Fix rebase issues	2022-06-08 09:52:37 -03:00
Edresson Casanova	e8c4417f07	Fix Style tests	2022-06-08 09:52:36 -03:00
Edresson Casanova	a8292c7c03	Fix the Bug in Synthesizer	2022-06-08 09:52:36 -03:00
Edresson Casanova	6f33506d89	Fix unit tests	2022-06-08 09:52:35 -03:00
Eren Gölge	f70e82cd19	Use fsspec and torch for embedding file IO (#1581) * Use fsspec and torch for embedding file * Fixup * Fix load and save files * Fix compute embedding script * Set use_cuda to true if available * Add dummy speakers.pth file * Make style * Change default speakers file extension Co-authored-by: WeberJulian <julian.weber@hotmail.fr>	2022-06-01 13:49:42 +02:00
Edresson Casanova	6233f4fcd7	Bug fix in compute embedding without eval partition	2022-04-26 13:58:03 -03:00
Edresson Casanova	060e0f9368	Add EmbeddingManager and BaseIDManager (#1374)	2022-03-31 13:41:16 +02:00
Eren Gölge	72d85e53c9	Update model file extension (#1422) * Update model file ext to `.pth` * Update docs * Rename more * Find model files	2022-03-22 17:55:00 +01:00
Edresson Casanova	f81892483d	REBASED: Transform Speaker Encoder into a Generic Encoder and Implement Emotion Encoder training support (#1349) * Rename Speaker encoder module to encoder * Add a generic emotion dataset formatter * Transform the Speaker Encoder dataset to a generic dataset and create emotion encoder config * Add class map in emotion config * Add Base encoder config * Add evaluation encoder script * Fix the bug in plot_embeddings * Enable Weight decay for encoder training * Add argument to disable storage * Add Perfect Sampler and remove storage * Add evaluation during encoder training * Fix lint checks * Remove useless config parameter * Activate evaluation in speaker encoder test and use multispeaker dataset for this test * Unit test fixes * Remove useless tests to speed up the aux_tests * Use get_optimizer in Encoder * Add BaseEncoder Class * Fix the unit tests * Add Perfect Batch Sampler unit test * Add compute encoder accuracy in a function	2022-03-11 14:43:40 +01:00
Eren Gölge	127118c637	Update TTS.tts formatters (#1228) * Return Dict from tts formatters * Make style	2022-02-11 23:03:43 +01:00
WeberJulian	c7f5e005e1	Compute embedding for new audios only	2022-01-06 15:41:38 +01:00
Eren Gölge	043dca61b4	Rename `load_meta_data` as `load_tts_data`	2021-09-30 14:47:56 +00:00
Eren Gölge	4b7b88dd3d	Add fullband-melgan DE vocoder	2021-07-26 15:38:30 +02:00
Edresson Casanova	d5adc35fdf	Add docstring to compute_embeddings script	2021-07-21 07:16:10 -03:00
Edresson	b1620d1f3f	remove ignore generate eval flag	2021-07-15 03:34:28 -03:00
Edresson	d906fea08c	lint fix and eval as argparse in extract tts spectrograms	2021-07-13 02:15:31 -03:00
Edresson	2e5baffa9c	Merge fix and eval split as argparse	2021-07-13 01:47:32 -03:00
Eren Gölge	8f47f95998	correct import of `load_meta_data` remove redundant import	2021-06-28 17:03:19 +02:00
Edresson	1c4e806f54	use speaker manager on compute embeddings script	2021-06-27 03:35:34 -03:00
Edresson	99d40e98d9	fix Lint checks	2021-06-18 14:59:01 -03:00
Edresson	28bec238ca	fix Lint checks	2021-06-18 14:33:50 -03:00
Edresson	83644056e3	fix Lint checks	2021-06-18 14:32:28 -03:00
Edresson Casanova	e78e3cd81e	Merge branch 'dev' into dev	2021-06-18 14:10:03 -03:00
Edresson	b74b510d3c	Compute embeddings and find characters using config file	2021-06-18 14:04:49 -03:00
Eren Gölge	bec85ac58d	make style	2021-05-31 16:37:15 +02:00
Edresson	7448177b72	use SpeakerManager on compute embeddings script	2021-05-29 21:11:53 -03:00
Edresson	208bb0f0ee	add batched speaker encoder inference	2021-05-27 20:01:00 -03:00
Edresson	825734a3a9	remove unused embeddings export	2021-05-27 19:10:24 -03:00
Edresson	1496f271dc	update Compute embeddings script	2021-05-27 00:45:18 -03:00
Edresson	3433c2f348	add compute embedding for the new speaker encoder	2021-05-12 03:06:46 -03:00
Eren Gölge	f519012dea	reformatting and styling	2021-04-12 11:47:39 +02:00
erogol	8a820930c6	compute_embedding update	2020-11-25 14:46:08 +01:00
erogol	67e2b664e5	compute embeddings and create speakers.json	2020-11-12 12:51:17 +01:00
erogol	016d3503da	compute embeddings with speaker encoder	2020-11-10 12:51:02 +01:00

38 Commits