Commit Graph

29 Commits

Author SHA1 Message Date
Edresson Casanova 6233f4fcd7 Bug fix in compute embedding without eval partition 2022-04-26 13:58:03 -03:00
Edresson Casanova 060e0f9368
Add EmbeddingManager and BaseIDManager (#1374) 2022-03-31 13:41:16 +02:00
Eren Gölge 72d85e53c9
Update model file extension (#1422)
* Update model file ext to ```.pth```

* Update docs

* Rename more

* Find model files
2022-03-22 17:55:00 +01:00
Edresson Casanova f81892483d
REBASED: Transform Speaker Encoder in a Generic Encoder and Implement Emotion Encoder training support (#1349)
* Rename Speaker encoder module to encoder

* Add a generic emotion dataset formatter

* Transform the Speaker Encoder dataset to a generic dataset and create emotion encoder config

* Add class map in emotion config

* Add Base encoder config

* Add evaluation encoder script

* Fix the bug in plot_embeddings

* Enable Weight decay for encoder training

* Add argumnet to disable storage

* Add Perfect Sampler and remove storage

* Add evaluation during encoder training

* Fix lint checks

* Remove useless config parameter

* Active evaluation in speaker encoder test and use multispeaker dataset for this test

* Unit tests fixs

* Remove useless tests for speedup the aux_tests

* Use get_optimizer in Encoder

* Add BaseEncoder Class

* Fix the unitests

* Add Perfect Batch Sampler unit test

* Add compute encoder accuracy in a function
2022-03-11 14:43:40 +01:00
Eren Gölge 127118c637
Update TTS.tts formatters (#1228)
* Return Dict from tts formatters

* Make style
2022-02-11 23:03:43 +01:00
WeberJulian c7f5e005e1 Compute embedding for new audios only 2022-01-06 15:41:38 +01:00
Eren Gölge 043dca61b4 Rename `load_meta_data` as `load_tts_data` 2021-09-30 14:47:56 +00:00
Eren Gölge 4b7b88dd3d Add fullband-melgan DE vocoder 2021-07-26 15:38:30 +02:00
Edresson Casanova d5adc35fdf
Add docstring to compute_embeddings script 2021-07-21 07:16:10 -03:00
Edresson b1620d1f3f remove ignore generate eval flag 2021-07-15 03:34:28 -03:00
Edresson d906fea08c lint fix and eval as argparse in extract tts spectrograms 2021-07-13 02:15:31 -03:00
Edresson 2e5baffa9c Merge fix and eval split as argparse 2021-07-13 01:47:32 -03:00
Eren Gölge 8f47f95998 correct import of `load_meta_data`
remove redundant import
2021-06-28 17:03:19 +02:00
Edresson 1c4e806f54 use speaker manager on compute embeddings script 2021-06-27 03:35:34 -03:00
Edresson 99d40e98d9 fix Lint checks 2021-06-18 14:59:01 -03:00
Edresson 28bec238ca fix Lint checks 2021-06-18 14:33:50 -03:00
Edresson 83644056e3 fix Lint checks 2021-06-18 14:32:28 -03:00
Edresson Casanova e78e3cd81e
Merge branch 'dev' into dev 2021-06-18 14:10:03 -03:00
Edresson b74b510d3c Compute embeddings and find characters using config file 2021-06-18 14:04:49 -03:00
Eren Gölge bec85ac58d make style 2021-05-31 16:37:15 +02:00
Edresson 7448177b72 use SpeakerManager on compute embeddings script 2021-05-29 21:11:53 -03:00
Edresson 208bb0f0ee add batched speaker encoder inference 2021-05-27 20:01:00 -03:00
Edresson 825734a3a9 remove unused embeddings export 2021-05-27 19:10:24 -03:00
Edresson 1496f271dc update Compute embeddings script 2021-05-27 00:45:18 -03:00
Edresson 3433c2f348 add compute embedding for the new speaker encoder 2021-05-12 03:06:46 -03:00
Eren Gölge f519012dea reformatting and styling 2021-04-12 11:47:39 +02:00
erogol 8a820930c6 compute_embedding update 2020-11-25 14:46:08 +01:00
erogol 67e2b664e5 compute embeddings and create speakers.json 2020-11-12 12:51:17 +01:00
erogol 016d3503da compute embeddings with speaker encoder 2020-11-10 12:51:02 +01:00