Commit Graph

4028 Commits

Author SHA1 Message Date
Edresson Casanova 10dee54ac3 Bug fix in single speaker emotion embedding training 2022-03-16 20:57:14 +00:00
Edresson Casanova 38027b15c2 Fix unit tests 2022-03-15 19:40:07 +00:00
Edresson Casanova 4f03784b1f Add emotion external embeddings training unit test 2022-03-15 13:09:58 +00:00
Edresson Casanova 5090034fd1 Add emotion consistency loss 2022-03-15 12:35:00 +00:00
Edresson Casanova cc3821332b Fix the bug in sythesizer 2022-03-15 12:33:36 +00:00
Edresson Casanova e3520e9e9f Add Emotion Support for the VITS model 2022-03-15 01:16:48 +00:00
Edresson Casanova 18d3565d37 Add emotion manager 2022-03-14 14:26:40 +00:00
Edresson Casanova e52b40aca4 Fix bug in get_speaker_manager 2022-03-14 14:15:18 +00:00
Edresson Casanova 8040b930a8 Fix docs of set_language_ids_from_config 2022-03-14 14:14:37 +00:00
Edresson Casanova 0e258d1784 Add parse_key in set_ids_from_data 2022-03-14 13:53:46 +00:00
Edresson Casanova 464775dbaf Rename set_embeddings_from_file to load_embeddings_from_file 2022-03-14 13:34:16 +00:00
Edresson Casanova 7e59755d63 Rename set_ids_from_file to load_ids_from_file 2022-03-14 13:31:01 +00:00
Edresson Casanova 25da4d9b74 Fix unit tests 2022-03-11 19:55:29 -03:00
Edresson Casanova e33819b7de Implement LanguageManager inherit BaseIDManager 2022-03-11 19:25:18 -03:00
Edresson Casanova eac06a5e87 Add EmbeddingManager and BaseIDManager 2022-03-11 19:01:51 -03:00
Edresson Casanova 12e0b6f39e Change the speaker manager to a generic manager 2022-03-11 17:09:58 -03:00
Edresson Casanova f81892483d
REBASED: Transform Speaker Encoder in a Generic Encoder and Implement Emotion Encoder training support (#1349)
* Rename Speaker encoder module to encoder

* Add a generic emotion dataset formatter

* Transform the Speaker Encoder dataset to a generic dataset and create emotion encoder config

* Add class map in emotion config

* Add Base encoder config

* Add evaluation encoder script

* Fix the bug in plot_embeddings

* Enable Weight decay for encoder training

* Add argumnet to disable storage

* Add Perfect Sampler and remove storage

* Add evaluation during encoder training

* Fix lint checks

* Remove useless config parameter

* Active evaluation in speaker encoder test and use multispeaker dataset for this test

* Unit tests fixs

* Remove useless tests for speedup the aux_tests

* Use get_optimizer in Encoder

* Add BaseEncoder Class

* Fix the unitests

* Add Perfect Batch Sampler unit test

* Add compute encoder accuracy in a function
2022-03-11 14:43:40 +01:00
Edresson Casanova 36e9ea2f97
Open bible dataset formatter (#1365)
* Add support for voice conversion inference

* Cache d_vectors_by_speaker for fast inference using a bigger speakers.json

* Rebase bug fix

* Use the average d-vector for inference

* Fix the bug in find unique chars script

* Add OpenBible formatter

Co-authored-by: Eren Gölge <erogol@hotmail.com>
2022-03-11 10:43:31 +01:00
Eren Gölge b0be825d92
Update issue template (#1370)
* Add bug_report template

* Fix typos
2022-03-11 10:40:20 +01:00
Edresson Casanova dbe9da7f15
Add Voice conversion inference support (#1337)
* Add support for voice conversion inference

* Cache d_vectors_by_speaker for fast inference using a bigger speakers.json

* Rebase bug fix

* Use the average d-vector for inference
2022-03-10 14:57:12 +01:00
Edresson Casanova 917f417ac4
Add alphas to control language and speaker balancer (#1216)
* Add alphas to control language and speaker balancer

* Add docs for speaker and language samplers

* Change the Samplers weights to float for save memory

* Change the test_samplers to unittest format

* Add get_sampler method in BaseTTS

* Fix rebase issues

* Add language and speaker samplers support for DDP training

* Rename distributed sampler wrapper

* Remove the DistributedSamplerWrapper and use the one from Trainer

* Bugfix after rebase

* Move the samplers config to tts config
2022-03-10 14:56:09 +01:00
Edresson Casanova f381e29b91
REBASED: Add support for the speaker encoder training using torch spectrograms (#1348)
* Add support for the speaker encoder training using torch spectrograms

* Remove useless function in speaker encoder dataset class
2022-03-10 14:54:51 +01:00
Eren Gölge 07d96f7991 Fix DocQA title 2022-03-10 12:17:06 +01:00
Yanlong Wang 8a007c8834
feat: add docsqa to docs website (#1363) 2022-03-10 11:40:06 +01:00
Eren Gölge 48f6bb405a
Fix recipes as to the recent API changes. (#1367)
* Fix recipes -> #1366

* Fix trainer docs
2022-03-10 11:36:38 +01:00
Edresson Casanova d792b78703
Fix multilingual recipe (#1354) 2022-03-09 16:18:17 +01:00
Eren Gölge c670365507 Fix VCTK recipe and formatter 2022-03-08 14:20:34 +01:00
Eren Gölge 0cf3265a46
Merge pull request #1347 from coqui-ai/dev
v0.6.1
2022-03-07 16:02:19 +01:00
Eren Gölge 8feb41d361 Bump up to v0.6.1 2022-03-07 15:57:44 +01:00
Eren Gölge 6df69f79ea Revert DocQA as it fails on readthedocs 2022-03-07 15:54:43 +01:00
Eren Gölge 95e551dd0a Update requirements.txt for coqui-trainer 2022-03-07 14:31:25 +01:00
Eren Gölge 209ee40c88
Merge pull request #1288 from coqui-ai/dev
v0.6.0
2022-03-07 14:05:30 +01:00
Eren Gölge d87985cde1 Update docs 2022-03-07 12:27:13 +01:00
Eren Gölge ee02bc3823 Bump up to v0.6.0 2022-03-07 12:08:22 +01:00
Eren Gölge dc280819be Add new models 2022-03-07 12:08:09 +01:00
Eren Gölge 45f1e1f786 Update requirements.txt 2022-03-06 14:24:19 +01:00
Eren Gölge bdebe3d83e Fix typos 2022-03-06 14:19:46 +01:00
Eren Gölge 6716b3b214 Fix typo 2022-03-06 14:10:16 +01:00
Han Xiao 00edd3c99b feat: add dotbot 2022-03-06 13:34:33 +01:00
Eren Gölge e9d9028b4d Revert cleaner name 2022-03-06 12:57:06 +01:00
Eren Gölge 764c7fa4a4 Rename phoneme_cleaners 2022-03-06 12:09:54 +01:00
Eren Gölge bec543b3a5 Update zoo tests 2022-03-06 11:48:38 +01:00
Eren Gölge dd4287de1f Update models 2022-03-03 20:23:00 +01:00
Eren Gölge 6cb00be795 Update your_tts model URL 2022-03-02 18:04:49 +01:00
Eren Gölge fd71893ea9 Add missing deps for CI tests 2022-03-02 18:00:29 +01:00
Eren Gölge 1425a023fe Make style and lint 2022-03-02 13:25:35 +01:00
Eren Gölge c68885b3fd Update Vits speaker encoder init 2022-03-02 13:20:23 +01:00
Eren Gölge 27b67b7945 Fix import 2022-03-02 09:15:20 +01:00
Eren Gölge 942df0fb05 Update vits dataset 2022-03-02 09:14:32 +01:00
Eren Gölge a84499c5da Add text_tests 2022-03-01 07:58:12 +01:00