Commit Graph

437 Commits

Author SHA1 Message Date
Edresson Casanova bdefc43d96 Bug fix on pre-compute F0 2022-05-19 13:48:02 +00:00
Edresson Casanova d94b8bac02 Add pitch predictor 2022-05-16 21:53:49 +00:00
Edresson Casanova 3a524b0597 Add prosody encoder params on config 2022-05-16 09:45:28 -03:00
Edresson Casanova 5271846d9c Add Speech style balancer 2022-04-19 15:51:15 -03:00
Edresson Casanova 8a3396d9c1 Add prosody encoder training support 2022-04-18 17:01:44 -03:00
Edresson Casanova 7be9056b3d Remove useless encoder weights reload 2022-03-31 11:05:58 -03:00
Edresson Casanova b692c77e6a Fix emotion unit test 2022-03-31 08:34:08 -03:00
Edresson Casanova aebbdfc62b
Merge branch 'dev-managers' into dev-emotion 2022-03-30 16:25:47 -03:00
Edresson Casanova 40df2cfdd1 Change the speaker manager to a generic manager 2022-03-23 15:26:06 -03:00
Eren Gölge 72d85e53c9
Update model file extension (#1422)
* Update model file ext to ```.pth```

* Update docs

* Rename more

* Find model files
2022-03-22 17:55:00 +01:00
Eren Gölge 0870a4faa2
Make style (#1405) 2022-03-16 12:13:55 +01:00
Edresson Casanova 4f03784b1f Add emotion external embeddings training unit test 2022-03-15 13:09:58 +00:00
Edresson Casanova 5090034fd1 Add emotion consistency loss 2022-03-15 12:35:00 +00:00
Edresson Casanova e3520e9e9f Add Emotion Support for the VITS model 2022-03-15 01:16:48 +00:00
Edresson Casanova 12e0b6f39e Change the speaker manager to a generic manager 2022-03-11 17:09:58 -03:00
Edresson Casanova f81892483d
REBASED: Transform Speaker Encoder in a Generic Encoder and Implement Emotion Encoder training support (#1349)
* Rename Speaker encoder module to encoder

* Add a generic emotion dataset formatter

* Transform the Speaker Encoder dataset to a generic dataset and create emotion encoder config

* Add class map in emotion config

* Add Base encoder config

* Add evaluation encoder script

* Fix the bug in plot_embeddings

* Enable Weight decay for encoder training

* Add argumnet to disable storage

* Add Perfect Sampler and remove storage

* Add evaluation during encoder training

* Fix lint checks

* Remove useless config parameter

* Active evaluation in speaker encoder test and use multispeaker dataset for this test

* Unit tests fixs

* Remove useless tests for speedup the aux_tests

* Use get_optimizer in Encoder

* Add BaseEncoder Class

* Fix the unitests

* Add Perfect Batch Sampler unit test

* Add compute encoder accuracy in a function
2022-03-11 14:43:40 +01:00
Edresson Casanova 917f417ac4
Add alphas to control language and speaker balancer (#1216)
* Add alphas to control language and speaker balancer

* Add docs for speaker and language samplers

* Change the Samplers weights to float for save memory

* Change the test_samplers to unittest format

* Add get_sampler method in BaseTTS

* Fix rebase issues

* Add language and speaker samplers support for DDP training

* Rename distributed sampler wrapper

* Remove the DistributedSamplerWrapper and use the one from Trainer

* Bugfix after rebase

* Move the samplers config to tts config
2022-03-10 14:56:09 +01:00
Eren Gölge 1425a023fe Make style and lint 2022-03-02 13:25:35 +01:00
Eren Gölge 27b67b7945 Fix import 2022-03-02 09:15:20 +01:00
Eren Gölge 690de1ab06 Update Characters and add more tests 2022-02-25 11:32:44 +01:00
Eren Gölge 14c117978d Fix return outputs 2022-02-25 11:31:56 +01:00
Eren Gölge 424d04e4f6 Make stlye 2022-02-25 11:31:56 +01:00
Eren Gölge c0b40a0cb7 Update VITS tests 2022-02-25 11:31:20 +01:00
Eren Gölge b0cff949f5 Update tests 2022-02-25 11:28:14 +01:00
Eren Gölge 1f0c8179da Make style 2022-02-25 11:26:59 +01:00
Eren Gölge ef63c99524 Implement `start_by_longest` option for TTSDatase 2022-02-25 11:26:18 +01:00
Eren Gölge c4c471d61d Allow padding for shorter segments 2022-02-25 11:25:48 +01:00
Eren Gölge bc2243bac4 Fix tests 2022-02-25 11:25:00 +01:00
Eren Gölge 21940952bf Make lint 2022-02-25 11:25:00 +01:00
Eren Gölge 146fbfd7c9 Extend unittests 2022-02-25 11:25:00 +01:00
Eren Gölge 2fe16de8e3 Make lint 2022-02-25 11:25:00 +01:00
Eren Gölge d0eb3e4ef2 Add get_tests_data_path 2022-02-25 11:24:13 +01:00
Eren Gölge 235f7d9b02 Extend glow_tts model tests 2022-02-25 11:24:13 +01:00
Eren Gölge 5176ae9e53 Fixes small compat. issues 2022-02-25 11:21:19 +01:00
Eren Gölge edec27738b Delete `use_espeak_phonemes` from tests 2022-02-25 11:18:00 +01:00
Eren Gölge 0a47a7eac0 Update tests 2022-02-25 11:12:44 +01:00
Eren Gölge b341951b78 Update loader tests 2022-02-25 11:12:44 +01:00
Eren Gölge 196ae74273 Update data loader tests 2022-02-25 11:05:06 +01:00
Eren Gölge 75c507c36a Update VITS LJspeech recipe 2022-02-25 10:57:35 +01:00
Eren Gölge 04202da1ac Make style 2022-02-25 10:48:03 +01:00
Eren Gölge 961e98a461 Add OOV case to tokenizer tests 2022-02-25 10:48:03 +01:00
Eren Gölge 8c8093ce23 Make style 2022-02-25 10:48:03 +01:00
Eren Gölge f1ea3ad182 Remove old text processing tests 2022-02-25 10:48:02 +01:00
Eren Gölge ba3b60c90f Test TTSTokenizer 2022-02-25 10:48:02 +01:00
Eren Gölge 79a84410f2 Test punctuations 2022-02-25 10:48:02 +01:00
Eren Gölge 99d9bb7a17 Test Phonemizers 2022-02-25 10:48:02 +01:00
Eren Gölge a1df4f9887 Test character classes 2022-02-25 10:45:24 +01:00
Eren Gölge a51b031bff
Merge branch 'dev' into dev-fix-glowtts-infer 2022-02-21 12:01:40 +03:00
Edresson Casanova 28a7464975
Fix the bug in split dataset function (#1251)
* Fix the bug in split_dataset

* Make eval_split_size configurable

* Change test_loader to use load_tts_samples function

* Change eval_split_portion to eval_split_size and permits to set the absolute number of samples in eval

* Fix samplers unit test

* Add data unit test on GitHub workflow
2022-02-21 11:59:36 +03:00
Edresson Casanova 531821545e Fix inference test issue 2022-02-19 12:21:32 +00:00