Edresson Casanova
3a524b0597
Add prosody encoder params on config
2022-05-16 09:45:28 -03:00
Edresson Casanova
5271846d9c
Add Speech style balancer
2022-04-19 15:51:15 -03:00
Edresson Casanova
093bd07528
Add reversal classifier loss
2022-04-18 21:09:59 -03:00
Edresson Casanova
8a3396d9c1
Add prosody encoder training support
2022-04-18 17:01:44 -03:00
Edresson Casanova
f31ba25233
Add emotion embedding in the encoder
2022-03-31 19:14:41 -03:00
Edresson Casanova
314f95f974
Add formatter for the Emotional Speech Dataset
2022-03-31 17:27:30 +00:00
Edresson Casanova
7be9056b3d
Remove useless encoder weights reload
2022-03-31 11:05:58 -03:00
Edresson Casanova
b692c77e6a
Fix emotion unit test
2022-03-31 08:34:08 -03:00
Edresson Casanova
047cebd7b8
Fix Style tests
2022-03-30 16:51:39 -03:00
Edresson Casanova
aebbdfc62b
Merge branch 'dev-managers' into dev-emotion
2022-03-30 16:25:47 -03:00
Edresson Casanova
34a92f1b1b
Fix the Bug in Synthesizer
2022-03-30 15:32:35 -03:00
Edresson Casanova
397b3e9baf
Fix style tests
2022-03-23 15:31:33 -03:00
Edresson Casanova
ab20a34170
Fix bug in get_speaker_manager
2022-03-23 15:27:01 -03:00
Edresson Casanova
cb941530df
Fix docs of set_language_ids_from_config
2022-03-23 15:27:01 -03:00
Edresson Casanova
2bc2685ff9
Add parse_key in set_ids_from_data
2022-03-23 15:27:01 -03:00
Edresson Casanova
88e0cfa5a0
Rename set_embeddings_from_file to load_embeddings_from_file
2022-03-23 15:27:01 -03:00
Edresson Casanova
b7eefac47d
Rename set_ids_from_file to load_ids_from_file
2022-03-23 15:27:01 -03:00
Edresson Casanova
24274c58f8
Fix unit tests
2022-03-23 15:27:01 -03:00
Edresson Casanova
c7af7c6474
Implement LanguageManager inherit BaseIDManager
2022-03-23 15:26:59 -03:00
Edresson Casanova
4fdc864f74
Add EmbeddingManager and BaseIDManager
2022-03-23 15:26:59 -03:00
Edresson Casanova
40df2cfdd1
Change the speaker manager to a generic manager
2022-03-23 15:26:06 -03:00
Eren Gölge
3af01cfe3b
Update base model wrt 👟 ( #1406 )
2022-03-23 17:24:20 +01:00
WeberJulian
3c7c14607b
Add formatting tests ( #1437 )
...
* Add style checks to `make lint`
* Bump target-version in black config
2022-03-23 17:23:36 +01:00
Eren Gölge
1c3623af33
Fix model manager ( #1436 )
...
* Fix manager
* Make style
2022-03-23 12:57:14 +01:00
Eren Gölge
72d85e53c9
Update model file extension ( #1422 )
...
* Update model file ext to ```.pth```
* Update docs
* Rename more
* Find model files
2022-03-22 17:55:00 +01:00
Edresson Casanova
ccdc2300dc
Add eval_split and eval_split_size in the call of load_tts_samples for all recipes ( #1424 )
2022-03-22 12:54:41 +01:00
Eren Gölge
2e6e8f651d
Update CheckSpectrograms notebook ( #1418 )
2022-03-18 16:48:24 +01:00
Eren Gölge
c7f9ec07c8
Hinge Gruut version to 2.2.3 ( #1419 )
2022-03-18 16:47:50 +01:00
Edresson Casanova
10dee54ac3
Bug fix in single speaker emotion embedding training
2022-03-16 20:57:14 +00:00
Eren Gölge
fd56fabb21
Fix #1380 ( #1409 )
2022-03-16 12:38:27 +01:00
Eren Gölge
0870a4faa2
Make style ( #1405 )
2022-03-16 12:13:55 +01:00
WeberJulian
690c96ed28
Fix default phonemizer for ja and zh ( #1399 )
2022-03-16 12:13:22 +01:00
Eren Gölge
f40b833659
Add CITATION.cff ( #1404 )
2022-03-16 12:05:17 +01:00
WeberJulian
24b57f6a0e
Fix typo workflow text ( #1403 )
2022-03-16 11:51:37 +01:00
Edresson Casanova
38027b15c2
Fix unit tests
2022-03-15 19:40:07 +00:00
Edresson Casanova
4f03784b1f
Add emotion external embeddings training unit test
2022-03-15 13:09:58 +00:00
Edresson Casanova
5090034fd1
Add emotion consistency loss
2022-03-15 12:35:00 +00:00
Edresson Casanova
cc3821332b
Fix the bug in sythesizer
2022-03-15 12:33:36 +00:00
Edresson Casanova
e3520e9e9f
Add Emotion Support for the VITS model
2022-03-15 01:16:48 +00:00
Edresson Casanova
18d3565d37
Add emotion manager
2022-03-14 14:26:40 +00:00
Edresson Casanova
e52b40aca4
Fix bug in get_speaker_manager
2022-03-14 14:15:18 +00:00
Edresson Casanova
8040b930a8
Fix docs of set_language_ids_from_config
2022-03-14 14:14:37 +00:00
Edresson Casanova
0e258d1784
Add parse_key in set_ids_from_data
2022-03-14 13:53:46 +00:00
Edresson Casanova
464775dbaf
Rename set_embeddings_from_file to load_embeddings_from_file
2022-03-14 13:34:16 +00:00
Edresson Casanova
7e59755d63
Rename set_ids_from_file to load_ids_from_file
2022-03-14 13:31:01 +00:00
Edresson Casanova
25da4d9b74
Fix unit tests
2022-03-11 19:55:29 -03:00
Edresson Casanova
e33819b7de
Implement LanguageManager inherit BaseIDManager
2022-03-11 19:25:18 -03:00
Edresson Casanova
eac06a5e87
Add EmbeddingManager and BaseIDManager
2022-03-11 19:01:51 -03:00
Edresson Casanova
12e0b6f39e
Change the speaker manager to a generic manager
2022-03-11 17:09:58 -03:00
Edresson Casanova
f81892483d
REBASED: Transform Speaker Encoder in a Generic Encoder and Implement Emotion Encoder training support ( #1349 )
...
* Rename Speaker encoder module to encoder
* Add a generic emotion dataset formatter
* Transform the Speaker Encoder dataset to a generic dataset and create emotion encoder config
* Add class map in emotion config
* Add Base encoder config
* Add evaluation encoder script
* Fix the bug in plot_embeddings
* Enable Weight decay for encoder training
* Add argumnet to disable storage
* Add Perfect Sampler and remove storage
* Add evaluation during encoder training
* Fix lint checks
* Remove useless config parameter
* Active evaluation in speaker encoder test and use multispeaker dataset for this test
* Unit tests fixs
* Remove useless tests for speedup the aux_tests
* Use get_optimizer in Encoder
* Add BaseEncoder Class
* Fix the unitests
* Add Perfect Batch Sampler unit test
* Add compute encoder accuracy in a function
2022-03-11 14:43:40 +01:00