Commit Graph

78 Commits

Author SHA1 Message Date
Edresson Casanova dcd0d1f6a1 Clean up old code 2022-05-16 13:09:12 +00:00
Edresson Casanova 093bd07528 Add reversal classifier loss 2022-04-18 21:09:59 -03:00
Edresson Casanova 047cebd7b8 Fix Style tests 2022-03-30 16:51:39 -03:00
Edresson Casanova aebbdfc62b Merge branch 'dev-managers' into dev-emotion 2022-03-30 16:25:47 -03:00
Edresson Casanova 397b3e9baf Fix style tests 2022-03-23 15:31:33 -03:00
Edresson Casanova ab20a34170 Fix bug in get_speaker_manager 2022-03-23 15:27:01 -03:00
Edresson Casanova 2bc2685ff9 Add parse_key in set_ids_from_data 2022-03-23 15:27:01 -03:00
Edresson Casanova 88e0cfa5a0 Rename set_embeddings_from_file to load_embeddings_from_file 2022-03-23 15:27:01 -03:00
Edresson Casanova b7eefac47d Rename set_ids_from_file to load_ids_from_file 2022-03-23 15:27:01 -03:00
Edresson Casanova 4fdc864f74 Add EmbeddingManager and BaseIDManager 2022-03-23 15:26:59 -03:00
Edresson Casanova 40df2cfdd1 Change the speaker manager to a generic manager 2022-03-23 15:26:06 -03:00
Eren Gölge 0870a4faa2 Make style (#1405) 2022-03-16 12:13:55 +01:00
Edresson Casanova e52b40aca4 Fix bug in get_speaker_manager 2022-03-14 14:15:18 +00:00
Edresson Casanova 0e258d1784 Add parse_key in set_ids_from_data 2022-03-14 13:53:46 +00:00
Edresson Casanova 464775dbaf Rename set_embeddings_from_file to load_embeddings_from_file 2022-03-14 13:34:16 +00:00
Edresson Casanova 7e59755d63 Rename set_ids_from_file to load_ids_from_file 2022-03-14 13:31:01 +00:00
Edresson Casanova eac06a5e87 Add EmbeddingManager and BaseIDManager 2022-03-11 19:01:51 -03:00
Edresson Casanova 12e0b6f39e Change the speaker manager to a generic manager 2022-03-11 17:09:58 -03:00
Edresson Casanova f81892483d
REBASED: Transform Speaker Encoder in a Generic Encoder and Implement Emotion Encoder training support (#1349)
* Rename Speaker encoder module to encoder

* Add a generic emotion dataset formatter

* Transform the Speaker Encoder dataset to a generic dataset and create emotion encoder config

* Add class map in emotion config

* Add Base encoder config

* Add evaluation encoder script

* Fix the bug in plot_embeddings

* Enable Weight decay for encoder training

* Add argument to disable storage

* Add Perfect Sampler and remove storage

* Add evaluation during encoder training

* Fix lint checks

* Remove useless config parameter

* Activate evaluation in speaker encoder test and use multispeaker dataset for this test

* Unit test fixes

* Remove useless tests to speed up the aux_tests

* Use get_optimizer in Encoder

* Add BaseEncoder Class

* Fix the unit tests

* Add Perfect Batch Sampler unit test

* Add compute encoder accuracy in a function
2022-03-11 14:43:40 +01:00
Edresson Casanova dbe9da7f15
Add Voice conversion inference support (#1337)
* Add support for voice conversion inference

* Cache d_vectors_by_speaker for fast inference using a bigger speakers.json

* Rebase bug fix

* Use the average d-vector for inference
2022-03-10 14:57:12 +01:00
Edresson Casanova 917f417ac4
Add alphas to control language and speaker balancer (#1216)
* Add alphas to control language and speaker balancer

* Add docs for speaker and language samplers

* Change the sampler weights to float to save memory

* Change the test_samplers to unittest format

* Add get_sampler method in BaseTTS

* Fix rebase issues

* Add language and speaker samplers support for DDP training

* Rename distributed sampler wrapper

* Remove the DistributedSamplerWrapper and use the one from Trainer

* Bugfix after rebase

* Move the samplers config to tts config
2022-03-10 14:56:09 +01:00
Eren Gölge 5176ae9e53 Fixes small compat. issues 2022-02-25 11:21:19 +01:00
Eren Gölge f802a931a3 Pass samples to init_from_config in SpeakerManager 2022-02-25 11:07:34 +01:00
Eren Gölge 3de9f38d16 Add init_from_config to SpeakerManager 2022-02-25 10:48:03 +01:00
Eren Gölge 127118c637
Update TTS.tts formatters (#1228)
* Return Dict from tts formatters

* Make style
2022-02-11 23:03:43 +01:00
Eren Gölge 348b5c96a2 Fix speaker encoder test 2021-12-30 15:36:35 +00:00
Eren Gölge 5c5ddd2ba7 Init speaker manager for speaker encoder 2021-12-22 15:51:53 +00:00
Eren Gölge a25269d897 Remove commented code 2021-12-20 11:54:10 +00:00
Eren Gölge 79de38ca76 Rename setup_model to setup_speaker_encoder_model 2021-12-20 11:54:10 +00:00
Edresson b769b49e34 Remove the data from the set_d_vectors_from_file function 2021-12-20 11:54:10 +00:00
Edresson 9daa33d1fd Remove unusable speaker manager function 2021-12-20 11:54:10 +00:00
WeberJulian e995a63bd6 fix linter 2021-12-20 11:54:10 +00:00
WeberJulian 1472b6df49 make style 2021-12-20 11:54:10 +00:00
WeberJulian 005bba60b0 get_speaker_weighted_sampler 2021-12-20 11:54:10 +00:00
Edresson 9b011b1cb3 Add H/ASP original checkpoint support 2021-12-20 11:54:09 +00:00
Edresson ac9416fb86 Add multilingual inference support 2021-12-20 11:54:09 +00:00
Edresson dcb2374bc9 Add multilingual training support to the VITS model 2021-12-20 11:54:09 +00:00
Edresson 5f1c18187f Fix pylint issues 2021-12-20 11:54:09 +00:00
Edresson d91c595c5a Implement training support with d_vecs in the VITS model 2021-12-20 11:54:09 +00:00
Edresson e0ad838066 Randomly select a speaker from the speaker manager for the test sentences 2021-12-20 11:54:09 +00:00
Eren Gölge 3cb07fb6b5 Fix SpeakerManager init with data items 2021-10-21 13:54:39 +00:00
Eren Gölge 9f23ad6a0f Fix imports 2021-09-30 14:47:56 +00:00
Eren Gölge e4648ffef1 Fix multi-speaker init of Tacotron models & tests 2021-08-09 18:02:36 +00:00
Agrin Hilmkil ced4cfdbbf Allow saving / loading checkpoints from cloud paths (#683)
* Allow saving / loading checkpoints from cloud paths

Allows saving and loading checkpoints directly from cloud paths like
Amazon S3 (s3://) and Google Cloud Storage (gs://) by using fsspec.

Note: The user will have to install the relevant dependency for each
protocol. Otherwise fsspec will fail and specify which dependency is
missing.

* Append suffix _fsspec to save/load function names

* Add a lower bound to the fsspec dependency

Skips the 0 major version.

* Add missing changes from refactor

* Use fsspec for remaining artifacts

* Add test case with path requiring fsspec

* Avoid writing logs to file unless output_path is local

* Document the possibility of using paths supported by fsspec

* Fix style and lint

* Add missing lint fixes

* Add type annotations to new functions

* Use Coqpit method for converting config to dict

* Fix type annotation in semi-new function

* Add return type for load_fsspec

* Fix bug where fs not always created

* Restore the experiment removal functionality
2021-08-09 18:02:36 +00:00
Edresson 2e5baffa9c Merge fix and eval split as argparse 2021-07-13 01:47:32 -03:00
Eren Gölge c25a2184e7 Add docs for `SpeakerManager` 2021-07-03 13:55:27 +02:00
Eren Gölge f23b228e24 Update `speaker_manager` 2021-06-28 17:03:47 +02:00
Eren Gölge 419735f440 refactor and fix multi-speaker training in Trainer and Tacotron models 2021-06-28 17:03:19 +02:00
Eren Gölge 2c38ef8441 use get_speaker_manager in Trainer and save speakers.json file when needed 2021-06-28 17:03:19 +02:00
Eren Gölge db6a97d1a2 rename external speaker embedding arguments as `d_vectors` 2021-06-28 17:03:19 +02:00