Edresson Casanova
047cebd7b8
Fix Style tests
2022-03-30 16:51:39 -03:00
Edresson Casanova
aebbdfc62b
Merge branch 'dev-managers' into dev-emotion
2022-03-30 16:25:47 -03:00
Edresson Casanova
34a92f1b1b
Fix the Bug in Synthesizer
2022-03-30 15:32:35 -03:00
Edresson Casanova
397b3e9baf
Fix style tests
2022-03-23 15:31:33 -03:00
Edresson Casanova
c7af7c6474
Implement LanguageManager inherit BaseIDManager
2022-03-23 15:26:59 -03:00
Edresson Casanova
40df2cfdd1
Change the speaker manager to a generic manager
2022-03-23 15:26:06 -03:00
Eren Gölge
72d85e53c9
Update model file extension ( #1422 )
...
* Update model file ext to ```.pth```
* Update docs
* Rename more
* Find model files
2022-03-22 17:55:00 +01:00
Edresson Casanova
10dee54ac3
Bug fix in single speaker emotion embedding training
2022-03-16 20:57:14 +00:00
Eren Gölge
0870a4faa2
Make style ( #1405 )
2022-03-16 12:13:55 +01:00
Edresson Casanova
38027b15c2
Fix unit tests
2022-03-15 19:40:07 +00:00
Edresson Casanova
e3520e9e9f
Add Emotion Support for the VITS model
2022-03-15 01:16:48 +00:00
Edresson Casanova
e33819b7de
Implement LanguageManager inherit BaseIDManager
2022-03-11 19:25:18 -03:00
Edresson Casanova
12e0b6f39e
Change the speaker manager to a generic manager
2022-03-11 17:09:58 -03:00
Edresson Casanova
f81892483d
REBASED: Transform Speaker Encoder in a Generic Encoder and Implement Emotion Encoder training support ( #1349 )
...
* Rename Speaker encoder module to encoder
* Add a generic emotion dataset formatter
* Transform the Speaker Encoder dataset to a generic dataset and create emotion encoder config
* Add class map in emotion config
* Add Base encoder config
* Add evaluation encoder script
* Fix the bug in plot_embeddings
* Enable Weight decay for encoder training
* Add argumnet to disable storage
* Add Perfect Sampler and remove storage
* Add evaluation during encoder training
* Fix lint checks
* Remove useless config parameter
* Active evaluation in speaker encoder test and use multispeaker dataset for this test
* Unit tests fixs
* Remove useless tests for speedup the aux_tests
* Use get_optimizer in Encoder
* Add BaseEncoder Class
* Fix the unitests
* Add Perfect Batch Sampler unit test
* Add compute encoder accuracy in a function
2022-03-11 14:43:40 +01:00
Edresson Casanova
36e9ea2f97
Open bible dataset formatter ( #1365 )
...
* Add support for voice conversion inference
* Cache d_vectors_by_speaker for fast inference using a bigger speakers.json
* Rebase bug fix
* Use the average d-vector for inference
* Fix the bug in find unique chars script
* Add OpenBible formatter
Co-authored-by: Eren Gölge <erogol@hotmail.com>
2022-03-11 10:43:31 +01:00
Edresson Casanova
dbe9da7f15
Add Voice conversion inference support ( #1337 )
...
* Add support for voice conversion inference
* Cache d_vectors_by_speaker for fast inference using a bigger speakers.json
* Rebase bug fix
* Use the average d-vector for inference
2022-03-10 14:57:12 +01:00
Edresson Casanova
f381e29b91
REBASED: Add support for the speaker encoder training using torch spectrograms ( #1348 )
...
* Add support for the speaker encoder training using torch spectrograms
* Remove useless function in speaker encoder dataset class
2022-03-10 14:54:51 +01:00
Eren Gölge
1425a023fe
Make style and lint
2022-03-02 13:25:35 +01:00
Eren Gölge
1e414b3a09
Make stlye
2022-02-25 11:31:56 +01:00
Eren Gölge
bf540f4323
Update imports for trainer
2022-02-25 11:31:56 +01:00
Eren Gölge
424d04e4f6
Make stlye
2022-02-25 11:31:56 +01:00
Eren Gölge
be3a03126a
Update imports for trainer
2022-02-25 11:28:14 +01:00
Eren Gölge
8622226f3f
Make style
2022-02-25 11:26:59 +01:00
Eren Gölge
27db089d6c
Change TrainingArgs -> TrainerArgs
2022-02-25 11:26:59 +01:00
Eren Gölge
1f0c8179da
Make style
2022-02-25 11:26:59 +01:00
Eren Gölge
bc2243bac4
Fix tests
2022-02-25 11:25:00 +01:00
Eren Gölge
2fe16de8e3
Make lint
2022-02-25 11:25:00 +01:00
Eren Gölge
8e248913d6
Update train_tts for the new API
2022-02-25 11:24:13 +01:00
Eren Gölge
730f7c0df4
Add file_ext args to resample.py
2022-02-25 11:15:46 +01:00
Eren Gölge
4d99fee3e2
Update spec extractor
2022-02-25 11:12:44 +01:00
Eren Gölge
38a0b3b6c7
Update train_tts.py
2022-02-25 11:11:35 +01:00
Eren Gölge
fbad17e084
Update imports for symbols -> characters
2022-02-25 10:48:02 +01:00
Eren Gölge
a51b031bff
Merge branch 'dev' into dev-fix-glowtts-infer
2022-02-21 12:01:40 +03:00
Edresson Casanova
28a7464975
Fix the bug in split dataset function ( #1251 )
...
* Fix the bug in split_dataset
* Make eval_split_size configurable
* Change test_loader to use load_tts_samples function
* Change eval_split_portion to eval_split_size and permits to set the absolute number of samples in eval
* Fix samplers unit test
* Add data unit test on GitHub workflow
2022-02-21 11:59:36 +03:00
Edresson Casanova
bc5db13d06
Fix the bug in extract tts spectrogram script
2022-02-19 19:24:00 +00:00
Eren Gölge
127118c637
Update TTS.tts formatters ( #1228 )
...
* Return Dict from tts formatters
* Make style
2022-02-11 23:03:43 +01:00
Edresson Casanova
0860d73cf8
Remove Tensorflow requeriment ( #1225 )
...
* Remove TF modules
* Remove TF unit tests
* Remove TF vocoder modules
* Remove TF convert scripts
* Remove TF requirement
* Remove the Docs TF instructions
* Remove TF inference support
2022-02-10 16:14:54 +01:00
WeberJulian
c7f5e005e1
Compute embedding for new audios only
2022-01-06 15:41:38 +01:00
WeberJulian
e1accb6e28
Fix train_tts.py and uncomment code ( #1051 )
...
* Fix SE loading and language embedding logic
* remove trailing white space
* Uncomment resmapling code for SCL
2022-01-03 17:44:57 +01:00
Eren Gölge
56378b12f7
Fix speaker encoder init
2021-12-21 12:26:25 +00:00
Eren Gölge
4c50f6f4df
Add functions to get and check and argument in config and config.model_args
2021-12-20 11:54:10 +00:00
Eren Gölge
704dddcffa
Make style
2021-12-20 11:54:10 +00:00
WeberJulian
a564eb9f54
Add support for multi-lingual models in CLI
2021-12-20 11:54:10 +00:00
WeberJulian
6b03943526
Move multilingual logic out of the trainer
2021-12-20 11:54:10 +00:00
Edresson
4196a42de7
Get the number speaker from the Speaker Manager property
2021-12-20 11:54:10 +00:00
Edresson
f394d60695
Fix the bug in multispeaker vits
2021-12-20 11:54:10 +00:00
Edresson
45d0b04179
Lint fixs
2021-12-20 11:54:10 +00:00
Edresson
85418ffeaa
Fix the bug in extract tts spectrograms
2021-12-20 11:54:10 +00:00
Edresson
34749f8727
Remove the call to get_speaker_manager
2021-12-20 11:54:10 +00:00
Edresson
352aa69eca
Create a module for the VAD script
2021-12-20 11:54:10 +00:00