Edresson Casanova
856e185641
Add Resnet prosody encoder support
2022-06-13 13:47:22 +00:00
Edresson Casanova
4b59f07946
Support the use of speaker embedding as emotion embedding
2022-06-08 09:52:39 -03:00
Edresson Casanova
360b969c23
Fix rebase issues
2022-06-08 09:52:39 -03:00
Edresson Casanova
f50819a5f6
Fix compute embeddings issue
2022-06-08 09:52:37 -03:00
Edresson Casanova
0fb1b200c6
Fix rebase issues
2022-06-08 09:52:37 -03:00
Edresson Casanova
e8c4417f07
Fix Style tests
2022-06-08 09:52:36 -03:00
Edresson Casanova
a8292c7c03
Fix the Bug in Synthesizer
2022-06-08 09:52:36 -03:00
Edresson Casanova
6f33506d89
Fix unit tests
2022-06-08 09:52:35 -03:00
Eren Gölge
f70e82cd19
Use fsspec and torch for embedding file IO (#1581)
* Use fsspec and torch for embedding file
* Fixup
* Fix load and save files
* Fix compute embedding script
* Set use_cuda to true if available
* Add dummy speakers.pth file
* Make style
* Change default speakers file extension
Co-authored-by: WeberJulian <julian.weber@hotmail.fr>
2022-06-01 13:49:42 +02:00
Edresson Casanova
6233f4fcd7
Bug fix in compute embedding without eval partition
2022-04-26 13:58:03 -03:00
Edresson Casanova
060e0f9368
Add EmbeddingManager and BaseIDManager (#1374)
2022-03-31 13:41:16 +02:00
Eren Gölge
72d85e53c9
Update model file extension (#1422)
* Update model file ext to `.pth`
* Update docs
* Rename more
* Find model files
2022-03-22 17:55:00 +01:00
Edresson Casanova
f81892483d
REBASED: Transform Speaker Encoder into a Generic Encoder and Implement Emotion Encoder training support (#1349)
* Rename Speaker encoder module to encoder
* Add a generic emotion dataset formatter
* Transform the Speaker Encoder dataset to a generic dataset and create emotion encoder config
* Add class map in emotion config
* Add Base encoder config
* Add evaluation encoder script
* Fix the bug in plot_embeddings
* Enable Weight decay for encoder training
* Add argument to disable storage
* Add Perfect Sampler and remove storage
* Add evaluation during encoder training
* Fix lint checks
* Remove useless config parameter
* Activate evaluation in speaker encoder test and use multispeaker dataset for this test
* Unit test fixes
* Remove useless tests to speed up the aux_tests
* Use get_optimizer in Encoder
* Add BaseEncoder Class
* Fix the unit tests
* Add Perfect Batch Sampler unit test
* Add a function to compute encoder accuracy
2022-03-11 14:43:40 +01:00
Eren Gölge
127118c637
Update TTS.tts formatters (#1228)
* Return Dict from tts formatters
* Make style
2022-02-11 23:03:43 +01:00
WeberJulian
c7f5e005e1
Compute embedding for new audios only
2022-01-06 15:41:38 +01:00
Eren Gölge
043dca61b4
Rename `load_meta_data` as `load_tts_data`
2021-09-30 14:47:56 +00:00
Eren Gölge
4b7b88dd3d
Add fullband-melgan DE vocoder
2021-07-26 15:38:30 +02:00
Edresson Casanova
d5adc35fdf
Add docstring to compute_embeddings script
2021-07-21 07:16:10 -03:00
Edresson
b1620d1f3f
remove ignore generate eval flag
2021-07-15 03:34:28 -03:00
Edresson
d906fea08c
lint fix and eval as argparse in extract tts spectrograms
2021-07-13 02:15:31 -03:00
Edresson
2e5baffa9c
Merge fix and eval split as argparse
2021-07-13 01:47:32 -03:00
Eren Gölge
8f47f95998
correct import of `load_meta_data`
remove redundant import
2021-06-28 17:03:19 +02:00
Edresson
1c4e806f54
use speaker manager on compute embeddings script
2021-06-27 03:35:34 -03:00
Edresson
99d40e98d9
fix Lint checks
2021-06-18 14:59:01 -03:00
Edresson
28bec238ca
fix Lint checks
2021-06-18 14:33:50 -03:00
Edresson
83644056e3
fix Lint checks
2021-06-18 14:32:28 -03:00
Edresson Casanova
e78e3cd81e
Merge branch 'dev' into dev
2021-06-18 14:10:03 -03:00
Edresson
b74b510d3c
Compute embeddings and find characters using config file
2021-06-18 14:04:49 -03:00
Eren Gölge
bec85ac58d
make style
2021-05-31 16:37:15 +02:00
Edresson
7448177b72
use SpeakerManager on compute embeddings script
2021-05-29 21:11:53 -03:00
Edresson
208bb0f0ee
add batched speaker encoder inference
2021-05-27 20:01:00 -03:00
Edresson
825734a3a9
remove unused embeddings export
2021-05-27 19:10:24 -03:00
Edresson
1496f271dc
update Compute embeddings script
2021-05-27 00:45:18 -03:00
Edresson
3433c2f348
add compute embedding for the new speaker encoder
2021-05-12 03:06:46 -03:00
Eren Gölge
f519012dea
reformatting and styling
2021-04-12 11:47:39 +02:00
erogol
8a820930c6
compute_embedding update
2020-11-25 14:46:08 +01:00
erogol
67e2b664e5
compute embeddings and create speakers.json
2020-11-12 12:51:17 +01:00
erogol
016d3503da
compute embeddings with speaker encoder
2020-11-10 12:51:02 +01:00