Edresson Casanova
a9208e9edd
Use get_optimizer in Encoder
2022-03-10 13:58:17 -03:00
Edresson Casanova
a436fe40a3
Remove useless tests for speedup the aux_tests
2022-03-10 11:50:18 -03:00
Edresson Casanova
247da8ef12
Unit tests fixs
2022-03-10 11:50:18 -03:00
Edresson Casanova
631aec6e88
Active evaluation in speaker encoder test and use multispeaker dataset for this test
2022-03-10 11:50:18 -03:00
Edresson Casanova
3a7feadba4
Remove useless config parameter
2022-03-10 11:50:18 -03:00
Edresson Casanova
711a46506f
Fix lint checks
2022-03-10 11:50:18 -03:00
Edresson Casanova
33fd07a209
Add evaluation during encoder training
2022-03-10 11:50:18 -03:00
Edresson Casanova
0e372e0b9b
Add Perfect Sampler and remove storage
2022-03-10 11:50:18 -03:00
Edresson Casanova
8ba3385747
Add argumnet to disable storage
2022-03-10 11:50:18 -03:00
Edresson Casanova
984b6d9fd1
Enable Weight decay for encoder training
2022-03-10 11:50:18 -03:00
Edresson Casanova
1c1684bdc5
Fix the bug in plot_embeddings
2022-03-10 11:50:18 -03:00
Edresson Casanova
0a06d1e67b
Add evaluation encoder script
2022-03-10 11:50:18 -03:00
Edresson Casanova
f811af7651
Add Base encoder config
2022-03-10 11:50:18 -03:00
Edresson Casanova
33ac13e44e
Add class map in emotion config
2022-03-10 11:50:18 -03:00
Edresson Casanova
854c887764
Transform the Speaker Encoder dataset to a generic dataset and create emotion encoder config
2022-03-10 11:50:18 -03:00
Edresson Casanova
1c6d16cffc
Add a generic emotion dataset formatter
2022-03-10 11:50:18 -03:00
Edresson Casanova
71a1907f4c
Rename Speaker encoder module to encoder
2022-03-10 11:50:18 -03:00
Edresson Casanova
dbe9da7f15
Add Voice conversion inference support ( #1337 )
...
* Add support for voice conversion inference
* Cache d_vectors_by_speaker for fast inference using a bigger speakers.json
* Rebase bug fix
* Use the average d-vector for inference
2022-03-10 14:57:12 +01:00
Edresson Casanova
917f417ac4
Add alphas to control language and speaker balancer ( #1216 )
...
* Add alphas to control language and speaker balancer
* Add docs for speaker and language samplers
* Change the Samplers weights to float for save memory
* Change the test_samplers to unittest format
* Add get_sampler method in BaseTTS
* Fix rebase issues
* Add language and speaker samplers support for DDP training
* Rename distributed sampler wrapper
* Remove the DistributedSamplerWrapper and use the one from Trainer
* Bugfix after rebase
* Move the samplers config to tts config
2022-03-10 14:56:09 +01:00
Edresson Casanova
f381e29b91
REBASED: Add support for the speaker encoder training using torch spectrograms ( #1348 )
...
* Add support for the speaker encoder training using torch spectrograms
* Remove useless function in speaker encoder dataset class
2022-03-10 14:54:51 +01:00
Eren Gölge
07d96f7991
Fix DocQA title
2022-03-10 12:17:06 +01:00
Yanlong Wang
8a007c8834
feat: add docsqa to docs website ( #1363 )
2022-03-10 11:40:06 +01:00
Eren Gölge
48f6bb405a
Fix recipes as to the recent API changes. ( #1367 )
...
* Fix recipes -> #1366
* Fix trainer docs
2022-03-10 11:36:38 +01:00
Edresson Casanova
d792b78703
Fix multilingual recipe ( #1354 )
2022-03-09 16:18:17 +01:00
Eren Gölge
c670365507
Fix VCTK recipe and formatter
2022-03-08 14:20:34 +01:00
Eren Gölge
0cf3265a46
Merge pull request #1347 from coqui-ai/dev
...
v0.6.1
2022-03-07 16:02:19 +01:00
Eren Gölge
8feb41d361
Bump up to v0.6.1
2022-03-07 15:57:44 +01:00
Eren Gölge
6df69f79ea
Revert DocQA as it fails on readthedocs
2022-03-07 15:54:43 +01:00
Eren Gölge
95e551dd0a
Update requirements.txt for coqui-trainer
2022-03-07 14:31:25 +01:00
Eren Gölge
209ee40c88
Merge pull request #1288 from coqui-ai/dev
...
v0.6.0
2022-03-07 14:05:30 +01:00
Eren Gölge
d87985cde1
Update docs
2022-03-07 12:27:13 +01:00
Eren Gölge
ee02bc3823
Bump up to v0.6.0
2022-03-07 12:08:22 +01:00
Eren Gölge
dc280819be
Add new models
2022-03-07 12:08:09 +01:00
Eren Gölge
45f1e1f786
Update requirements.txt
2022-03-06 14:24:19 +01:00
Eren Gölge
bdebe3d83e
Fix typos
2022-03-06 14:19:46 +01:00
Eren Gölge
6716b3b214
Fix typo
2022-03-06 14:10:16 +01:00
Han Xiao
00edd3c99b
feat: add dotbot
2022-03-06 13:34:33 +01:00
Eren Gölge
e9d9028b4d
Revert cleaner name
2022-03-06 12:57:06 +01:00
Eren Gölge
764c7fa4a4
Rename phoneme_cleaners
2022-03-06 12:09:54 +01:00
Eren Gölge
bec543b3a5
Update zoo tests
2022-03-06 11:48:38 +01:00
Eren Gölge
dd4287de1f
Update models
2022-03-03 20:23:00 +01:00
Eren Gölge
6cb00be795
Update your_tts model URL
2022-03-02 18:04:49 +01:00
Eren Gölge
fd71893ea9
Add missing deps for CI tests
2022-03-02 18:00:29 +01:00
Eren Gölge
1425a023fe
Make style and lint
2022-03-02 13:25:35 +01:00
Eren Gölge
c68885b3fd
Update Vits speaker encoder init
2022-03-02 13:20:23 +01:00
Eren Gölge
27b67b7945
Fix import
2022-03-02 09:15:20 +01:00
Eren Gölge
942df0fb05
Update vits dataset
2022-03-02 09:14:32 +01:00
Eren Gölge
a84499c5da
Add text_tests
2022-03-01 07:58:12 +01:00
Eren Gölge
6a9f8074f0
Fix TTSDataset
2022-03-01 07:57:48 +01:00
Eren Gölge
690de1ab06
Update Characters and add more tests
2022-02-25 11:32:44 +01:00