Commit Graph

4099 Commits

Author SHA1 Message Date
Eren G??lge 8adcd1de8e Rename `g` as `spk_emb` 2022-05-17 13:47:06 +02:00
Eren G??lge 2d29e8219d Fix up 2022-05-17 13:47:06 +02:00
Eren G??lge 8e915b70e0 Make hifigan discriminator configurable 2022-05-17 13:47:06 +02:00
Eren G??lge c437db15fd Fix dirt 2022-05-17 13:46:05 +02:00
Eren Gölge a05c82f9ef Fix audio_config handling 2022-05-17 13:46:05 +02:00
Eren Gölge b3fb0e19e8 Implement get_state_dict 2022-05-17 13:46:05 +02:00
Eren Gölge ce4f96292a Remove remaned trainer functions 2022-05-17 13:46:05 +02:00
Eren Gölge 96779e75ba Return duration by ForwardTTS inference 2022-05-17 13:46:05 +02:00
Eren Gölge 9291d13c69 Make style 2022-05-17 13:46:05 +02:00
Eren Gölge edd59c81e8 Update ForwardTTSe2e tests 2022-05-17 13:46:05 +02:00
Eren Gölge 0b585b46c1 Refactor TTSDataset to use numpy transforms 2022-05-17 13:44:01 +02:00
Eren Gölge 4171f4e9c6 Update ForwardTTSE2eLoss 2022-05-17 13:44:01 +02:00
Eren Gölge dbe5eb992e Make AP optional in BaseTTS 2022-05-17 13:44:01 +02:00
Eren Gölge 6a53b77a95 Add numpy and torch transforms 2022-05-17 13:44:01 +02:00
Eren Gölge c3fb49bf76 Refactor ForwardTTS to skip decoder 2022-05-17 13:44:01 +02:00
Eren Gölge cc57c20162 Make plot results more general 2022-05-17 13:44:01 +02:00
Eren Gölge e7c5db0d97 Add missing kernel size attr to transformer layer 2022-05-17 13:44:01 +02:00
Eren Gölge 231c69b12e Remove AP from FastPitchE2e 2022-05-17 13:44:01 +02:00
Eren Gölge 4556c61902 Update fastpitche2e recipe 2022-05-17 13:44:01 +02:00
Eren Gölge 5f9d559419 Update import statements 2022-05-17 13:44:01 +02:00
Eren Gölge 9f8d86b716 Remove redundancy 2022-05-17 13:42:09 +02:00
Eren Gölge 0738cb0efe Fix Vocoder logging 2022-05-17 13:42:09 +02:00
Eren Gölge 760f045aaa Rename vars in VITS 2022-05-17 13:42:09 +02:00
Eren Gölge 775a6ab6ee Add cond layer in decoder 2022-05-17 13:38:53 +02:00
Eren Gölge 28a53c7462 Refactor multi-speaker init in ForwardTTS 2022-05-17 13:38:53 +02:00
Eren Gölge c125024da0 Implement BaseTTSE2E 2022-05-17 13:38:53 +02:00
Eren Gölge b16613c5ad Implement ForwardTTSE2E Loss 2022-05-17 13:38:53 +02:00
Eren Gölge aea8cb7668 Implement FastPitchE2E LJSpeech recipe 2022-05-17 13:38:53 +02:00
Eren Gölge 2a61b8fdaf Implement ForwardTTSE2E tests 2022-05-17 13:38:53 +02:00
Eren Gölge 85731482e1 Implement FastPitchE2EConfig 2022-05-17 13:38:53 +02:00
Eren Gölge fccda5ae7b Implement ForwardTTSE2Eg 2022-05-17 13:38:53 +02:00
Eren Gölge f237e4ccd9
Merge pull request #1574 from coqui-ai/update_badge
Update CI badges
2022-05-13 14:58:05 +02:00
Eren Gölge e282da5161 Update CI badges 2022-05-13 14:56:49 +02:00
Edresson Casanova e5d8ec2402
Change the VITS upsampling interpolation trick to linear (#1564) 2022-05-13 10:52:39 +02:00
Edresson Casanova c6008e5235
Add audio length sampler balancer (#1561)
* Add audio length sampler balancer

* Add unit tests
2022-05-12 19:59:19 +02:00
Eren Gölge 6e460b7e42
Add an assert for the upsampling trick (#1538) 2022-05-12 19:55:24 +02:00
Eren Gölge 6048959e24
Add CPU only Docker image (#1573)
Co-authored-by: Reuben Morais <reuben.morais@gmail.com>
2022-05-12 19:33:27 +02:00
Eren Gölge 27cf388a79
Update CI tests (#1572)
* Use direct model URLs in CI

* Fixup

* Fixup
2022-05-12 18:41:01 +02:00
Eren Gölge 4857967063
🐍 Python 3.10.x support and drop Python 3.6 support (#1565)
* Update requirements

* Update CI for p3.10

* Update numpy requirement

* Drop 🐍p3.6 support

Numpy also dropped support for p3.6

* Bind cython v0.29.28

* Bind pyworld to v0.2.10

> 0.2.10 is not p3.10.x compatible

* Update Dockerfile
2022-05-12 15:50:25 +02:00
Edresson Casanova a97eed696a
Fix the bug in eSpeak wrapper for eSpeak version 1.48.15 (#1560) 2022-05-12 15:15:18 +02:00
Eren Gölge e45ae57aef
Merge pull request #1550 from coqui-ai/fix-upsampling-asserts
Fix VITS upsampling asserts
2022-05-12 14:51:41 +02:00
Edresson Casanova 175ca06388 Add reinit text encoder and duration predictor parameter (#1562)
* Add reinit encoder and duration predictor option

* Add .data to prevent any overlooked autograd hook
2022-05-12 09:08:36 -03:00
Edresson Casanova 182711043c Fix the VITS upsampling asserts
Fix style
2022-05-12 09:08:29 -03:00
Taras Sereda f9d91a55f2
Improve data_path resolvement (#1567) 2022-05-12 13:10:35 +02:00
Eren Gölge 2fc38f67d2 Update SpeakerManager init in Synthesizer 2022-05-11 11:32:27 +02:00
Eren Gölge c3f8c4d5eb Return default SpeakerManager if no d_vector_file 2022-05-11 11:31:45 +02:00
Eren Gölge 121e9ed685 Pass use_cuda to init_encoder 2022-05-11 11:31:17 +02:00
Eren Gölge c18bd21b3f Return durations at VITS inference 2022-05-11 11:30:05 +02:00
Eren Gölge 5021a03de0 Use torch.no_grad for VITS inference 2022-05-11 11:29:36 +02:00
Eren Gölge 3f03e3012c Fix batch_group_size in VITS 2022-05-07 13:44:44 +02:00