Commit Graph

372 Commits

Author SHA1 Message Date
Eren Gölge 66b6e9bc99 Make style 2021-12-10 07:53:10 +00:00
WeberJulian 7b81c16434 Fix seed in test_samplers to avoid random fails 2021-12-09 13:42:42 +00:00
WeberJulian 22c7be5f44 Add test for language_weighted_sampler 2021-12-09 13:42:42 +00:00
Edresson d79531c177 Add remove silence VAD script unit test 2021-12-09 13:41:56 +00:00
Edresson c382a3f6a9 Add find unique phonemes unit tests 2021-12-09 13:41:56 +00:00
WeberJulian 4a8c344fc8 Make a multilingual test use chars 2021-12-09 13:41:56 +00:00
WeberJulian 9c1bec86a4 Fix tests 2021-12-09 13:41:28 +00:00
Edresson 65b5ed4427 Add VITS multilingual d-vectors unit test 2021-12-09 13:38:18 +00:00
Edresson 88f4369c63 Fix the bug in multispeaker vits 2021-12-09 13:38:18 +00:00
WeberJulian 3e9ca4b95d make style 2021-12-09 13:36:48 +00:00
WeberJulian 6ed55ba57e fix test vits 2021-12-09 13:35:12 +00:00
Edresson ecf327a118 Add VITS multispeaker train unit test 2021-12-09 13:27:21 +00:00
Edresson 2bba769e67 Activate the multispeaker mode in multilingual training 2021-12-09 13:27:21 +00:00
Edresson 08da902af3 Add VITS d-vector unit test 2021-12-09 13:18:36 +00:00
Edresson 859cf1bfac Add VITS multilingual unit test 2021-12-09 13:18:36 +00:00
Edresson 32ece5d5ad Fix pylint issues 2021-12-09 13:11:06 +00:00
Michael Hansen 3bc043faeb Upgrade to gruut 2.0 (#882) 2021-10-31 11:41:55 +01:00
Eren Gölge 2df0752e73 Model zoo tests (#900)
* Fix VITS model multi-speaker init

* Remove gdrive support in model manager

* Add model zoo tests
2021-10-29 17:54:16 +02:00
Eren Gölge 25759d6a61 Split tests 2021-10-21 17:30:15 +00:00
Eren Gölge e62d3c5cf7 Use absolute imports for tts configs and models 2021-10-21 16:29:06 +00:00
Eren Gölge 4dbe7ed0de Fix all-zero duration case for GlowTTS 2021-10-01 09:24:26 +00:00
Eren Gölge 7edbe04fe0 Fix WaveRNN config and test 2021-09-30 16:20:12 +00:00
Eren Gölge 4cacbf0d45 Fix WaveRNN test 2021-09-30 14:47:56 +00:00
Eren Gölge 2766dd1d6e Fix #813 - GlowTTS training (#814)
* Fix #813

* Update glow_tts recipe

* Fix glow-tts test

* Linter fix

* Run data dep init only in training
2021-09-17 20:06:55 +02:00
Eren Gölge 1e7db32e90 Test FastPitch train 2021-09-11 10:19:47 +00:00
Eren Gölge 26f76fce22 Remove SpeedySpeech from .models.json 2021-09-10 17:47:27 +00:00
Eren Gölge 7ec23e69d4 Skip TF tests on GPU 2021-09-10 17:28:58 +00:00
Eren Gölge 1ebf9ec6bf Remove speedy_speech implementation 2021-09-10 17:28:20 +00:00
Eren Gölge 7d8f77385a Use `glow-tts` in synthesis tests 2021-09-10 17:27:33 +00:00
Eren Gölge d6e29ef98a Style update 2021-09-10 08:30:33 +00:00
Eren Gölge 3abc3a1d32 Fix GPU init in tests 2021-09-10 08:28:10 +00:00
Eren Gölge ed4b1d8514 Test `TTS.tts.utils.helpers` 2021-09-10 08:25:21 +00:00
Eren Gölge 8b7e094bde Implement `forward_tts`
- Generic API for feed-forward TTS models (FastPitch, SpeedySpeech)

- Tests for `forward-tts`

- Edit FastPitchConfig and SpeedySpeechConfig to use `forward_tts`
2021-09-10 08:24:33 +00:00
Eren Gölge 4761853c5c Fix imports 2021-09-08 13:34:40 +00:00
Eren Gölge e72c265cd4 Fix linter issues 2021-09-06 15:16:58 +00:00
Eren Gölge fd287aa438 Update loader tests for dict return 2021-09-06 15:16:58 +00:00
Eren Gölge 2c4bbbf9b9 Use pyworld for pitch 2021-09-06 15:16:58 +00:00
Eren Gölge 076d0cb258 Add tests for certain FastPitch functions 2021-09-06 15:16:58 +00:00
Eren Gölge d63a6bb690 Set BaseDatasetConfig for tests 2021-09-06 15:16:58 +00:00
Eren Gölge fba257104d Compute F0 using librosa 2021-09-06 15:16:58 +00:00
Katsuya Iida 165e5814af Update Japanese phonemizer (#758)
* Update default ja vocoder

* update

* Japanese phonemizer test

* Run make style

Co-authored-by: Eren Gölge <egolge@coqui.ai>
2021-09-01 09:33:15 +02:00
Eren Gölge f186856e5d Add option to sort input sequence by audio len 2021-08-30 08:10:35 +00:00
Eren Gölge c312acac7d Implement VITS model 🚀
VITS model implementation built on Glow TTS and HiFiGAN
layers.
2021-08-09 18:02:36 +00:00
Eren Gölge 003e5579e8 Enable `custom_symbols` in text processing
Models can define their own custom symbols lists with custom
`make_symbols()`
2021-08-09 18:02:36 +00:00
Eren Gölge e4648ffef1 Fix multi-speaker init of Tacotron models & tests 2021-08-09 18:02:36 +00:00
Agrin Hilmkil ced4cfdbbf Allow saving / loading checkpoints from cloud paths (#683)
* Allow saving / loading checkpoints from cloud paths

Allows saving and loading checkpoints directly from cloud paths like
Amazon S3 (s3://) and Google Cloud Storage (gs://) by using fsspec.

Note: The user will have to install the relevant dependency for each
protocol. Otherwise fsspec will fail and specify which dependency is
missing.

* Append suffix _fsspec to save/load function names

* Add a lower bound to the fsspec dependency

Skips the 0 major version.

* Add missing changes from refactor

* Use fsspec for remaining artifacts

* Add test case with path requiring fsspec

* Avoid writing logs to file unless output_path is local

* Document the possibility of using paths supported by fsspec

* Fix style and lint

* Add missing lint fixes

* Add type annotations to new functions

* Use Coqpit method for converting config to dict

* Fix type annotation in semi-new function

* Add return type for load_fsspec

* Fix bug where fs not always created

* Restore the experiment removal functionality
2021-08-09 18:02:36 +00:00
Eren Gölge 75b201c6c1 Merge pull request #673 from coqui-ai/fix_stopnet
Fix stopnet training for Tacotron models
2021-07-24 12:25:38 +02:00
Eren Gölge fc0c4600bd Fix stopnet training 2021-07-24 11:39:54 +02:00
Eren Gölge 30eed347b6 Merge pull request #581 from Edresson/dev
Compute speaker embeddings in batch for the LSTM Speaker Encoder and compute embeddings/find chars using config file.
2021-07-23 17:22:51 +02:00
WeberJulian c79a82ed07 refix linter 2021-07-13 23:12:18 +02:00