Commit Graph

420 Commits

Author SHA1 Message Date
WeberJulian 6f01eed672 Add test for language_weighted_sampler 2021-12-20 11:54:10 +00:00
Edresson a57ddfb4ec Add remove silence vad script Unit test 2021-12-20 11:54:10 +00:00
Edresson e068fab6b2 Add find unique phonemes unit tests 2021-12-20 11:54:10 +00:00
WeberJulian 54e33bff61 Make a multilingual test use chars 2021-12-20 11:54:10 +00:00
WeberJulian 09eda31a3f Fix tests 2021-12-20 11:54:10 +00:00
Edresson 06d89f93a8 Add VITS multilingual d-vectors unit test 2021-12-20 11:54:10 +00:00
Edresson f394d60695 Fix the bug in multispeaker vits 2021-12-20 11:54:10 +00:00
WeberJulian 1472b6df49 make style 2021-12-20 11:54:10 +00:00
WeberJulian 3b5592abcf fix test vits 2021-12-20 11:54:10 +00:00
Edresson bbdb5c38e6 Add VITS multispeaker train unit test 2021-12-20 11:54:09 +00:00
Edresson 92f7f4f400 Active the multispeaker mode in multilingual training 2021-12-20 11:54:09 +00:00
Edresson e68b042493 Add VITS d-vector unit test 2021-12-20 11:54:09 +00:00
Edresson 959cc8f03c Add VITS multilingual unit test 2021-12-20 11:54:09 +00:00
Edresson 3fbbebd74d Fix pylint issues 2021-12-20 11:54:09 +00:00
Michael Hansen 3bc043faeb
Upgrade to gruut 2.0 (#882) 2021-10-31 11:41:55 +01:00
Eren Gölge 2df0752e73
Model zoo tests (#900)
* Fix VITS model multi-speaker init

* Remove gdrive support in model manager

* Add model zoo tests
2021-10-29 17:54:16 +02:00
Eren Gölge 25759d6a61 Split tests 2021-10-21 17:30:15 +00:00
Eren Gölge e62d3c5cf7 Use absolute imports for tts configs and models 2021-10-21 16:29:06 +00:00
Eren Gölge 4dbe7ed0de Fix all-zero duration case for GlowTTS 2021-10-01 09:24:26 +00:00
Eren Gölge 7edbe04fe0 Fix WaveRNN config and test 2021-09-30 16:20:12 +00:00
Eren Gölge 4cacbf0d45 Fix WaveRNN test 2021-09-30 14:47:56 +00:00
Eren Gölge 2766dd1d6e
Fix #813 - GlowTTS training (#814)
* Fix #813

* Update glow_tts recipe

* Fix glow-tts test

* Linter fix

* Run data dep init only in training
2021-09-17 20:06:55 +02:00
Eren Gölge 1e7db32e90 Test FastPitch train 2021-09-11 10:19:47 +00:00
Eren Gölge 26f76fce22 Remove SpeedySpeech from .models.json 2021-09-10 17:47:27 +00:00
Eren Gölge 7ec23e69d4 Skip TF tests on GPU 2021-09-10 17:28:58 +00:00
Eren Gölge 1ebf9ec6bf Remove speedy_speech implementation 2021-09-10 17:28:20 +00:00
Eren Gölge 7d8f77385a Use `glow-tts` in synthesis tests 2021-09-10 17:27:33 +00:00
Eren Gölge d6e29ef98a Style update 2021-09-10 08:30:33 +00:00
Eren Gölge 3abc3a1d32 Fix GPU init in tests 2021-09-10 08:28:10 +00:00
Eren Gölge ed4b1d8514 Test `TTS.tts.utils.helpers` 2021-09-10 08:25:21 +00:00
Eren Gölge 8b7e094bde Implement `forward_tts`
- Generic API for feed-forward TTS models (FastPitch, SpeedySpeech)

- Tests for `forward-tts`

- Edit  FastPitchConfig and SpeedySpeechConfig to use `forward_tts`
2021-09-10 08:24:33 +00:00
Eren Gölge 4761853c5c Fix imports 2021-09-08 13:34:40 +00:00
Eren Gölge e72c265cd4 Fix linter issues 2021-09-06 15:16:58 +00:00
Eren Gölge fd287aa438 Update loader tests for dict return 2021-09-06 15:16:58 +00:00
Eren Gölge 2c4bbbf9b9 Use pyworld for pitch 2021-09-06 15:16:58 +00:00
Eren Gölge 076d0cb258 Add tests for certain FastPitch functions 2021-09-06 15:16:58 +00:00
Eren Gölge d63a6bb690 Set BaseDatasetConfig for tests 2021-09-06 15:16:58 +00:00
Eren Gölge fba257104d Compute F0 using librosa 2021-09-06 15:16:58 +00:00
Katsuya Iida 165e5814af
Update Japanese phonemizer (#758)
* Update default ja vocoder

* update

* Japanese phonemizer test

* Run make style

Co-authored-by: Eren Gölge <egolge@coqui.ai>
2021-09-01 09:33:15 +02:00
Eren Gölge f186856e5d Add option to sort input sequnce by audio len 2021-08-30 08:10:35 +00:00
Eren Gölge c312acac7d Implement VITS model 🚀
VITS model implementation built on Glow TTS and HiFiGAN
layers.
2021-08-09 18:02:36 +00:00
Eren Gölge 003e5579e8 Enable `custom_symbols` in text processing
Models can define their own custom symbols lists with custom
`make_symbols()`
2021-08-09 18:02:36 +00:00
Eren Gölge e4648ffef1 Fix multi-speaker init of Tacotron models & tests 2021-08-09 18:02:36 +00:00
Agrin Hilmkil ced4cfdbbf Allow saving / loading checkpoints from cloud paths (#683)
* Allow saving / loading checkpoints from cloud paths

Allows saving and loading checkpoints directly from cloud paths like
Amazon S3 (s3://) and Google Cloud Storage (gs://) by using fsspec.

Note: The user will have to install the relevant dependency for each
protocol. Otherwise fsspec will fail and specify which dependency is
missing.

* Append suffix _fsspec to save/load function names

* Add a lower bound to the fsspec dependency

Skips the 0 major version.

* Add missing changes from refactor

* Use fsspec for remaining artifacts

* Add test case with path requiring fsspec

* Avoid writing logs to file unless output_path is local

* Document the possibility of using paths supported by fsspec

* Fix style and lint

* Add missing lint fixes

* Add type annotations to new functions

* Use Coqpit method for converting config to dict

* Fix type annotation in semi-new function

* Add return type for load_fsspec

* Fix bug where fs not always created

* Restore the experiment removal functionality
2021-08-09 18:02:36 +00:00
Eren Gölge 75b201c6c1
Merge pull request #673 from coqui-ai/fix_stopnet
Fix stopnet training for Tacotron models
2021-07-24 12:25:38 +02:00
Eren Gölge fc0c4600bd Fix stopnet training 2021-07-24 11:39:54 +02:00
Eren Gölge 30eed347b6
Merge pull request #581 from Edresson/dev
Compute speaker embeddings in batch for the LSTM  Speaker Encoder and Compute embeddings/ finding chars using config file.
2021-07-23 17:22:51 +02:00
WeberJulian c79a82ed07 refix linter 2021-07-13 23:12:18 +02:00
Edresson 2e5baffa9c Merge fix and eval split as argparse 2021-07-13 01:47:32 -03:00
Edresson 4eac1c4651 bug fix on train_encoder and unit tests 2021-07-11 12:00:39 -03:00
Eren Gölge 1e9538aaef Add more model tests to `test_synthesize` 2021-07-04 11:45:49 +02:00
Eren Gölge 47b3b10d6d Bump up to v0.1.0 🚀 2021-06-29 13:07:59 +02:00
Eren Gölge 9790eddada Fix wrong argument name 🛠️ 2021-06-28 17:03:47 +02:00
Eren Gölge 626c9d41e6 Update tests for the new trainer API 2021-06-28 17:03:19 +02:00
Eren Gölge fcfd95669a Update model test configs 2021-06-28 17:03:19 +02:00
Eren Gölge 00c82c516d rename to 2021-06-28 17:03:19 +02:00
Eren Gölge fdfb18d230 downsize melgan test model size 2021-06-28 17:03:19 +02:00
Eren Gölge 25238e0658 fix glow-tts `inference()` 2021-06-28 17:03:19 +02:00
Eren Gölge 82582993cc use one testing sentence in tts tests 2021-06-28 17:03:19 +02:00
Eren Gölge 419735f440 refactor and fix multi-speaker training in Trainer and Tacotron models 2021-06-28 17:03:19 +02:00
Eren Gölge 269e5a734e add max_decoder_steps argument to tacotron models 2021-06-28 17:03:19 +02:00
Eren Gölge 304d60197b reduce multiband melgan test model size 2021-06-28 17:03:19 +02:00
Eren Gölge db6a97d1a2 rename external speaker embedding arguments as `d_vectors` 2021-06-28 17:03:19 +02:00
Eren Gölge 877bf66b61 reduce size of the metadata.csv used at testing 2021-06-28 17:03:19 +02:00
Eren Gölge 87c61d210a update test to be less demanding 2021-06-28 17:03:19 +02:00
Eren Gölge 6d6896fd99 reduce fullband-melgan test model size 2021-06-28 17:03:19 +02:00
Eren Gölge 1443d03af1 update test for the new input output API of the tts models 2021-06-28 17:03:19 +02:00
Eren Gölge ef4ea9e527 update imports for `formatters` 2021-06-28 17:03:19 +02:00
Eren Gölge 6c495c6a6e fix glow-tts inference and forward functions for handling `cond_input`
and refactor its test
2021-06-28 17:03:19 +02:00
Eren Gölge b500338faa make style 2021-06-28 17:03:19 +02:00
Eren Gölge d25f017b42 update `setup_model.py` imports 2021-06-28 17:03:19 +02:00
Eren Gölge 7dff6be871 update tts training tests to use the trainer 2021-06-28 17:03:19 +02:00
Eren Gölge 9134c7dfb6 update `sequence_mask` import globally 2021-06-28 17:03:19 +02:00
Eren Gölge 8def3c87af trainer-API updates 2021-06-28 17:03:19 +02:00
Eren Gölge 42554cc711 rename MyDataset -> TTSDataset 2021-06-28 17:03:19 +02:00
Edresson Casanova eb84bb2bc8
Merge branch 'dev' into dev 2021-06-26 15:32:19 -03:00
Eren Gölge 6c7bbcaef0 Use `en-us` for testing phoneme models 2021-06-25 16:52:17 +02:00
Michael Hansen a41f53fe72 Fix silly error in tests 2021-06-25 14:41:35 +02:00
Michael Hansen 3f172b84d8 Fix linting issues 2021-06-25 14:41:31 +02:00
Michael Hansen 4d8426fa0a Use eSpeak IPA lexicons by default for phoneme models 2021-06-25 14:41:05 +02:00
Michael Hansen 47191f3ecc Add tests for gruut phonemization 2021-06-25 14:41:05 +02:00
Edresson 28bec238ca fix Lint checks 2021-06-18 14:33:50 -03:00
Edresson 83644056e3 fix Lint checks 2021-06-18 14:32:28 -03:00
Eren Gölge db48c69f0f reduce fullband melgan model size for testing 2021-06-02 11:44:53 +02:00
Eren Gölge 49c5e5d820 maket style japanese PR 2021-06-02 11:44:46 +02:00
Eren Gölge 0c14278c30 reorg test files 2021-06-02 11:40:26 +02:00
Eren Gölge 73b4083c6c
Merge pull request #502 from kaiidams/kaiidams/kokoro
Japanese Tacotron 2 model
2021-06-02 10:20:08 +02:00
Katsuya Iida 1cc18d1972 Move unittest of Japanese phonemizer. 2021-06-01 18:51:34 +09:00
Eren Gölge bec85ac58d make style 2021-05-31 16:37:15 +02:00
Eren Gölge 301c516abd Merge branch 'dev' of https://github.com/coqui-ai/TTS into dev 2021-05-31 15:46:25 +02:00
Edresson cc192b6843 add resnet speaker encoder train unit test 2021-05-29 22:43:41 -03:00
Eren Gölge 925c08cf95 replace unidecode with anyascii 2021-05-27 14:02:44 +02:00
Edresson bc5307caa0 add unit tests for SoftmaxAngleProtoLoss and ResnetSpeakerEncoder and bugfix 2021-05-26 20:35:58 -03:00
Edresson 7a9a27282a fix unit tests 2021-05-26 18:14:06 -03:00
Eren Gölge f5a9950a2e phoneme files for testing 2021-05-19 03:16:57 +02:00
Eren Gölge 9b706c5583 enable phonemes in test to match the attention masks 2021-05-19 03:15:54 +02:00
Eren Gölge 8a7c40736c set use_phonemes false 2021-05-19 01:27:26 +02:00
Eren Gölge faedea4b60 set use_phonemes False in configs 2021-05-19 00:08:07 +02:00
Eren Gölge ccfaa6b1d5 add `needs_phonemizer` field to models.json. If set true these models
are only compatible with v0.0.13 or below.
2021-05-18 17:57:28 +02:00
Eren Gölge a14fcf2a13 remove text_processing test 2021-05-18 17:57:28 +02:00