Commit Graph

218 Commits

Author SHA1 Message Date
Eren Gölge d227aaebcc Print when using Griffin-Lim in Synthesizer 2021-11-01 16:52:26 +01:00
George 37eaefc085
Optional silence trimming during inference and find_endpoint() fix (#898)
* Set find_endpoint db threshold in config.json

* Optional silence trimming during inference

* Make trim_db value negative
2021-10-29 18:28:55 +02:00
Eren Gölge 2df0752e73
Model zoo tests (#900)
* Fix VITS model multi-speaker init

* Remove gdrive support in model manager

* Add model zoo tests
2021-10-29 17:54:16 +02:00
Eren Gölge 035ed432bc
Doc update (#889)
* Link source files from the docs

* Update glowTTS recipes for docs

* Add dataset downloaders
2021-10-26 17:41:33 +02:00
Eren Gölge 1987aaaaed Update d-vector reshape in synthesizer 2021-10-21 13:53:25 +00:00
Eren Gölge 92b6d98443 Set pitch frame alignment wrt spec computation 2021-10-20 18:12:38 +00:00
Eren Gölge 0a3d1cc7ee Pass speaker manager to the model in synthesizer 2021-10-20 18:11:36 +00:00
Eren Gölge 3c7848e9b1 Don't OOR values in train console log 2021-10-19 16:32:16 +00:00
Eren Gölge c514351c0e Refactor multi-speaker init in BaseTTS-Tacotron1-2 2021-10-18 08:55:45 +00:00
Eren Gölge 700b056117 Update Synthesizer multi-speaker handling 2021-10-15 10:21:12 +00:00
Eren Gölge 9a0d8fa027 Update `copy_model_files()` 2021-09-30 14:47:56 +00:00
Eren Gölge 8ada870a57 Refactor `trainer.py` for v2 2021-09-30 14:16:34 +00:00
Eren Gölge 7d8f77385a Use `glow-tts` in synthesis tests 2021-09-10 17:27:33 +00:00
Eren Gölge 742f9c54da Warn user if nan in GL 2021-09-10 08:26:05 +00:00
Eren Gölge 4761853c5c Fix imports 2021-09-08 13:34:40 +00:00
Eren Gölge 2c4bbbf9b9 Use pyworld for pitch 2021-09-06 15:16:58 +00:00
Eren Gölge 98a7271ce8 Refactor FastPitchv2 2021-09-06 15:16:58 +00:00
Eren Gölge 42862f7fdb Format style of the recipes 2021-09-06 15:16:58 +00:00
Eren Gölge aacbb3ed77 Fix SpeakerManager usage in `synthesize.py` 2021-09-06 15:16:58 +00:00
Eren Gölge 5a6ffaee08 Add yin based pitch computation 2021-09-06 15:16:58 +00:00
Eren Gölge d085642ac1 Cache pitch features
Cache the features at the beginning of `BaseTTS` training.
2021-09-06 15:16:58 +00:00
Eren Gölge fba257104d Compute F0 using librosa 2021-09-06 15:16:58 +00:00
Eren Gölge d16da949a5 Merge branch 'fix_distribute' into dev 2021-08-30 16:31:07 +00:00
Eren Gölge 5255e089e6 Fix #767 2021-08-30 13:10:08 +00:00
Eren Gölge c560114324 Fix #750 2021-08-30 13:06:50 +00:00
Eren Gölge 18da8f5dbd Update pylint 2.10.2 and fix lint issues 2021-08-30 08:10:35 +00:00
Eren Gölge 2620f62ea8 Move duration_loss inside VitsGeneratorLoss 2021-08-27 07:07:07 +00:00
Eren Gölge 1692b8e4d9
Merge pull request #726 from fijipants/patch-1
Fix bug with log_func
2021-08-26 22:11:29 +02:00
Eren Gölge 49e1181ea4 Fixes for the vits model 2021-08-26 17:15:09 +00:00
fijipants e9e01b09b0 Fix bug with log_func 2021-08-18 19:59:51 -04:00
fijipants 8f57f8adfd Update synthesizer.py 2021-08-18 19:56:52 -04:00
Eren Gölge 7c0d564965 Syncronize DDP processes 2021-08-13 10:40:50 +00:00
Eren Gölge ecf5f17dca Fix distribute.py and ddp training 2021-08-12 22:22:32 +00:00
Eren Gölge 537bc8487a Print model count when listing modelsk 2021-08-10 16:25:11 +00:00
Ayush Chaurasia f3e9d61330 Refactor logging initialization 2021-08-09 18:35:08 +00:00
Ayush Chaurasia 79b74a989d Update: add_text 2021-08-09 18:34:38 +00:00
Ayush Chaurasia 9fcf48b760 Delete logger_base.py 2021-08-09 18:34:00 +00:00
Ayush Chaurasia 290972fd35 reformat 2021-08-09 18:34:00 +00:00
Ayush Chaurasia 936a47504d Update Logger API, recipes 2021-08-09 18:34:00 +00:00
Ayush Chaurasia f63cf46c55 Unified logger API 2021-08-09 18:34:00 +00:00
Ayush Chaurasia f4434da5a3 Update disabled structure 2021-08-09 18:31:16 +00:00
Ayush Chaurasia f606741dc4 Add artifacts logging , wandb args 2021-08-09 18:31:16 +00:00
Ayush Chaurasia f5e50ad502 WandbLogger 2021-08-09 18:27:06 +00:00
Eren Gölge c312acac7d Implement VITS model 🚀
VITS model implementation built on Glow TTS and HiFiGAN
layers.
2021-08-09 18:02:36 +00:00
Eren Gölge 060e746e21 Add `do_amp_to_db` option 2021-08-09 18:02:36 +00:00
Eren Gölge e94c1f894d Simplify `console_logger` 2021-08-09 18:02:36 +00:00
Eren Gölge dd55960732 Update `synthesizer.py`
Fixes and changes for multi-speaker model init and custom symbols  made
by mode.make_symbols()
2021-08-09 18:02:36 +00:00
Eren Gölge d4deb2716f Modify `get_optimizer` to accept a model argument 2021-08-09 18:02:36 +00:00
Agrin Hilmkil ced4cfdbbf Allow saving / loading checkpoints from cloud paths (#683)
* Allow saving / loading checkpoints from cloud paths

Allows saving and loading checkpoints directly from cloud paths like
Amazon S3 (s3://) and Google Cloud Storage (gs://) by using fsspec.

Note: The user will have to install the relevant dependency for each
protocol. Otherwise fsspec will fail and specify which dependency is
missing.

* Append suffix _fsspec to save/load function names

* Add a lower bound to the fsspec dependency

Skips the 0 major version.

* Add missing changes from refactor

* Use fsspec for remaining artifacts

* Add test case with path requiring fsspec

* Avoid writing logs to file unless output_path is local

* Document the possibility of using paths supported by fsspec

* Fix style and lint

* Add missing lint fixes

* Add type annotations to new functions

* Use Coqpit method for converting config to dict

* Fix type annotation in semi-new function

* Add return type for load_fsspec

* Fix bug where fs not always created

* Restore the experiment removal functionality
2021-08-09 18:02:36 +00:00
Eren Gölge a05b234080 Raise an error when multiple GPUs are in use
User must define the target GPU by `CUDA_VISIBLE_DEVICES` and
use `distribute.py` for multi-gpu training.
2021-07-04 11:25:49 +02:00