Commit Graph

44 Commits

Author SHA1 Message Date
Eren Gölge 914280a556
Bump up to v0.11.0 (#2329)
* Make style

* Bump up to v0.11.0
2023-02-08 13:58:49 +01:00
Eren Gölge 9e5a469c64
d-vector handling (#1945)
* Update BaseDatasetConfig

- Add dataset_name
- Chane name to formatter_name

* Update compute_embedding

- Allow entering dataset by args
- Use released model by default
- Use the new key format

* Update loading

* Update recipes

* Update other dep code

* Update tests

* Fixup

* Load multiple embedding files

* Fix argument names in dep code

* Update docs

* Fix argument name

* Fix linter
2022-09-13 14:10:33 +02:00
Edresson Casanova 060e0f9368
Add EmbeddingManager and BaseIDManager (#1374) 2022-03-31 13:41:16 +02:00
Eren Gölge 1425a023fe Make style and lint 2022-03-02 13:25:35 +01:00
Eren Gölge 4d99fee3e2 Update spec extractor 2022-02-25 11:12:44 +01:00
Eren Gölge a51b031bff
Merge branch 'dev' into dev-fix-glowtts-infer 2022-02-21 12:01:40 +03:00
Edresson Casanova 28a7464975
Fix the bug in split dataset function (#1251)
* Fix the bug in split_dataset

* Make eval_split_size configurable

* Change test_loader to use load_tts_samples function

* Change eval_split_portion to eval_split_size and permits to set the absolute number of samples in eval

* Fix samplers unit test

* Add data unit test on GitHub workflow
2022-02-21 11:59:36 +03:00
Edresson Casanova bc5db13d06 Fix the bug in extract tts spectrogram script 2022-02-19 19:24:00 +00:00
Eren Gölge 704dddcffa Make style 2021-12-20 11:54:10 +00:00
Edresson 85418ffeaa Fix the bug in extract tts spectrograms 2021-12-20 11:54:10 +00:00
Edresson 34749f8727 Remove the call to get_speaker_manager 2021-12-20 11:54:10 +00:00
Eren Gölge 043dca61b4 Rename `load_meta_data` as `load_tts_data` 2021-09-30 14:47:56 +00:00
Eren Gölge 3c740d4893 Style extract_tts_spectrogram.py 2021-09-10 08:21:21 +00:00
Eren Gölge 807f1d3817 Fix `extract_tts_spectrograms.py` model init 2021-09-09 08:59:55 +00:00
Eren Gölge 91a70e80b2 Refactor TTSDataset
Return a dict by `collate`
Refactor batch handling in `collate`
A couple of bug fixes
2021-09-06 15:16:58 +00:00
Eren Gölge 18da8f5dbd Update pylint 2.10.2 and fix lint issues 2021-08-30 08:10:35 +00:00
Eren Gölge f186856e5d Add option to sort input sequnce by audio len 2021-08-30 08:10:35 +00:00
Agrin Hilmkil ced4cfdbbf Allow saving / loading checkpoints from cloud paths (#683)
* Allow saving / loading checkpoints from cloud paths

Allows saving and loading checkpoints directly from cloud paths like
Amazon S3 (s3://) and Google Cloud Storage (gs://) by using fsspec.

Note: The user will have to install the relevant dependency for each
protocol. Otherwise fsspec will fail and specify which dependency is
missing.

* Append suffix _fsspec to save/load function names

* Add a lower bound to the fsspec dependency

Skips the 0 major version.

* Add missing changes from refactor

* Use fsspec for remaining artifacts

* Add test case with path requiring fsspec

* Avoid writing logs to file unless output_path is local

* Document the possibility of using paths supported by fsspec

* Fix style and lint

* Add missing lint fixes

* Add type annotations to new functions

* Use Coqpit method for converting config to dict

* Fix type annotation in semi-new function

* Add return type for load_fsspec

* Fix bug where fs not always created

* Restore the experiment removal functionality
2021-08-09 18:02:36 +00:00
Edresson b1620d1f3f remove ignore generate eval flag 2021-07-15 03:34:28 -03:00
Edresson d906fea08c lint fix and eval as argparse in extract tts spectrograms 2021-07-13 02:15:31 -03:00
Edresson 2e5baffa9c Merge fix and eval split as argparse 2021-07-13 01:47:32 -03:00
Eren Gölge 9790eddada Fix wrong argument name 🛠️ 2021-06-28 17:03:47 +02:00
Eren Gölge 45947acb60 Update `TTS.bin` scripts for the new API 2021-06-28 17:03:47 +02:00
Eren Gölge 00c82c516d rename to 2021-06-28 17:03:19 +02:00
Eren Gölge d6b2b6add6 make style and linter fixes 2021-06-28 17:03:19 +02:00
Eren Gölge 802d461389 Compute d_vectors and speaker_ids separately in TTSDataset 2021-06-28 17:03:19 +02:00
Eren Gölge db6a97d1a2 rename external speaker embedding arguments as `d_vectors` 2021-06-28 17:03:19 +02:00
Eren Gölge b500338faa make style 2021-06-28 17:03:19 +02:00
Eren Gölge 469d2e620a update extract_tts_spectrogram for `cond_input` API of the models 2021-06-28 17:03:19 +02:00
Eren Gölge 5ab28fa618 update `extract_tts_spec...` using `SpeakerManager` 2021-06-28 17:03:19 +02:00
Eren Gölge c392fa4288 update `extract_tts_spectrograms` for the new model API 2021-06-28 17:03:19 +02:00
Eren Gölge 8f47f95998 correct import of `load_meta_data`
remove redundant import
2021-06-28 17:03:19 +02:00
Eren Gölge d25f017b42 update `setup_model.py` imports 2021-06-28 17:03:19 +02:00
Eren Gölge 42554cc711 rename MyDataset -> TTSDataset 2021-06-28 17:03:19 +02:00
Edresson Casanova eb84bb2bc8
Merge branch 'dev' into dev 2021-06-26 15:32:19 -03:00
Michael Hansen 3f172b84d8 Fix linting issues 2021-06-25 14:41:31 +02:00
Adam Froghyar b0aa189348 Forcing do_trim_silence to False in the extract TTS script 2021-06-14 10:44:00 +02:00
Eren Gölge 12722501bb styling 2021-05-15 23:48:31 +02:00
Eren Gölge 715b0a65a0 update main.yml for python x64
fix test
2021-05-12 00:57:29 +02:00
Edresson 501c8e0302 remove unused vars on extract tts spectrograms script 2021-05-04 19:04:13 -03:00
Edresson 3ecd556bbe add unit test for extract tts spectrograms script 2021-05-01 13:41:56 -03:00
Edresson 446b1da936 create inference function 2021-04-29 18:18:37 -03:00
Edresson 20e42a3381 add save audio option 2021-04-23 15:00:00 -03:00
Edresson 8228091f92 add script for extraction of tts spectrograms 2021-04-23 14:17:46 -03:00