Commit Graph

70 Commits

Author SHA1 Message Date
Eren Gölge 36cef5966b Fix resnet speaker encoder 2021-12-30 15:36:35 +00:00
Eren Gölge 348b5c96a2 Fix speaker encoder test 2021-12-30 15:36:35 +00:00
Eren Gölge 638091f41d Update Speaker Encoder models 2021-12-30 12:02:06 +00:00
Eren Gölge d29c3780d1 Use speaker_encoder from speaker manager in Vits 2021-12-20 11:54:10 +00:00
Eren Gölge 3c6d7f495c Fixup 2021-12-20 11:54:10 +00:00
Eren Gölge 3818bd0c23 Fixup 2021-12-20 11:54:10 +00:00
Eren Gölge 7a987db62b Use torchaudio for ResNet speaker encoder 2021-12-20 11:54:10 +00:00
Eren Gölge 704dddcffa Make style 2021-12-20 11:54:10 +00:00
Edresson d39200e69b Remove torchaudio requirement 2021-12-20 11:54:10 +00:00
Edresson 45d0b04179 Lint fixes 2021-12-20 11:54:10 +00:00
WeberJulian 1472b6df49 make style 2021-12-20 11:54:10 +00:00
Edresson 9b011b1cb3 Add H/ASP original checkpoint support 2021-12-20 11:54:09 +00:00
Eren Gölge 37959ad0c7 Make linter 2021-09-30 23:02:16 +00:00
Eren Gölge 2e9b6b4f90 Refactor Speaker Encoder training 2021-09-30 14:47:56 +00:00
Eren Gölge 6782d3eab7 Fix linter issues for p3.6 2021-08-30 16:18:33 +00:00
Eren Gölge 18da8f5dbd Update pylint 2.10.2 and fix lint issues 2021-08-30 08:10:35 +00:00
Agrin Hilmkil ced4cfdbbf Allow saving / loading checkpoints from cloud paths (#683) 2021-08-09 18:02:36 +00:00
* Allow saving / loading checkpoints from cloud paths

Allows saving and loading checkpoints directly from cloud paths like
Amazon S3 (s3://) and Google Cloud Storage (gs://) by using fsspec.

Note: The user will have to install the relevant dependency for each
protocol. Otherwise fsspec will fail and specify which dependency is
missing.

* Append suffix _fsspec to save/load function names

* Add a lower bound to the fsspec dependency

Skips the 0 major version.

* Add missing changes from refactor

* Use fsspec for remaining artifacts

* Add test case with path requiring fsspec

* Avoid writing logs to file unless output_path is local

* Document the possibility of using paths supported by fsspec

* Fix style and lint

* Add missing lint fixes

* Add type annotations to new functions

* Use Coqpit method for converting config to dict

* Fix type annotation in semi-new function

* Add return type for load_fsspec

* Fix bug where fs not always created

* Restore the experiment removal functionality
Eren Gölge 4b7b88dd3d Add fullband-melgan DE vocoder 2021-07-26 15:38:30 +02:00
Edresson 1c4e806f54 use speaker manager on compute embeddings script 2021-06-27 03:35:34 -03:00
Edresson Casanova e78e3cd81e Merge branch 'dev' into dev 2021-06-18 14:10:03 -03:00
Edresson 14b209c7e9 Create a batch for faster inference on LSTM Speaker Encoder 2021-06-05 03:12:17 -03:00
Eren Gölge bec85ac58d make style 2021-05-31 16:37:15 +02:00
Edresson 7448177b72 use SpeakerManager on compute embeddings script 2021-05-29 21:11:53 -03:00
Edresson 208bb0f0ee add batched speaker encoder inference 2021-05-27 20:01:00 -03:00
Edresson bc5307caa0 add unit tests for SoftmaxAngleProtoLoss and ResnetSpeakerEncoder and bugfix 2021-05-26 20:35:58 -03:00
Edresson c90037c2e9 solve merge problems 2021-05-26 16:01:30 -03:00
Edresson Casanova f89cb6aec2 Merge branch 'dev' into dev 2021-05-25 17:30:25 -03:00
Edresson d570c2d790 pylint fix and data loader bug fix 2021-05-26 01:11:37 -03:00
Edresson 856ea19758 bug fix in dataloader and update inference 2021-05-18 03:43:16 -03:00
Eren Gölge da49089a72 update melgan training test batch size 2021-05-12 10:12:11 +02:00
Edresson 3433c2f348 add compute embedding for the new speaker encoder 2021-05-12 03:06:46 -03:00
Edresson 3fcc748b2e implement the Speaker Encoder H/ASP 2021-05-11 16:27:05 -03:00
Eren Gölge 19fb1d743d style update 2021-05-11 11:30:00 +02:00
Eren Gölge 18e76a2309 fix speaker encoder model initialization 2021-05-11 11:29:18 +02:00
Eren Gölge f8e52965dd add speaker encoder coqpit 2021-05-11 11:29:18 +02:00
Eren Gölge ce2bba543e remove extra from utils and move funcs to io.py 2021-05-11 11:29:18 +02:00
Eren Gölge 812dbc2b06 rm config.json 2021-05-11 11:29:18 +02:00
Eren Gölge 4a58fdfd59 comment out check-arguments before copying fields to the configs 2021-05-11 11:29:17 +02:00
Edresson 85ccad7e0a add Audio data augmentation Additive and RIR 2021-05-11 00:59:57 -03:00
Edresson 77d85c6cc5 add softmaxproto loss and bug fix in data loader 2021-05-10 17:08:38 -03:00
Edresson 78bad25f2b update voxceleb download link 2021-05-07 23:45:15 -03:00
Eren Gölge 8cb27267a4 formatting 2021-05-03 14:26:35 +02:00
Agrin Hilmkil 351d0ed6ae Remove unnecessary fsspec usage 2021-04-28 11:21:08 +02:00
Agrin Hilmkil 167f86417e Move dev, tf, notebook dependencies to extras 2021-04-28 11:20:06 +02:00
Eren Gölge 2da81f5bb6 add load_chekpoint to speaker encoder 2021-04-23 18:04:37 +02:00
Eren Gölge 87ee6ceb57 style update #3 2021-04-09 01:17:15 +02:00
Eren Gölge e5b9607bc3 isort all imports 2021-04-09 00:45:20 +02:00
Eren Gölge 0e79fa86ad format with black and pylint 2.7.3 2021-04-09 00:38:08 +02:00
Eren Gölge 7a382a5c2b stowed aligntts commit and small refactoring with feed_forward layers 2021-03-30 14:39:16 +02:00
Eren Gölge 892c3c3623 use torch for AngleProtoLoss 2021-03-30 14:39:16 +02:00
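
Note on commit ced4cfdbbf: the message says checkpoints can be saved to and loaded from cloud paths (s3://, gs://) through fsspec, and that logs are written to a file only when output_path is local. The following is a minimal sketch of that idea, not the code from the commit. The helper names (path_protocol, is_local, save_state, load_state) are hypothetical, and pickle is an assumed stand-in for whatever serializer the repository uses. fsspec is imported lazily, so the path check works without it installed.

```python
import pickle
from urllib.parse import urlparse


def path_protocol(path: str) -> str:
    """Return the URL scheme of `path`, or "file" for plain local paths."""
    scheme = urlparse(path).scheme
    # A one-letter scheme is a Windows drive letter (e.g. "C:\\runs"),
    # not a storage protocol, so treat it as a local path.
    if len(scheme) <= 1:
        return "file"
    return scheme


def is_local(path: str) -> bool:
    """True when `path` points at the local filesystem."""
    return path_protocol(path) == "file"


def save_state(state: dict, path: str) -> None:
    """Serialize `state` to any path that fsspec can open (hypothetical helper)."""
    import fsspec  # third-party; each protocol needs its own backend (e.g. s3fs)

    with fsspec.open(path, "wb") as f:
        pickle.dump(state, f)  # assumption: pickle stands in for the real serializer


def load_state(path: str) -> dict:
    """Load a state dict written by `save_state` (hypothetical helper)."""
    import fsspec

    with fsspec.open(path, "rb") as f:
        return pickle.load(f)


if __name__ == "__main__":
    for p in ("/tmp/run", "s3://bucket/run/best_model.pth", "gs://bucket/run"):
        # Only attach a file logger when the output path is local.
        print(p, "-> file logging enabled" if is_local(p) else "-> console only")
```

With a check like is_local, a training script can skip creating a log file when output_path is remote while still passing the same path to the save and load helpers.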