Eren Gölge
36cef5966b
Fix resnet speaker encoder
2021-12-30 15:36:35 +00:00
Eren Gölge
348b5c96a2
Fix speaker encoder test
2021-12-30 15:36:35 +00:00
Eren Gölge
638091f41d
Update Speaker Encoder models
2021-12-30 12:02:06 +00:00
Eren Gölge
d29c3780d1
Use speaker_encoder from speaker manager in Vits
2021-12-20 11:54:10 +00:00
Eren Gölge
3c6d7f495c
Fixup
2021-12-20 11:54:10 +00:00
Eren Gölge
3818bd0c23
Fixup
2021-12-20 11:54:10 +00:00
Eren Gölge
7a987db62b
Use torchaudio for ResNet speaker encoder
2021-12-20 11:54:10 +00:00
Eren Gölge
704dddcffa
Make style
2021-12-20 11:54:10 +00:00
Edresson
d39200e69b
Remove torchaudio requirement
2021-12-20 11:54:10 +00:00
Edresson
45d0b04179
Lint fixes
2021-12-20 11:54:10 +00:00
WeberJulian
1472b6df49
make style
2021-12-20 11:54:10 +00:00
Edresson
9b011b1cb3
Add H/ASP original checkpoint support
2021-12-20 11:54:09 +00:00
Eren Gölge
37959ad0c7
Make linter
2021-09-30 23:02:16 +00:00
Eren Gölge
2e9b6b4f90
Refactor Speaker Encoder training
2021-09-30 14:47:56 +00:00
Eren Gölge
6782d3eab7
Fix linter issues for Python 3.6
2021-08-30 16:18:33 +00:00
Eren Gölge
18da8f5dbd
Update pylint 2.10.2 and fix lint issues
2021-08-30 08:10:35 +00:00
Agrin Hilmkil
ced4cfdbbf
Allow saving / loading checkpoints from cloud paths (#683)
* Allow saving / loading checkpoints from cloud paths
Allows saving and loading checkpoints directly from cloud paths like
Amazon S3 (s3://) and Google Cloud Storage (gs://) by using fsspec.
Note: The user will have to install the relevant dependency for each
protocol. Otherwise fsspec will fail and specify which dependency is
missing.
* Append suffix _fsspec to save/load function names
* Add a lower bound to the fsspec dependency
Skips the 0 major version.
* Add missing changes from refactor
* Use fsspec for remaining artifacts
* Add test case with path requiring fsspec
* Avoid writing logs to file unless output_path is local
* Document the possibility of using paths supported by fsspec
* Fix style and lint
* Add missing lint fixes
* Add type annotations to new functions
* Use Coqpit method for converting config to dict
* Fix type annotation in semi-new function
* Add return type for load_fsspec
* Fix bug where fs not always created
* Restore the experiment removal functionality
2021-08-09 18:02:36 +00:00
Eren Gölge
4b7b88dd3d
Add fullband-melgan DE vocoder
2021-07-26 15:38:30 +02:00
Edresson
1c4e806f54
use speaker manager on compute embeddings script
2021-06-27 03:35:34 -03:00
Edresson Casanova
e78e3cd81e
Merge branch 'dev' into dev
2021-06-18 14:10:03 -03:00
Edresson
14b209c7e9
Create a batch for faster inference on LSTM Speaker Encoder
2021-06-05 03:12:17 -03:00
Eren Gölge
bec85ac58d
make style
2021-05-31 16:37:15 +02:00
Edresson
7448177b72
use SpeakerManager on compute embeddings script
2021-05-29 21:11:53 -03:00
Edresson
208bb0f0ee
add batched speaker encoder inference
2021-05-27 20:01:00 -03:00
Edresson
bc5307caa0
add unit tests for SoftmaxAngleProtoLoss and ResnetSpeakerEncoder and bugfix
2021-05-26 20:35:58 -03:00
Edresson
c90037c2e9
solve merge problems
2021-05-26 16:01:30 -03:00
Edresson Casanova
f89cb6aec2
Merge branch 'dev' into dev
2021-05-25 17:30:25 -03:00
Edresson
d570c2d790
pylint fix and data loader bug fix
2021-05-26 01:11:37 -03:00
Edresson
856ea19758
bug fix in dataloader and update inference
2021-05-18 03:43:16 -03:00
Eren Gölge
da49089a72
update melgan training test batch size
2021-05-12 10:12:11 +02:00
Edresson
3433c2f348
add compute embedding for the new speaker encoder
2021-05-12 03:06:46 -03:00
Edresson
3fcc748b2e
implement the Speaker Encoder H/ASP
2021-05-11 16:27:05 -03:00
Eren Gölge
19fb1d743d
style update
2021-05-11 11:30:00 +02:00
Eren Gölge
18e76a2309
fix speaker encoder model initialization
2021-05-11 11:29:18 +02:00
Eren Gölge
f8e52965dd
add speaker encoder coqpit
2021-05-11 11:29:18 +02:00
Eren Gölge
ce2bba543e
remove extra from utils and move funcs to io.py
2021-05-11 11:29:18 +02:00
Eren Gölge
812dbc2b06
rm config.json
2021-05-11 11:29:18 +02:00
Eren Gölge
4a58fdfd59
comment out check-arguments before copying fields to the configs
2021-05-11 11:29:17 +02:00
Edresson
85ccad7e0a
add Audio data augmentation Additive and RIR
2021-05-11 00:59:57 -03:00
Edresson
77d85c6cc5
add softmaxproto loss and bug fix in data loader
2021-05-10 17:08:38 -03:00
Edresson
78bad25f2b
update voxceleb download link
2021-05-07 23:45:15 -03:00
Eren Gölge
8cb27267a4
formatting
2021-05-03 14:26:35 +02:00
Agrin Hilmkil
351d0ed6ae
Remove unnecessary fsspec usage
2021-04-28 11:21:08 +02:00
Agrin Hilmkil
167f86417e
Move dev, tf, notebook dependencies to extras
2021-04-28 11:20:06 +02:00
Eren Gölge
2da81f5bb6
add load_checkpoint to speaker encoder
2021-04-23 18:04:37 +02:00
Eren Gölge
87ee6ceb57
style update #3
2021-04-09 01:17:15 +02:00
Eren Gölge
e5b9607bc3
isort all imports
2021-04-09 00:45:20 +02:00
Eren Gölge
0e79fa86ad
format with black and pylint 2.7.3
2021-04-09 00:38:08 +02:00
Eren Gölge
7a382a5c2b
stowed aligntts commit and small refactoring with feed_forward layers
2021-03-30 14:39:16 +02:00
Eren Gölge
892c3c3623
use torch for AngleProtoLoss
2021-03-30 14:39:16 +02:00