Enno Hermann
63625e79af
refactor: import get_last_checkpoint from trainer.io
2024-11-29 13:59:43 +01:00
Enno Hermann
1f27f994a1
refactor(utils): remove duplicate set_partial_state_dict
2024-11-21 11:53:35 +01:00
Enno Hermann
2df9bfa78e
refactor: handle deprecation of torch.cuda.amp.autocast ( #144 )
...
torch.cuda.amp.autocast(args...) and torch.cpu.amp.autocast(args...) will be
deprecated. Please use torch.autocast("cuda", args...) or torch.autocast("cpu",
args...) instead.
https://pytorch.org/docs/stable/amp.html
2024-11-09 18:37:08 +01:00
Enno Hermann
de35920317
Merge pull request #50 from idiap/umap
...
build: move umap-learn into optional notebook dependencies
2024-07-25 13:26:09 +01:00
Enno Hermann
2d06aeb79b
chore: remove unused TTS.utils.io module
...
All uses of these methods were replaced with the equivalents from coqui-tts-trainer
2024-06-29 15:07:10 +02:00
Enno Hermann
da82d55329
refactor: use load_fsspec from trainer
...
Made automatically with:
rg "from TTS.utils.io import load_fsspec" --files-with-matches | xargs sed -i 's/from TTS.utils.io import load_fsspec/from trainer.io import load_fsspec/g'
2024-06-29 15:07:10 +02:00
Enno Hermann
28296c6458
refactor: use get_git_branch from trainer
2024-06-29 15:07:10 +02:00
Enno Hermann
59ef28d708
build: move umap-learn into optional notebook dependencies
...
Except for notebooks, it's only used to show embedding plots during speaker
encoder training, in which case a warning is now shown to install it.
2024-06-26 23:53:17 +02:00
Enno Hermann
c5241d71ab
chore: address pytorch deprecations
...
torch.range(a, b) == torch.arange(a, b+1)
meshgrid indexing: https://github.com/pytorch/pytorch/issues/50276
checkpoint use_reentrant:
https://dev-discuss.pytorch.org/t/bc-breaking-update-to-torch-utils-checkpoint-not-passing-in-use-reentrant-flag-will-raise-an-error/1745
optimizer.step() before scheduler.step():
https://pytorch.org/docs/stable/optim.html#how-to-adjust-learning-rate
2024-06-26 11:38:25 +02:00
Enno Hermann
77722cb0dd
fix(bin.synthesize): correctly handle boolean arguments
...
Previously, e.g. `--use_cuda false` would actually set use_cuda=True:
https://github.com/coqui-ai/TTS/discussions/3762
2024-05-31 08:39:32 +02:00
Enno Hermann
70bd84894d
fix(server): ensure logging output gets actually shown
2024-05-20 12:45:41 +02:00
Enno Hermann
b711e19cb6
refactor: remove verbose arguments
...
Can be handled by adjusting logging levels instead.
2024-04-03 15:19:45 +02:00
Enno Hermann
b6ab85a050
fix: use logging instead of print statements
...
Fixes #1691
2024-04-03 15:19:45 +02:00
Enno Hermann
a7753708fb
refactor: remove duplicate methods available in Trainer
2024-03-12 15:06:42 +01:00
Enno Hermann
e05243c4c8
refactor: read/write csv files with standard library
2024-03-06 16:18:09 +01:00
Aarni Koskela
449820ec7d
Ruff autofix E71*
2023-12-13 14:56:41 +02:00
Aarni Koskela
90991e89b4
Ruff autofix unused imports and import order
2023-12-13 14:56:41 +02:00
Enno Hermann
0fb0d67de7
refactor: use save_checkpoint()/save_best_model() from Trainer
2023-11-17 01:18:23 +01:00
Enno Hermann
96678c7ba2
refactor: use copy_model_files() from Trainer
2023-11-17 01:18:23 +01:00
Enno Hermann
39fe38bda4
refactor: use save_fsspec() from Trainer
2023-11-17 01:18:23 +01:00
Enno Hermann
fdf0c8b10a
chore(encoder): remove unused code
2023-11-17 01:18:23 +01:00
Eren G??lge
17ac188958
Drop fairseq for Hubert
2023-06-26 19:27:48 +02:00
Edresson Casanova
16b9862252
Fix Speaker Consistency Loss (SCL) ( #2364 )
2023-02-27 09:14:00 +03:00
Eren Gölge
914280a556
Bump up to v0.11.0 ( #2329 )
...
* Make style
* Bump up to v0.11.0
2023-02-08 13:58:49 +01:00
Eren Gölge
8cb1433e6e
Cache fsspec downloads ( #2132 )
...
* Cache fsspec downloaded files
* Use diff paths for test
* Make fsspec caching optional
* Decom GPU docker tests
* Make progress bar optional for better CI log
* Check path local
2022-11-09 22:12:48 +01:00
Edresson Casanova
096b35f639
Add VCTK speaker encoder recipe ( #1912 )
2022-08-26 16:19:03 +02:00
Eren Gölge
7442bcefa5
Remove deprecated files ( #1873 )
...
- samplers.py is moved
- distribute.py is replaces by the 👟 Trainer
2022-08-15 12:16:37 +02:00
Eren Gölge
d46fbc240c
Introduce numpy and torch transforms ( #1705 )
...
* Refactor audio processing functions
* Add tests for numpy transforms
* Fix imports
* Fix imports2
2022-08-08 11:57:50 +02:00
Edresson Casanova
060e0f9368
Add EmbeddingManager and BaseIDManager ( #1374 )
2022-03-31 13:41:16 +02:00
Eren Gölge
72d85e53c9
Update model file extension ( #1422 )
...
* Update model file ext to ```.pth```
* Update docs
* Rename more
* Find model files
2022-03-22 17:55:00 +01:00
Eren Gölge
0870a4faa2
Make style ( #1405 )
2022-03-16 12:13:55 +01:00
Edresson Casanova
f81892483d
REBASED: Transform Speaker Encoder in a Generic Encoder and Implement Emotion Encoder training support ( #1349 )
...
* Rename Speaker encoder module to encoder
* Add a generic emotion dataset formatter
* Transform the Speaker Encoder dataset to a generic dataset and create emotion encoder config
* Add class map in emotion config
* Add Base encoder config
* Add evaluation encoder script
* Fix the bug in plot_embeddings
* Enable Weight decay for encoder training
* Add argumnet to disable storage
* Add Perfect Sampler and remove storage
* Add evaluation during encoder training
* Fix lint checks
* Remove useless config parameter
* Active evaluation in speaker encoder test and use multispeaker dataset for this test
* Unit tests fixs
* Remove useless tests for speedup the aux_tests
* Use get_optimizer in Encoder
* Add BaseEncoder Class
* Fix the unitests
* Add Perfect Batch Sampler unit test
* Add compute encoder accuracy in a function
2022-03-11 14:43:40 +01:00