coqui-tts

Commit Graph

Author	SHA1	Message	Date
Enno Hermann	63625e79af	refactor: import get_last_checkpoint from trainer.io	2024-11-29 13:59:43 +01:00
Enno Hermann	1f27f994a1	refactor(utils): remove duplicate set_partial_state_dict	2024-11-21 11:53:35 +01:00
Enno Hermann	2df9bfa78e	refactor: handle deprecation of torch.cuda.amp.autocast (#144) torch.cuda.amp.autocast(args...) and torch.cpu.amp.autocast(args...) will be deprecated. Please use torch.autocast("cuda", args...) or torch.autocast("cpu", args...) instead. https://pytorch.org/docs/stable/amp.html	2024-11-09 18:37:08 +01:00
Enno Hermann	de35920317	Merge pull request #50 from idiap/umap build: move umap-learn into optional notebook dependencies	2024-07-25 13:26:09 +01:00
Enno Hermann	2d06aeb79b	chore: remove unused TTS.utils.io module All uses of these methods were replaced with the equivalents from coqui-tts-trainer	2024-06-29 15:07:10 +02:00
Enno Hermann	da82d55329	refactor: use load_fsspec from trainer Made automatically with: rg "from TTS.utils.io import load_fsspec" --files-with-matches | xargs sed -i 's/from TTS.utils.io import load_fsspec/from trainer.io import load_fsspec/g'	2024-06-29 15:07:10 +02:00
Enno Hermann	28296c6458	refactor: use get_git_branch from trainer	2024-06-29 15:07:10 +02:00
Enno Hermann	59ef28d708	build: move umap-learn into optional notebook dependencies Except for notebooks, it's only used to show embedding plots during speaker encoder training, in which case a warning is now shown to install it.	2024-06-26 23:53:17 +02:00
Enno Hermann	c5241d71ab	chore: address pytorch deprecations torch.range(a, b) == torch.arange(a, b+1) meshgrid indexing: https://github.com/pytorch/pytorch/issues/50276 checkpoint use_reentrant: https://dev-discuss.pytorch.org/t/bc-breaking-update-to-torch-utils-checkpoint-not-passing-in-use-reentrant-flag-will-raise-an-error/1745 optimizer.step() before scheduler.step(): https://pytorch.org/docs/stable/optim.html#how-to-adjust-learning-rate	2024-06-26 11:38:25 +02:00
Enno Hermann	77722cb0dd	fix(bin.synthesize): correctly handle boolean arguments Previously, e.g. `--use_cuda false` would actually set use_cuda=True: https://github.com/coqui-ai/TTS/discussions/3762	2024-05-31 08:39:32 +02:00
Enno Hermann	70bd84894d	fix(server): ensure logging output gets actually shown	2024-05-20 12:45:41 +02:00
Enno Hermann	b711e19cb6	refactor: remove verbose arguments Can be handled by adjusting logging levels instead.	2024-04-03 15:19:45 +02:00
Enno Hermann	b6ab85a050	fix: use logging instead of print statements Fixes #1691	2024-04-03 15:19:45 +02:00
Enno Hermann	a7753708fb	refactor: remove duplicate methods available in Trainer	2024-03-12 15:06:42 +01:00
Enno Hermann	e05243c4c8	refactor: read/write csv files with standard library	2024-03-06 16:18:09 +01:00
Aarni Koskela	449820ec7d	Ruff autofix E71*	2023-12-13 14:56:41 +02:00
Aarni Koskela	90991e89b4	Ruff autofix unused imports and import order	2023-12-13 14:56:41 +02:00
Enno Hermann	0fb0d67de7	refactor: use save_checkpoint()/save_best_model() from Trainer	2023-11-17 01:18:23 +01:00
Enno Hermann	96678c7ba2	refactor: use copy_model_files() from Trainer	2023-11-17 01:18:23 +01:00
Enno Hermann	39fe38bda4	refactor: use save_fsspec() from Trainer	2023-11-17 01:18:23 +01:00
Enno Hermann	fdf0c8b10a	chore(encoder): remove unused code	2023-11-17 01:18:23 +01:00
Eren Gölge	17ac188958	Drop fairseq for Hubert	2023-06-26 19:27:48 +02:00
Edresson Casanova	16b9862252	Fix Speaker Consistency Loss (SCL) (#2364)	2023-02-27 09:14:00 +03:00
Eren Gölge	914280a556	Bump up to v0.11.0 (#2329) * Make style * Bump up to v0.11.0	2023-02-08 13:58:49 +01:00
Eren Gölge	8cb1433e6e	Cache fsspec downloads (#2132) * Cache fsspec downloaded files * Use diff paths for test * Make fsspec caching optional * Decom GPU docker tests * Make progress bar optional for better CI log * Check path local	2022-11-09 22:12:48 +01:00
Edresson Casanova	096b35f639	Add VCTK speaker encoder recipe (#1912)	2022-08-26 16:19:03 +02:00
Eren Gölge	7442bcefa5	Remove deprecated files (#1873) - samplers.py is moved - distribute.py is replaces by the 👟Trainer	2022-08-15 12:16:37 +02:00
Eren Gölge	d46fbc240c	Introduce numpy and torch transforms (#1705) * Refactor audio processing functions * Add tests for numpy transforms * Fix imports * Fix imports2	2022-08-08 11:57:50 +02:00
Edresson Casanova	060e0f9368	Add EmbeddingManager and BaseIDManager (#1374)	2022-03-31 13:41:16 +02:00
Eren Gölge	72d85e53c9	Update model file extension (#1422) * Update model file ext to ```.pth``` * Update docs * Rename more * Find model files	2022-03-22 17:55:00 +01:00
Eren Gölge	0870a4faa2	Make style (#1405)	2022-03-16 12:13:55 +01:00
Edresson Casanova	f81892483d	REBASED: Transform Speaker Encoder in a Generic Encoder and Implement Emotion Encoder training support (#1349) * Rename Speaker encoder module to encoder * Add a generic emotion dataset formatter * Transform the Speaker Encoder dataset to a generic dataset and create emotion encoder config * Add class map in emotion config * Add Base encoder config * Add evaluation encoder script * Fix the bug in plot_embeddings * Enable Weight decay for encoder training * Add argumnet to disable storage * Add Perfect Sampler and remove storage * Add evaluation during encoder training * Fix lint checks * Remove useless config parameter * Active evaluation in speaker encoder test and use multispeaker dataset for this test * Unit tests fixs * Remove useless tests for speedup the aux_tests * Use get_optimizer in Encoder * Add BaseEncoder Class * Fix the unitests * Add Perfect Batch Sampler unit test * Add compute encoder accuracy in a function	2022-03-11 14:43:40 +01:00

32 Commits
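
Note on 77722cb0dd: the message says `--use_cuda false` used to set use_cuda=True. The log does not show how the flag was declared or how the commit fixed it. The sketch below only illustrates one common way this symptom arises with argparse, assuming the flag was declared with `type=bool`; `str2bool` is a hypothetical helper written for this illustration, not code taken from the repository.

```python
import argparse


def str2bool(value: str) -> bool:
    """Hypothetical helper: map common textual spellings to a real boolean."""
    lowered = value.strip().lower()
    if lowered in {"true", "t", "yes", "y", "1"}:
        return True
    if lowered in {"false", "f", "no", "n", "0"}:
        return False
    raise argparse.ArgumentTypeError(f"expected a boolean, got {value!r}")


# Pitfall (assumed declaration): bool() is True for any non-empty string,
# so the literal text "false" is converted to True.
buggy = argparse.ArgumentParser()
buggy.add_argument("--use_cuda", type=bool, default=False)
buggy_args = buggy.parse_args(["--use_cuda", "false"])

# One possible fix: convert the string explicitly.
fixed = argparse.ArgumentParser()
fixed.add_argument("--use_cuda", type=str2bool, default=False)
fixed_args = fixed.parse_args(["--use_cuda", "false"])

print(buggy_args.use_cuda, fixed_args.use_cuda)  # True False
```

An explicit converter keeps the `--flag value` calling style working; Python 3.9+ also offers `argparse.BooleanOptionalAction`, which changes the interface to `--use_cuda` / `--no-use_cuda` instead.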