coqui-tts

Commit Graph

Author	SHA1	Message	Date
Eren Gölge	35a781fb90	Fix synthesizer reading `use_language_embedding`	2021-12-20 11:54:10 +00:00
Eren Gölge	704dddcffa	Make style	2021-12-20 11:54:10 +00:00
WeberJulian	54b7fb4e4a	Fix zoo tests	2021-12-20 11:54:10 +00:00
WeberJulian	a564eb9f54	Add support for multi-lingual models in CLI	2021-12-20 11:54:10 +00:00
Edresson	818dc4ccd8	Add Docstring for TorchSTFT	2021-12-20 11:54:10 +00:00
Edresson	d39200e69b	Remove torchaudio requeriment	2021-12-20 11:54:10 +00:00
Edresson	45d0b04179	Lint fixs	2021-12-20 11:54:10 +00:00
Edresson	2b2cecaea2	Set the new_fields in copy_model_files as None by default	2021-12-20 11:54:10 +00:00
Edresson	352aa69eca	Create a module for the VAD script	2021-12-20 11:54:10 +00:00
loganhart420	103c010eca	Add addtional datasets	2021-12-16 07:21:27 -05:00
Eren Gölge	ce45d9e1af	Make style and lint	2021-12-01 10:42:52 +00:00
Eren Gölge	512ada7548	Fix callbacks against multi-gpu training	2021-12-01 10:32:14 +00:00
Eren Gölge	d227aaebcc	Print when using Griffin-Lim in Synthesizer	2021-11-01 16:52:26 +01:00
George	37eaefc085	Optional silence trimming during inference and find_endpoint() fix (#898 ) * Set find_endpoint db threshold in config.json * Optional silence trimming during inference * Make trim_db value negative	2021-10-29 18:28:55 +02:00
Eren Gölge	2df0752e73	Model zoo tests (#900 ) * Fix VITS model multi-speaker init * Remove gdrive support in model manager * Add model zoo tests	2021-10-29 17:54:16 +02:00
Eren Gölge	035ed432bc	Doc update (#889 ) * Link source files from the docs * Update glowTTS recipes for docs * Add dataset downloaders	2021-10-26 17:41:33 +02:00
Eren Gölge	1987aaaaed	Update d-vector reshape in synthesizer	2021-10-21 13:53:25 +00:00
Eren Gölge	92b6d98443	Set pitch frame alignment wrt spec computation	2021-10-20 18:12:38 +00:00
Eren Gölge	0a3d1cc7ee	Pass speaker manager to the model in synthesizer	2021-10-20 18:11:36 +00:00
Eren Gölge	3c7848e9b1	Don't OOR values in train console log	2021-10-19 16:32:16 +00:00
Eren Gölge	c514351c0e	Refactor multi-speaker init in BaseTTS-Tacotron1-2	2021-10-18 08:55:45 +00:00
Eren Gölge	700b056117	Update Synthesizer multi-speaker handling	2021-10-15 10:21:12 +00:00
Eren Gölge	9a0d8fa027	Update `copy_model_files()`	2021-09-30 14:47:56 +00:00
Eren Gölge	8ada870a57	Refactor `trainer.py` for v2	2021-09-30 14:16:34 +00:00
Eren Gölge	7d8f77385a	Use `glow-tts` in synthesis tests	2021-09-10 17:27:33 +00:00
Eren Gölge	742f9c54da	Warn user if nan in GL	2021-09-10 08:26:05 +00:00
Eren Gölge	4761853c5c	Fix imports	2021-09-08 13:34:40 +00:00
Eren Gölge	2c4bbbf9b9	Use pyworld for pitch	2021-09-06 15:16:58 +00:00
Eren Gölge	98a7271ce8	Refactor FastPitchv2	2021-09-06 15:16:58 +00:00
Eren Gölge	42862f7fdb	Format style of the recipes	2021-09-06 15:16:58 +00:00
Eren Gölge	aacbb3ed77	Fix SpeakerManager usage in `synthesize.py`	2021-09-06 15:16:58 +00:00
Eren Gölge	5a6ffaee08	Add yin based pitch computation	2021-09-06 15:16:58 +00:00
Eren Gölge	d085642ac1	Cache pitch features Cache the features at the beginning of `BaseTTS` training.	2021-09-06 15:16:58 +00:00
Eren Gölge	fba257104d	Compute F0 using librosa	2021-09-06 15:16:58 +00:00
Eren Gölge	d16da949a5	Merge branch 'fix_distribute' into dev	2021-08-30 16:31:07 +00:00
Eren Gölge	5255e089e6	Fix #767	2021-08-30 13:10:08 +00:00
Eren Gölge	c560114324	Fix #750	2021-08-30 13:06:50 +00:00
Eren Gölge	18da8f5dbd	Update pylint 2.10.2 and fix lint issues	2021-08-30 08:10:35 +00:00
Eren Gölge	2620f62ea8	Move duration_loss inside VitsGeneratorLoss	2021-08-27 07:07:07 +00:00
Eren Gölge	1692b8e4d9	Merge pull request #726 from fijipants/patch-1 Fix bug with log_func	2021-08-26 22:11:29 +02:00
Eren Gölge	49e1181ea4	Fixes for the vits model	2021-08-26 17:15:09 +00:00
fijipants	e9e01b09b0	Fix bug with log_func	2021-08-18 19:59:51 -04:00
fijipants	8f57f8adfd	Update synthesizer.py	2021-08-18 19:56:52 -04:00
Eren Gölge	7c0d564965	Syncronize DDP processes	2021-08-13 10:40:50 +00:00
Eren Gölge	ecf5f17dca	Fix distribute.py and ddp training	2021-08-12 22:22:32 +00:00
Eren Gölge	537bc8487a	Print model count when listing modelsk	2021-08-10 16:25:11 +00:00
Ayush Chaurasia	f3e9d61330	Refactor logging initialization	2021-08-09 18:35:08 +00:00
Ayush Chaurasia	79b74a989d	Update: add_text	2021-08-09 18:34:38 +00:00
Ayush Chaurasia	9fcf48b760	Delete logger_base.py	2021-08-09 18:34:00 +00:00
Ayush Chaurasia	290972fd35	reformat	2021-08-09 18:34:00 +00:00
Ayush Chaurasia	936a47504d	Update Logger API, recipes	2021-08-09 18:34:00 +00:00
Ayush Chaurasia	f63cf46c55	Unified logger API	2021-08-09 18:34:00 +00:00
Ayush Chaurasia	f4434da5a3	Update disabled structure	2021-08-09 18:31:16 +00:00
Ayush Chaurasia	f606741dc4	Add artifacts logging , wandb args	2021-08-09 18:31:16 +00:00
Ayush Chaurasia	f5e50ad502	WandbLogger	2021-08-09 18:27:06 +00:00
Eren Gölge	c312acac7d	Implement VITS model 🚀 VITS model implementation built on Glow TTS and HiFiGAN layers.	2021-08-09 18:02:36 +00:00
Eren Gölge	060e746e21	Add `do_amp_to_db` option	2021-08-09 18:02:36 +00:00
Eren Gölge	e94c1f894d	Simplify `console_logger`	2021-08-09 18:02:36 +00:00
Eren Gölge	dd55960732	Update `synthesizer.py` Fixes and changes for multi-speaker model init and custom symbols made by mode.make_symbols()	2021-08-09 18:02:36 +00:00
Eren Gölge	d4deb2716f	Modify `get_optimizer` to accept a model argument	2021-08-09 18:02:36 +00:00
Agrin Hilmkil	ced4cfdbbf	Allow saving / loading checkpoints from cloud paths (#683 ) * Allow saving / loading checkpoints from cloud paths Allows saving and loading checkpoints directly from cloud paths like Amazon S3 (s3://) and Google Cloud Storage (gs://) by using fsspec. Note: The user will have to install the relevant dependency for each protocol. Otherwise fsspec will fail and specify which dependency is missing. * Append suffix _fsspec to save/load function names * Add a lower bound to the fsspec dependency Skips the 0 major version. * Add missing changes from refactor * Use fsspec for remaining artifacts * Add test case with path requiring fsspec * Avoid writing logs to file unless output_path is local * Document the possibility of using paths supported by fsspec * Fix style and lint * Add missing lint fixes * Add type annotations to new functions * Use Coqpit method for converting config to dict * Fix type annotation in semi-new function * Add return type for load_fsspec * Fix bug where fs not always created * Restore the experiment removal functionality	2021-08-09 18:02:36 +00:00
Eren Gölge	a05b234080	Raise an error when multiple GPUs are in use User must define the target GPU by `CUDA_VISIBLE_DEVICES` and use `distribute.py` for multi-gpu training.	2021-07-04 11:25:49 +02:00
Eren Gölge	168f97cbe9	Let `Synthesizer` use the speaker manager out of the model	2021-07-02 10:47:55 +02:00
Eren Gölge	196876feb1	Fix `ModelManager` model download	2021-07-02 10:47:05 +02:00
Eren Gölge	2e1a428b83	Update glowtts docstrings and docs	2021-06-30 14:30:55 +02:00
Eren Gölge	51398cd15b	Add docstrings and typing for `audio.py`	2021-06-28 17:03:47 +02:00
Eren Gölge	d700845b10	Move `TorchSTFT` to `utils.audio`	2021-06-28 17:03:47 +02:00
Eren Gölge	e30f245e06	Update `synthesizer` for speaker and model init	2021-06-28 17:03:47 +02:00
Eren Gölge	e53616078a	Fixup `utils` for the trainer	2021-06-28 17:03:47 +02:00
Eren Gölge	98298ee671	Implement unified IO utils	2021-06-28 17:03:19 +02:00
Eren Gölge	c7aad884cd	Implement unified trainer	2021-06-28 17:03:19 +02:00
Eren Gölge	00c82c516d	rename to	2021-06-28 17:03:19 +02:00
Eren Gölge	db6a97d1a2	rename external speaker embedding arguments as `d_vectors`	2021-06-28 17:03:19 +02:00
Eren Gölge	9042ae9195	use `to_cuda()` for moving data in `format_batch()`	2021-06-28 17:03:19 +02:00
Eren Gölge	1c8a3d7c86	make style	2021-06-28 17:03:19 +02:00
Eren Gölge	8cdd423234	styling formatting.py	2021-06-28 17:03:19 +02:00
Eren Gölge	8381379938	formating `cond_input` with a function in Tacotron models	2021-06-28 17:03:19 +02:00
Eren Gölge	d96ebcd6d3	make style	2021-06-28 17:03:19 +02:00
Eren Gölge	b643e8b37c	`logging/__init__.py`	2021-06-28 17:03:19 +02:00
Eren Gölge	0cee5042a9	fix logger imports	2021-06-28 17:03:19 +02:00
Eren Gölge	0eec238429	remove redundant imports	2021-06-28 17:03:19 +02:00
Eren Gölge	b500338faa	make style	2021-06-28 17:03:19 +02:00
Eren Gölge	c680a07a20	fix `Synthesized` for the new `synthesis()`	2021-06-28 17:03:19 +02:00
Eren Gölge	d25f017b42	update `setup_model.py` imports	2021-06-28 17:03:19 +02:00
Eren Gölge	34f8a74e4d	remove `truncated` from synthesizer	2021-06-28 17:03:19 +02:00
Eren Gölge	178eccbc16	update console logger	2021-06-28 17:03:19 +02:00
Eren Gölge	a20a1c7d06	rename preprocess.py -> formatters.py	2021-06-28 17:03:19 +02:00
Eren Gölge	8def3c87af	trainer-API updates	2021-06-28 17:03:19 +02:00
Michael Hansen	67869e77f9	Use gruut for phonemization	2021-06-25 14:41:05 +02:00
Eren Gölge	d0ab0382fc	linter fixes	2021-06-01 09:15:32 +02:00
Eren Gölge	d9f1268f99	init tb_logger None for rank > 0 processes	2021-05-31 15:47:07 +02:00
Eren Gölge	8a7c40736c	set use_phonemes false	2021-05-19 01:27:26 +02:00
Eren Gölge	ccfaa6b1d5	add `needs_phonemizer` field to models.json. If set true these models are only compatible with v0.0.13 or below.	2021-05-18 17:57:28 +02:00
Eren Gölge	c2c7dff805	use relaxted coqpit parser	2021-05-18 14:49:47 +02:00
Eren Gölge	715b0a65a0	update main.yml for python x64 fix test	2021-05-12 00:57:29 +02:00
Eren Gölge	843d1b3d98	linter fixes	2021-05-11 11:30:00 +02:00
Eren Gölge	19fb1d743d	style update	2021-05-11 11:30:00 +02:00
Eren Gölge	6e980b49c4	fix synthesizer.py for Coqpit	2021-05-11 11:29:18 +02:00
Eren Gölge	db14dcd95a	remove old load_config	2021-05-11 11:29:18 +02:00
Eren Gölge	a21ac883dd	add get_cuda()	2021-05-11 11:29:18 +02:00

1 2 3 4 5 ...

280 Commits