coqui-tts

Commit Graph

Author	SHA1	Message	Date
Eren Gölge	232a5abb6a	Update `tts.setup_model` Run `model.make_symbols()` if available to set the symbol list	2021-08-09 18:02:36 +00:00
Eren Gölge	f5a6aa974f	Modify `symbols.py` not to add _arpanet	2021-08-09 18:02:36 +00:00
Eren Gölge	d4deb2716f	Modify `get_optimizer` to accept a model argument	2021-08-09 18:02:36 +00:00
Eren Gölge	003e5579e8	Enable `custom_symbols` in text processing Models can define their own custom symbols lists with custom `make_symbols()`	2021-08-09 18:02:36 +00:00
Eren Gölge	bd4e29b4dd	Add `compute_linear_spec=False` to `BaseTTSConfig`	2021-08-09 18:02:36 +00:00
Eren Gölge	960a35a121	Add `scheduler_after_epoch` to `BaseTrainingConfig`	2021-08-09 18:02:36 +00:00
Eren Gölge	e4648ffef1	Fix multi-speaker init of Tacotron models & tests	2021-08-09 18:02:36 +00:00
Eren Gölge	01324c8e70	Update `base_tts.py` Enable calling `make_symbols()` from the model if defined. Compatibility changes for end2end `tts` models in batch formatting. Changes in multi-speaker initialization. Modify `test_run()` to work with dict output of `synthesis`	2021-08-09 18:02:36 +00:00
Eren Gölge	bf562cf437	Update `trainer.py` Fix multi-speaker initialization of models. Add changes for end2end `tts` models.	2021-08-09 18:02:36 +00:00
Agrin Hilmkil	ced4cfdbbf	Allow saving / loading checkpoints from cloud paths (#683) * Allow saving / loading checkpoints from cloud paths Allows saving and loading checkpoints directly from cloud paths like Amazon S3 (s3://) and Google Cloud Storage (gs://) by using fsspec. Note: The user will have to install the relevant dependency for each protocol. Otherwise fsspec will fail and specify which dependency is missing. * Append suffix _fsspec to save/load function names * Add a lower bound to the fsspec dependency Skips the 0 major version. * Add missing changes from refactor * Use fsspec for remaining artifacts * Add test case with path requiring fsspec * Avoid writing logs to file unless output_path is local * Document the possibility of using paths supported by fsspec * Fix style and lint * Add missing lint fixes * Add type annotations to new functions * Use Coqpit method for converting config to dict * Fix type annotation in semi-new function * Add return type for load_fsspec * Fix bug where fs not always created * Restore the experiment removal functionality	2021-08-09 18:02:36 +00:00
Eren Gölge	d9e18e009b	Skip phoneme cache pre-compute if the path exists	2021-08-09 18:02:36 +00:00
Eren Gölge	6c131d168e	Bump the version to 0.1.3	2021-07-26 21:32:27 +02:00
Eren Gölge	febd6105b5	Update default vocoder for de-thorsten	2021-07-26 16:08:52 +02:00
Eren Gölge	4b7b88dd3d	Add fullband-melgan DE vocoder	2021-07-26 15:38:30 +02:00
Eren Gölge	764f684e1b	Fix `server.py` for multi-speaker models	2021-07-26 15:38:30 +02:00
Eren Gölge	75b201c6c1	Merge pull request #673 from coqui-ai/fix_stopnet Fix stopnet training for Tacotron models	2021-07-24 12:25:38 +02:00
Eren Gölge	fc0c4600bd	Fix stopnet training	2021-07-24 11:39:54 +02:00
Eren Gölge	30eed347b6	Merge pull request #581 from Edresson/dev Compute speaker embeddings in batch for the LSTM Speaker Encoder and Compute embeddings/ finding chars using config file.	2021-07-23 17:22:51 +02:00
Edresson Casanova	d5adc35fdf	Add docstring to compute_embeddings script	2021-07-21 07:16:10 -03:00
Eren Gölge	05c75aa9d5	Fix linter issues	2021-07-16 13:37:38 +02:00
Eren Gölge	58cc414477	Fix WaveGrad `test_run`	2021-07-16 13:02:25 +02:00
WeberJulian	25832eb97b	Changes for review	2021-07-15 11:38:45 +02:00
Edresson	b1620d1f3f	remove ignore generate eval flag	2021-07-15 03:34:28 -03:00
WeberJulian	c79a82ed07	refix linter	2021-07-13 23:12:18 +02:00
WeberJulian	7d92b30946	Fix tests	2021-07-13 23:00:34 +02:00
WeberJulian	32974dd6a9	Fix test sentences synthesis	2021-07-13 16:07:13 +02:00
Edresson	d906fea08c	lint fix and eval as argparse in extract tts spectrograms	2021-07-13 02:15:31 -03:00
Edresson	2e5baffa9c	Merge fix and eval split as argparse	2021-07-13 01:47:32 -03:00
Eren Gölge	93a74cbb71	Merge pull request #628 from Aloento/patch-2 Change to _get_preprocessor_by_name	2021-07-11 22:17:50 +02:00
Edresson	4eac1c4651	bug fix on train_encoder and unit tests	2021-07-11 12:00:39 -03:00
Aloento	6e3e6d5756	Change to _get_preprocessor_by_name	2021-07-08 09:53:13 +02:00
Eren Gölge	8fbadad68e	Bump up to v0.1.2	2021-07-06 14:44:59 +02:00
eren golge	3c0454490f	Fix #616	2021-07-06 14:44:03 +02:00
Eren Gölge	0c347624e7	Bump up version to v0.1.1	2021-07-04 11:46:36 +02:00
Eren Gölge	a05b234080	Raise an error when multiple GPUs are in use User must define the target GPU by `CUDA_VISIBLE_DEVICES` and use `distribute.py` for multi-gpu training.	2021-07-04 11:25:49 +02:00
Eren Gölge	270c3823eb	Fix #608	2021-07-04 11:19:31 +02:00
Eren Gölge	c25a2184e7	Add docs for `SpeakerManager`	2021-07-03 13:55:27 +02:00
Eren Gölge	f382e4c700	Fix linter warnings	2021-07-03 13:30:24 +02:00
Eren Gölge	9e7824fe35	Fix UnivNet inference code	2021-07-02 10:48:34 +02:00
Eren Gölge	168f97cbe9	Let `Synthesizer` use the speaker manager out of the model	2021-07-02 10:47:55 +02:00
Eren Gölge	196876feb1	Fix `ModelManager` model download	2021-07-02 10:47:05 +02:00
Eren Gölge	9352cb4136	Format Align TTS docstrings	2021-07-02 10:45:58 +02:00
Eren Gölge	95ad72f38f	Fix glow tts initialization	2021-07-02 10:45:37 +02:00
Eren Gölge	40b0b5365e	Let `get_characters` return `num_chars`	2021-07-02 10:45:00 +02:00
Eren Gölge	0fa6a8c9b8	Fix glow tts default parameters	2021-07-02 10:44:23 +02:00
Eren Gölge	a4c658f5ef	Fix for using the `Synthesizer` out of the model	2021-07-02 10:43:38 +02:00
Eren Gölge	db47f4f105	Update `.models.json`	2021-07-02 10:43:00 +02:00
Eren Gölge	2e1a428b83	Update glowtts docstrings and docs	2021-06-30 14:30:55 +02:00
Eren Gölge	5723eb4738	Fix config init in `process_args`	2021-06-29 16:41:08 +02:00
Eren Gölge	4b5421b42f	Remove FAQ link from README.md	2021-06-29 13:20:40 +02:00
Eren Gölge	47b3b10d6d	Bump up to v0.1.0 🚀	2021-06-29 13:07:59 +02:00
Eren Gölge	7ec5c31898	Merge branch 'univnet' into trainer-api	2021-06-29 10:27:12 +02:00
Eren Gölge	51398cd15b	Add docstrings and typing for `audio.py`	2021-06-28 17:03:47 +02:00
Eren Gölge	ae6405bb76	Docstrings for `Trainer`	2021-06-28 17:03:47 +02:00
Eren Gölge	6b265ae8e3	Docstring update	2021-06-28 17:03:47 +02:00
Eren Gölge	ab563ce7cd	Start training by config.json using `register_config`	2021-06-28 17:03:47 +02:00
Eren Gölge	b3c073c99b	Allow running full path scripts with `distribute.py`	2021-06-28 17:03:47 +02:00
Eren Gölge	d42d1c02ea	Use `torch.linalg.qr` for pytorch > `v1.9.0`	2021-06-28 17:03:47 +02:00
Eren Gölge	fbba37e01e	Fix loading the `amp` scaler from a checkpoint 🛠️	2021-06-28 17:03:47 +02:00
Eren Gölge	a7617d8ab6	Add 🐍 python 3.9 to CI	2021-06-28 17:03:47 +02:00
Eren Gölge	9790eddada	Fix wrong argument name 🛠️	2021-06-28 17:03:47 +02:00
Eren Gölge	932ab107ae	Docstring edit in `TTSDataset.py` ✍️	2021-06-28 17:03:47 +02:00
Eren Gölge	cfa5041db7	Fix `eval_log` for `gan.py` 🛠️	2021-06-28 17:03:47 +02:00
Eren Gölge	d700845b10	Move `TorchSTFT` to `utils.audio`	2021-06-28 17:03:47 +02:00
Eren Gölge	5b89cb4fec	Fixup `trainer.py` 🛠️	2021-06-28 17:03:47 +02:00
Eren Gölge	8c74f054f0	Enable support for 🐍 python 3.10 Bump up versions numpy 1.19.5 and TF 2.5.0	2021-06-28 17:03:47 +02:00
Eren Gölge	9455a2b01e	Apply small fixes for API compatibility	2021-06-28 17:03:47 +02:00
Eren Gölge	a5d5bc9063	Print `max_decoder_steps` when model reaches the limit	2021-06-28 17:03:47 +02:00
Eren Gölge	e30f245e06	Update `synthesizer` for speaker and model init	2021-06-28 17:03:47 +02:00
Eren Gölge	15fa31b595	fixup configs	2021-06-28 17:03:47 +02:00
Eren Gölge	f23b228e24	Update `speaker_manager`	2021-06-28 17:03:47 +02:00
Eren Gölge	e53616078a	Fixup `utils` for the trainer	2021-06-28 17:03:47 +02:00
Eren Gölge	106b63d8a9	Update `vocoder` utils	2021-06-28 17:03:47 +02:00
Eren Gölge	45947acb60	Update `TTS.bin` scripts for the new API	2021-06-28 17:03:47 +02:00
Eren Gölge	d7225eedb0	Update `vocoder` datasets and `setup_dataset`	2021-06-28 17:03:20 +02:00
Eren Gölge	d18198dff8	Implement `setup_model` for vocoder models	2021-06-28 17:03:20 +02:00
Eren Gölge	e949e7ad58	Update vocoder models	2021-06-28 17:03:19 +02:00
Eren Gölge	51005cdab4	Update `tts.models.setup_model`	2021-06-28 17:03:19 +02:00
Eren Gölge	7b8c15ac49	Create base 🐸TTS model abstraction for tts models	2021-06-28 17:03:19 +02:00
Eren Gölge	a358f74a52	Update vocoder model configs	2021-06-28 17:03:19 +02:00
Eren Gölge	786170fe7d	Update tts model configs	2021-06-28 17:03:19 +02:00
Eren Gölge	98298ee671	Implement unified IO utils	2021-06-28 17:03:19 +02:00
Eren Gölge	c7aad884cd	Implement unified trainer	2021-06-28 17:03:19 +02:00
Eren Gölge	6d7b5fbcde	`tts` model abstraction with `TTSModel`	2021-06-28 17:03:19 +02:00
Eren Gölge	d4dbd89752	fix calculation of `loader_start_time`	2021-06-28 17:03:19 +02:00
Eren Gölge	c754a0e17d	`TrainerAbstract` and related updates for `TrainerTTS`	2021-06-28 17:03:19 +02:00
Eren Gölge	00c82c516d	rename to	2021-06-28 17:03:19 +02:00
Eren Gölge	166f0aeb9a	merge if branches with the same implementation	2021-06-28 17:03:19 +02:00
Eren Gölge	03494ad642	adjust `distribute.py` for the `train_tts.py`	2021-06-28 17:03:19 +02:00
Eren Gölge	fdfb18d230	downsize melgan test model size	2021-06-28 17:03:19 +02:00
Eren Gölge	25238e0658	fix glow-tts `inference()`	2021-06-28 17:03:19 +02:00
Eren Gölge	419735f440	refactor and fix multi-speaker training in Trainer and Tacotron models	2021-06-28 17:03:19 +02:00
Eren Gölge	269e5a734e	add max_decoder_steps argument to tacotron models	2021-06-28 17:03:19 +02:00
Eren Gölge	b3324bd914	fix speaker_manager init	2021-06-28 17:03:19 +02:00
Eren Gölge	2c38ef8441	use get_speaker_manager in Trainer and save speakers.json file when needed	2021-06-28 17:03:19 +02:00
Eren Gölge	d6b2b6add6	make style and linter fixes	2021-06-28 17:03:19 +02:00
Eren Gölge	802d461389	Compute d_vectors and speaker_ids separately in TTSDataset	2021-06-28 17:03:19 +02:00
Eren Gölge	db6a97d1a2	rename external speaker embedding arguments as `d_vectors`	2021-06-28 17:03:19 +02:00
Eren Gölge	9042ae9195	use `to_cuda()` for moving data in `format_batch()`	2021-06-28 17:03:19 +02:00
Eren Gölge	f82f1970b8	change `to(device)` to `type_as` in models	2021-06-28 17:03:19 +02:00

1080 Commits