coqui-tts

Commit Graph

Author	SHA1	Message	Date
Eren Gölge	ef4ea9e527	update imports for `formatters`	2021-06-28 17:03:19 +02:00
Eren Gölge	6c495c6a6e	fix glow-tts inference and forward functions for handling `cond_input` and refactor its test	2021-06-28 17:03:19 +02:00
Eren Gölge	f840268181	refactor `SpeakerManager`	2021-06-28 17:03:19 +02:00
Eren Gölge	421194880d	linter fixes	2021-06-28 17:03:19 +02:00
Eren Gölge	8e52a69230	delete separate tts training scripts and pre-commit configuration	2021-06-28 17:03:19 +02:00
Eren Gölge	d96ebcd6d3	make style	2021-06-28 17:03:19 +02:00
Eren Gölge	b643e8b37c	`logging/__init__.py`	2021-06-28 17:03:19 +02:00
Eren Gölge	0cee5042a9	fix logger imports	2021-06-28 17:03:19 +02:00
Eren Gölge	72dceca52c	import missings	2021-06-28 17:03:19 +02:00
Eren Gölge	0eec238429	remove redundant imports	2021-06-28 17:03:19 +02:00
Eren Gölge	b500338faa	make style	2021-06-28 17:03:19 +02:00
Eren Gölge	469d2e620a	update extract_tts_spectrogram for `cond_input` API of the models	2021-06-28 17:03:19 +02:00
Eren Gölge	5ab28fa618	update `extract_tts_spec...` using `SpeakerManager`	2021-06-28 17:03:19 +02:00
Eren Gölge	c392fa4288	update `extract_tts_spectrograms` for the new model API	2021-06-28 17:03:19 +02:00
Eren Gölge	8f47f95998	correct import of `load_meta_data` remove redundant import	2021-06-28 17:03:19 +02:00
Eren Gölge	c680a07a20	fix `Synthesized` for the new `synthesis()`	2021-06-28 17:03:19 +02:00
Eren Gölge	73bf9673ed	revert logging.info to print statements for trainer	2021-06-28 17:03:19 +02:00
Eren Gölge	d25f017b42	update `setup_model.py` imports	2021-06-28 17:03:19 +02:00
Eren Gölge	bb355b7441	update align_tts.py model for the trainer	2021-06-28 17:03:19 +02:00
Eren Gölge	9203b863d9	update align_tts_loss for trainer	2021-06-28 17:03:19 +02:00
Eren Gölge	fc9a0fb8ce	update aling_tts_config for the trainer	2021-06-28 17:03:19 +02:00
Eren Gölge	e298b8e364	update trainer.py for better logging handling, restoring models and rename init_ functions with get_	2021-06-28 17:03:19 +02:00
Eren Gölge	b8a4af4010	update `synthesis.py` for being more generic	2021-06-28 17:03:19 +02:00
Eren Gölge	c70d0c9dae	update `speedy_speech.py` model for trainer	2021-06-28 17:03:19 +02:00
Eren Gölge	06ee57d816	update `speedy_speecy_config.py` for the trainer	2021-06-28 17:03:19 +02:00
Eren Gölge	4e910993f1	update tacotron model to return `model_outputs`	2021-06-28 17:03:19 +02:00
Eren Gölge	bb4deee64c	update glow-tts for the trainer	2021-06-28 17:03:19 +02:00
Eren Gölge	9134c7dfb6	update `sequence_mask` import globally	2021-06-28 17:03:19 +02:00
Eren Gölge	b2218e882a	update `glow_tts_config.py` for setting the optimizer and the scheduler	2021-06-28 17:03:19 +02:00
Eren Gölge	891631ab47	typing annotation for the trainer	2021-06-28 17:03:19 +02:00
Eren Gölge	5f07315722	add trainer and train_tts	2021-06-28 17:03:19 +02:00
Eren Gölge	34f8a74e4d	remove `truncated` from synthesizer	2021-06-28 17:03:19 +02:00
Eren Gölge	178eccbc16	update console logger	2021-06-28 17:03:19 +02:00
Eren Gölge	f4f83b6379	update `synthesis.py` for the trainer	2021-06-28 17:03:19 +02:00
Eren Gölge	130781dab6	remove `tts.generic_utils` as all the functions are moved to other files	2021-06-28 17:03:19 +02:00
Eren Gölge	535a458f40	update Tacotron models for the trainer	2021-06-28 17:03:19 +02:00
Eren Gölge	bdbfc95618	add `gradual_training` argument to tacotron.py	2021-06-28 17:03:19 +02:00
Eren Gölge	5a2e75f0ee	import missings for tacotron.py	2021-06-28 17:03:19 +02:00
Eren Gölge	da7d10e53c	mode `setup_model()` to `models/__init__.py`	2021-06-28 17:03:19 +02:00
Eren Gölge	ca302db7b0	add sequence_mask to `utils.data`	2021-06-28 17:03:19 +02:00
Eren Gölge	844abb3b1d	`setup_loss()` in `layer/__init__.py`	2021-06-28 17:03:19 +02:00
Eren Gölge	a20a1c7d06	rename preprocess.py -> formatters.py	2021-06-28 17:03:19 +02:00
Eren Gölge	b9bccbb243	move load_meta_data and related functions to `datasets/__init__.py`	2021-06-28 17:03:19 +02:00
Eren Gölge	d09385808a	set test_sentences in config	2021-06-28 17:03:19 +02:00
Eren Gölge	8def3c87af	trainer-API updates	2021-06-28 17:03:19 +02:00
Eren Gölge	42554cc711	rename MyDataset -> TTSDataset	2021-06-28 17:03:19 +02:00
Edresson	1c4e806f54	use speaker manager on compute embeddings script	2021-06-27 03:35:34 -03:00
Edresson Casanova	eb84bb2bc8	Merge branch 'dev' into dev	2021-06-26 15:32:19 -03:00
Eren Gölge	987cf1178b	Bump up to v0.0.16	2021-06-25 14:44:56 +02:00
Michael Hansen	3f172b84d8	Fix linting issues	2021-06-25 14:41:31 +02:00
Michael Hansen	4d8426fa0a	Use eSpeak IPA lexicons by default for phoneme models	2021-06-25 14:41:05 +02:00
Michael Hansen	618b509204	Use combined characters available in TTS phonemes (like ç)	2021-06-25 14:41:05 +02:00
Michael Hansen	da6f6a4a01	Update docstring for clean_gruut_phonemes	2021-06-25 14:41:05 +02:00
Michael Hansen	47191f3ecc	Add tests for gruut phonemization	2021-06-25 14:41:05 +02:00
Michael Hansen	67869e77f9	Use gruut for phonemization	2021-06-25 14:41:05 +02:00
Eren Gölge	788992093d	Add UnivNet vocoder 🚀	2021-06-23 13:51:04 +02:00
Eren Gölge	64fd59204c	Use `torch.linalg.qr` for pytorch > `v1.9.0`	2021-06-23 13:49:42 +02:00
Eren Gölge	aba840b4e6	Fix loading the `amp` scaler from a checkpoint 🛠️	2021-06-23 13:49:42 +02:00
Eren Gölge	18e5393f16	Add 🐍 python 3.9 to CI	2021-06-23 13:49:36 +02:00
Eren Gölge	0ff2d2336a	Fix wrong argument name 🛠️	2021-06-22 16:21:11 +02:00
Eren Gölge	61c3cb871f	Docstring edit in `TTSDataset.py` ✍️	2021-06-22 16:21:11 +02:00
Eren Gölge	6f739ea07a	Fix `eval_log` for `gan.py` 🛠️	2021-06-22 16:21:11 +02:00
Eren Gölge	ebb91c0fbb	Move `TorchSTFT` to `utils.audio`	2021-06-22 16:21:11 +02:00
Eren Gölge	01c4b22a2f	Fixup `trainer.py` 🛠️	2021-06-22 16:21:11 +02:00
Eren Gölge	7de2756fc4	Enable support for 🐍 python 3.10 Bump up versions numpy 1.19.5 and TF 2.5.0	2021-06-22 16:21:11 +02:00
Eren Gölge	220e184f66	Apply small fixes for API compatibility	2021-06-22 16:21:11 +02:00
Eren Gölge	77d57dd301	Print `max_decoder_steps` when model reaches the limit	2021-06-22 16:21:11 +02:00
Eren Gölge	7dc2177df4	Update `synthesizer` for speaker and model init	2021-06-22 16:21:11 +02:00
Eren Gölge	c3a0bc702e	fixup configs	2021-06-22 16:21:11 +02:00
Eren Gölge	0e01c2594f	Update `speaker_manager`	2021-06-22 16:21:11 +02:00
Eren Gölge	8182f5168f	Fixup `utils` for the trainer	2021-06-22 16:21:11 +02:00
Eren Gölge	b4bb567e04	Update `vocoder` utils	2021-06-22 16:21:11 +02:00
Eren Gölge	f3ff5b1971	Update `TTS.bin` scripts for the new API	2021-06-22 16:21:11 +02:00
Eren Gölge	aed919cf1c	Update `vocoder` datasets and `setup_dataset`	2021-06-22 16:21:11 +02:00
Eren Gölge	59abf490a1	Implement `setup_model` for vocoder models	2021-06-22 16:21:11 +02:00
Eren Gölge	420820caf4	Update vocoder models	2021-06-22 16:21:11 +02:00
Eren Gölge	d10f9c5676	Update `tts.models.setup_model`	2021-06-22 16:21:11 +02:00
Eren Gölge	cae702980f	Create base 🐸TTS model abstraction for tts models	2021-06-22 16:21:11 +02:00
Eren Gölge	70d968b169	Update vocoder model configs	2021-06-22 16:21:11 +02:00
Eren Gölge	f8a3460818	Update tts model configs	2021-06-22 16:21:11 +02:00
Eren Gölge	acd96a4940	Implement unified IO utils	2021-06-22 16:21:10 +02:00
Eren Gölge	6b907554f8	Implement unified trainer	2021-06-22 16:21:10 +02:00
Eren Gölge	20c4a8c8e1	`tts` model abstraction with `TTSModel`	2021-06-22 16:21:10 +02:00
Eren Gölge	b934665fc0	fix calculation of `loader_start_time`	2021-06-22 16:21:10 +02:00
Eren Gölge	64f0f57757	`TrainerAbstract` and related updates for `TrainerTTS`	2021-06-22 16:21:10 +02:00
Eren Gölge	f077a356e0	rename to	2021-06-22 16:21:10 +02:00
Eren Gölge	4575b70826	merge if branches with the same implementation	2021-06-22 16:21:10 +02:00
Eren Gölge	59be1b9af1	adjust `distribute.py` for the `train_tts.py`	2021-06-22 16:21:10 +02:00
Eren Gölge	614738cc85	downsize melgan test model size	2021-06-22 13:12:52 +02:00
Eren Gölge	4f29725eb6	fix glow-tts `inference()`	2021-06-22 13:12:52 +02:00
Eren Gölge	a87c886497	refactor and fix multi-speaker training in Trainer and Tacotron models	2021-06-22 13:12:52 +02:00
Eren Gölge	0206bb847b	add max_decoder_steps argument to tacotron models	2021-06-22 13:12:52 +02:00
Eren Gölge	cbb52b3d83	fix speaker_manager init	2021-06-22 13:12:52 +02:00
Eren Gölge	d2fd6a34a1	use get_speaker_manager in Trainer and save speakers.json file when needed	2021-06-22 13:12:52 +02:00
Eren Gölge	147550c65f	make style and linter fixes	2021-06-22 13:12:52 +02:00
Eren Gölge	a605dd3d08	Compute d_vectors and speaker_ids separately in TTSDataset	2021-06-22 13:12:52 +02:00
Eren Gölge	f00ef90ce6	rename external speaker embedding arguments as `d_vectors`	2021-06-22 13:12:52 +02:00
Eren Gölge	e7b7268c43	use `to_cuda()` for moving data in `format_batch()`	2021-06-22 13:12:52 +02:00
Eren Gölge	26a3312f0d	change `to(device)` to `type_as` in models	2021-06-22 13:12:52 +02:00
Eren Gölge	c09622459e	init `durations = None`	2021-06-22 13:12:52 +02:00
Eren Gölge	2e31659dd9	docstring fix	2021-06-22 13:12:52 +02:00
Eren Gölge	7a0750a4f5	make style	2021-06-22 13:12:52 +02:00
Eren Gölge	534401377d	styling formatting.py	2021-06-22 13:12:52 +02:00
Eren Gölge	e229f5c081	fix type annotations	2021-06-22 13:12:52 +02:00
Eren Gölge	506189bdee	update glow-tts output shapes to match [B, T, C]	2021-06-22 13:12:52 +02:00
Eren Gölge	f568833d28	formating `cond_input` with a function in Tacotron models	2021-06-22 13:12:52 +02:00
Eren Gölge	254707c610	update imports for `formatters`	2021-06-22 13:12:52 +02:00
Eren Gölge	223502d827	fix glow-tts inference and forward functions for handling `cond_input` and refactor its test	2021-06-22 13:12:52 +02:00
Eren Gölge	d4b1acfa81	refactor `SpeakerManager`	2021-06-22 13:12:52 +02:00
Eren Gölge	26e7c0960c	linter fixes	2021-06-22 13:12:52 +02:00
Eren Gölge	79f7c5da1e	delete separate tts training scripts and pre-commit configuration	2021-06-22 13:12:52 +02:00
Eren Gölge	ca787be193	make style	2021-06-22 13:12:52 +02:00
Eren Gölge	d376647ca0	`logging/__init__.py`	2021-06-22 13:12:52 +02:00
Eren Gölge	bb58a0588e	fix logger imports	2021-06-22 13:12:52 +02:00
Eren Gölge	9bbc924377	import missings	2021-06-22 13:12:52 +02:00
Eren Gölge	b4d4ce0d7e	remove redundant imports	2021-06-22 13:12:52 +02:00
Eren Gölge	aefa71155c	make style	2021-06-22 13:12:52 +02:00
Eren Gölge	88d8a94a10	update extract_tts_spectrogram for `cond_input` API of the models	2021-06-22 13:12:52 +02:00
Eren Gölge	667bb708b6	update `extract_tts_spec...` using `SpeakerManager`	2021-06-22 13:12:52 +02:00
Eren Gölge	830306d2fd	update `extract_tts_spectrograms` for the new model API	2021-06-22 13:12:52 +02:00
Eren Gölge	c673eb8ef8	correct import of `load_meta_data` remove redundant import	2021-06-22 13:12:52 +02:00
Eren Gölge	f0a419546b	fix `Synthesized` for the new `synthesis()`	2021-06-22 13:12:52 +02:00
Eren Gölge	c7ff175592	revert logging.info to print statements for trainer	2021-06-22 13:12:52 +02:00
Eren Gölge	fd6afe5ae5	update `setup_model.py` imports	2021-06-22 13:12:52 +02:00
Eren Gölge	c82d91051d	update align_tts.py model for the trainer	2021-06-22 13:12:52 +02:00
Eren Gölge	4f66e816d1	update align_tts_loss for trainer	2021-06-22 13:12:52 +02:00
Eren Gölge	8213ad8b5f	update aling_tts_config for the trainer	2021-06-22 13:12:52 +02:00
Eren Gölge	8dfd4c91ff	update trainer.py for better logging handling, restoring models and rename init_ functions with get_	2021-06-22 13:12:52 +02:00
Eren Gölge	fb9289d365	update `synthesis.py` for being more generic	2021-06-22 13:12:52 +02:00
Eren Gölge	f121b0ff5d	update `speedy_speech.py` model for trainer	2021-06-22 13:12:52 +02:00
Eren Gölge	843b3ba960	update `speedy_speecy_config.py` for the trainer	2021-06-22 13:12:52 +02:00
Eren Gölge	c9790bee2c	update tacotron model to return `model_outputs`	2021-06-22 13:12:52 +02:00
Eren Gölge	f09ec7e3a7	update glow-tts for the trainer	2021-06-22 13:12:52 +02:00
Eren Gölge	3346a6d9dc	update `sequence_mask` import globally	2021-06-22 13:12:52 +02:00
Eren Gölge	9765b1aa6b	update `glow_tts_config.py` for setting the optimizer and the scheduler	2021-06-22 13:12:52 +02:00
Eren Gölge	6bf6543df8	typing annotation for the trainer	2021-06-22 13:12:52 +02:00
Eren Gölge	57cdddef16	add trainer and train_tts	2021-06-22 13:12:52 +02:00
Eren Gölge	d769af9e3b	remove `truncated` from synthesizer	2021-06-22 13:12:52 +02:00
Eren Gölge	570633ab80	update console logger	2021-06-22 13:12:52 +02:00
Eren Gölge	2ac6b824ca	update `synthesis.py` for the trainer	2021-06-22 13:12:52 +02:00
Eren Gölge	c9e5527070	remove `tts.generic_utils` as all the functions are moved to other files	2021-06-22 13:12:52 +02:00
Eren Gölge	2ab723cd10	update Tacotron models for the trainer	2021-06-22 13:12:52 +02:00
Eren Gölge	d6b6a15b5c	add `gradual_training` argument to tacotron.py	2021-06-22 13:12:52 +02:00
Eren Gölge	118a7f2b43	import missings for tacotron.py	2021-06-22 13:12:52 +02:00
Eren Gölge	c98149d488	mode `setup_model()` to `models/__init__.py`	2021-06-22 13:12:52 +02:00
Eren Gölge	86edf6ab0e	add sequence_mask to `utils.data`	2021-06-22 13:12:52 +02:00
Eren Gölge	c61486b1e3	`setup_loss()` in `layer/__init__.py`	2021-06-22 13:12:52 +02:00
Eren Gölge	f07209d2e0	rename preprocess.py -> formatters.py	2021-06-22 13:12:52 +02:00
Eren Gölge	facb782851	move load_meta_data and related functions to `datasets/__init__.py`	2021-06-22 13:12:52 +02:00
Eren Gölge	b9d4355d20	set test_sentences in config	2021-06-22 13:12:52 +02:00
Eren Gölge	7bdd0eb72f	trainer-API updates	2021-06-22 13:12:52 +02:00
Eren Gölge	0f284841d1	rename MyDataset -> TTSDataset	2021-06-22 13:12:52 +02:00
Edresson	99d40e98d9	fix Lint checks	2021-06-18 14:59:01 -03:00
Edresson	28bec238ca	fix Lint checks	2021-06-18 14:33:50 -03:00
Edresson	83644056e3	fix Lint checks	2021-06-18 14:32:28 -03:00
Edresson Casanova	e78e3cd81e	Merge branch 'dev' into dev	2021-06-18 14:10:03 -03:00
Edresson	b74b510d3c	Compute embeddings and find characters using config file	2021-06-18 14:04:49 -03:00
Adam Froghyar	b0aa189348	Forcing do_trim_silence to False in the extract TTS script	2021-06-14 10:44:00 +02:00
Eren Gölge	d245b5d48f	bump up v0.0.15.1	2021-06-08 09:21:01 +02:00
Edresson	14b209c7e9	Create a batch for more fast inference on LSTM Speaker Encoder	2021-06-05 03:12:17 -03:00
Eren Gölge	b8b79a5e5a	fix `use_cuda` bug in `server.py`	2021-06-04 14:02:53 +02:00
Eren Gölge	203ab855c3	bump up to v0.0.15	2021-06-04 13:52:54 +02:00
Eren Gölge	ba9bcf7c6b	auto upload to pypi on release	2021-06-04 12:20:06 +02:00
Eren Gölge	e66753bd0d	fixup! new japanese model placeholder in `.models.json`	2021-06-03 18:04:28 +02:00
Eren Gölge	bd434636a9	new japanese model placeholder in `.models.json`	2021-06-02 15:54:37 +02:00
Eren Gölge	401fbd8978	bump up to v0.0.15	2021-06-02 11:48:17 +02:00
Eren Gölge	49c5e5d820	maket style japanese PR	2021-06-02 11:44:46 +02:00
Eren Gölge	73b4083c6c	Merge pull request #502 from kaiidams/kaiidams/kokoro Japanese Tacotron 2 model	2021-06-02 10:20:08 +02:00
Katsuya Iida	6d8310d2a9	Set the version to the same with the dev branch.	2021-06-02 07:48:28 +09:00
Alexander Korolev	c1eb9bdcca	fix speaker dim inference	2021-06-01 15:15:26 +02:00
Katsuya Iida	1cc18d1972	Move unittest of Japanese phonemizer.	2021-06-01 18:51:34 +09:00
Alexander Korolev	5b89ef2c6e	fix speaker-embeddings dimension during inference	2021-06-01 11:06:35 +02:00
Eren Gölge	d0ab0382fc	linter fixes	2021-06-01 09:15:32 +02:00
Eren Gölge	bec85ac58d	make style	2021-05-31 16:37:15 +02:00
Eren Gölge	d9f1268f99	init tb_logger None for rank > 0 processes	2021-05-31 15:47:07 +02:00
Eren Gölge	301c516abd	Merge branch 'dev' of https://github.com/coqui-ai/TTS into dev	2021-05-31 15:46:25 +02:00
Edresson	7448177b72	use SpeakerManager on compute embeddings script	2021-05-29 21:11:53 -03:00
Katsuya Iida	c4a5a73f18	update Kokoro config	2021-05-29 19:17:27 +09:00
Katsuya Iida	3a9ac2de4a	Merge remote-tracking branch 'coqui-ai/main' into kaiidams/kokoro	2021-05-29 09:39:23 +09:00
Katsuya Iida	d0c9c1ca5c	Move TTS/tts/utils/japanese	2021-05-29 09:21:47 +09:00
Edresson	099142d4dd	bug fix	2021-05-27 21:50:56 -03:00
Edresson	208bb0f0ee	add batched speaker encoder inference	2021-05-27 20:01:00 -03:00
Edresson	825734a3a9	remove unused embeddings export	2021-05-27 19:10:24 -03:00
Katsuya Iida	c4987e9d4e	Move import at the head of the file.	2021-05-28 00:22:57 +09:00
Eren Gölge	925c08cf95	replace unidecode with anyascii	2021-05-27 14:02:44 +02:00
Eren Gölge	e08c58db3b	bump up version to v0.14.1	2021-05-27 13:11:01 +02:00
Eren Gölge	c6f22aaa67	fix #509	2021-05-27 13:09:15 +02:00
Edresson	1496f271dc	update Compute embeddings script	2021-05-27 00:45:18 -03:00
Edresson	bc5307caa0	add unit tests for SoftmaxAngleProtoLoss and ResnetSpeakerEncoder and bugfix	2021-05-26 20:35:58 -03:00
Edresson	c90037c2e9	solve merge problems	2021-05-26 16:01:30 -03:00
Katsuya Iida	f921a05bdb	Fixed lint errors	2021-05-26 19:02:16 +09:00
Edresson Casanova	f89cb6aec2	Merge branch 'dev' into dev	2021-05-25 17:30:25 -03:00
Edresson	d570c2d790	pylint fix and data loader bug fix	2021-05-26 01:11:37 -03:00
Katsuya Iida	0536aa6d0f	Japanese Tacotron 2 model	2021-05-22 17:12:19 +09:00
Eren Gölge	5482a0f62d	type def for gradual_training	2021-05-19 14:03:26 +02:00
Eren Gölge	df6a98d0c3	type def for gradual_training	2021-05-19 14:00:44 +02:00
Eren Gölge	16576d6408	bump version number	2021-05-19 12:35:10 +02:00
Eren Gölge	8a7c40736c	set use_phonemes false	2021-05-19 01:27:26 +02:00
Eren Gölge	ccfaa6b1d5	add `needs_phonemizer` field to models.json. If set true these models are only compatible with v0.0.13 or below.	2021-05-18 17:57:28 +02:00
Eren Gölge	a14fcf2a13	remove text_processing test	2021-05-18 17:57:28 +02:00
Eren Gölge	d7fae3f515	remove all espeaker and phonemizer deps	2021-05-18 17:57:28 +02:00
Eren Gölge	ced05e812a	move chinese phonemizer	2021-05-18 17:57:28 +02:00
Eren Gölge	218af1d9a2	change `list` to `List` in config	2021-05-18 17:30:27 +02:00
Eren Gölge	4df31f7fbd	unused_speakers argument for ignoring speaker ids in multi-speaker training	2021-05-18 14:50:03 +02:00
Eren Gölge	c2c7dff805	use relaxted coqpit parser	2021-05-18 14:49:47 +02:00
Edresson	856ea19758	bug fix in dataloader and update inference	2021-05-18 03:43:16 -03:00
Eren Gölge	d1b469935d	tacotron DDC LJSpeech recipe	2021-05-17 11:42:14 +02:00
Eren Gölge	34a42d379f	update tacotron_config.py for checking `r` and the docstring	2021-05-17 11:35:30 +02:00
Eren Gölge	12722501bb	styling	2021-05-15 23:48:31 +02:00
Eren Gölge	8b1014d188	add docstrings with default value fixes	2021-05-15 23:45:10 +02:00
Eren Gölge	da49089a72	update melgan training test batch size	2021-05-12 10:12:11 +02:00
Edresson	3433c2f348	add compute embedding for the new speaker encoder	2021-05-12 03:06:46 -03:00
Eren Gölge	0213e1cbf4	update configs for tts models to match the field typed with the expected values	2021-05-12 00:57:38 +02:00
Eren Gölge	715b0a65a0	update main.yml for python x64 fix test	2021-05-12 00:57:29 +02:00
Edresson	3fcc748b2e	implement the Speaker Encoder H/ASP	2021-05-11 16:27:05 -03:00
Eren Gölge	843d1b3d98	linter fixes	2021-05-11 11:30:00 +02:00
Eren Gölge	19fb1d743d	style update	2021-05-11 11:30:00 +02:00
Eren Gölge	6e980b49c4	fix synthesizer.py for Coqpit	2021-05-11 11:29:18 +02:00
Eren Gölge	db14dcd95a	remove old load_config	2021-05-11 11:29:18 +02:00
Eren Gölge	a21ac883dd	add get_cuda()	2021-05-11 11:29:18 +02:00
Eren Gölge	21dd4d7960	fix load_config imports for Coqpit	2021-05-11 11:29:18 +02:00
Eren Gölge	c57f0b46bb	reintro use_gst for backwars compat	2021-05-11 11:29:18 +02:00
Eren Gölge	18e76a2309	fix speaker encoder model initialization	2021-05-11 11:29:18 +02:00
Eren Gölge	10de40bba1	make num_workers mandatory config field	2021-05-11 11:29:18 +02:00
Eren Gölge	df1ddd3539	allow read_json_with_comments for backward compat	2021-05-11 11:29:18 +02:00
Eren Gölge	9f7599e3c3	fix train_encoder for coqpit	2021-05-11 11:29:18 +02:00
Eren Gölge	f8e52965dd	add speaker encoder coqpit	2021-05-11 11:29:18 +02:00
Eren Gölge	ce2bba543e	remove extra from utils and move funcs to io.py	2021-05-11 11:29:18 +02:00
Eren Gölge	812dbc2b06	rm config.json	2021-05-11 11:29:18 +02:00
Eren Gölge	3fde2001b1	train_encoder refactoring for coqpit	2021-05-11 11:29:18 +02:00
Eren Gölge	9ee70af9bb	code styling	2021-05-11 11:29:18 +02:00
Eren Gölge	10db2baa06	global shared Coqpit configs	2021-05-11 11:29:18 +02:00
Eren Gölge	3dec62b183	add Coqpits for the vocoder models	2021-05-11 11:29:18 +02:00
Eren Gölge	6f4eed94f5	remove *.json vocoder configs	2021-05-11 11:29:18 +02:00
Eren Gölge	78b3825d0b	update train scripts for coqpit	2021-05-11 11:29:18 +02:00
Eren Gölge	757e90b1cc	load_config function to initialize the right Coqpit for the given model	2021-05-11 11:29:18 +02:00
Eren Gölge	e6f45b9eb7	update train_vocoder_gan.py for coqpit	2021-05-11 11:29:18 +02:00
Eren Gölge	bcebd69d09	remove bash tts training tests	2021-05-11 11:29:17 +02:00
Eren Gölge	7663bc63c1	add Coqpit configs for the TTS models	2021-05-11 11:29:17 +02:00
Eren Gölge	7227e8f1d2	update train_align_tts.py for coqpit	2021-05-11 11:29:17 +02:00
Eren Gölge	51a7e06945	glow_tts_config.py and train test on python	2021-05-11 11:29:17 +02:00
Eren Gölge	720fe13056	update glow_tts modules and training script for coqpit use	2021-05-11 11:29:17 +02:00
Eren Gölge	816e7ee698	remove default configs.json as replacing with Coqpit configs	2021-05-11 11:29:17 +02:00
Eren Gölge	35341d5482	move bash script based tests to python with coqpit	2021-05-11 11:29:17 +02:00
Eren Gölge	647163397d	coqpit refactoring	2021-05-11 11:29:17 +02:00
Eren Gölge	eaa130e813	fix tacotron for coqpit	2021-05-11 11:29:17 +02:00
Eren Gölge	65d7ad4250	refactor train_speedy_speech.py for coqpit	2021-05-11 11:29:17 +02:00
Eren Gölge	4a58fdfd59	comment out check-arguments before copying fields to the configs	2021-05-11 11:29:17 +02:00
Eren Gölge	05d9543ed8	init GST module using gst config in Tacotron models	2021-05-11 11:29:17 +02:00
Eren Gölge	93a00373f6	move split_dataset	2021-05-11 11:29:17 +02:00

... 3 4 5 6 7 ...

1123 Commits