coqui-tts

Commit Graph

Author	SHA1	Message	Date
Eren Gölge	af2d36faeb	update synthesize.py for multi-speaker setting	2021-04-23 18:04:37 +02:00
Edresson	d2b6326b8b	change optimizer initialization for compatibility with Hifi-GAN official implementation	2021-04-23 07:54:39 -03:00
Eren Gölge	9cc17be53a	formatting and a small bug fix in Tacotron model	2021-04-15 16:36:51 +02:00
Eren Gölge	d60a8d7211	show the real waveform on TB too for GAN vocoder training.	2021-04-15 15:30:06 +02:00
Eren Gölge	5fbe926429	change the default TTS model to TacotronDDC	2021-04-15 15:29:44 +02:00
Eren Gölge	b11d1cb845	small fixes	2021-04-12 12:40:55 +02:00
Eren Gölge	a7f6045644	Merge branch 'reformat' into hifigan-reformat	2021-04-12 12:00:17 +02:00
Eren Gölge	f519012dea	reformatting and styling	2021-04-12 11:47:39 +02:00
Eren Gölge	5b70da2e3f	restore schedulers only if training is continuing a previous training inherit nn.Module for TorchSTFT	2021-04-09 19:31:28 +02:00
Eren Gölge	105e0b4d62	vocoder gan training fixes	2021-04-09 11:38:04 +02:00
Eren Gölge	18d9ec8036	format with black	2021-04-09 00:54:59 +02:00
Eren Gölge	e5b9607bc3	isort all imports	2021-04-09 00:45:20 +02:00
Eren Gölge	0e79fa86ad	format with black and pylint 2.7.3	2021-04-09 00:38:08 +02:00
Eren Gölge	cd69da4868	linter fixes #2	2021-04-08 16:57:46 +02:00
Eren Gölge	0ee0458309	remove redundant imports	2021-04-08 11:29:15 +02:00
Eren Gölge	4998ece8d8	allow configuration of optimziers from the config file	2021-04-08 11:28:30 +02:00
Eren Gölge	8daf407652	cache empty	2021-04-08 11:28:30 +02:00
Eren Gölge	3fb78c004a	move scheduler updates to the end of the epoch	2021-04-08 11:28:30 +02:00
Eren Gölge	2a872c98aa	don't call os.exit as it leaves the process resources standing	2021-04-08 11:27:40 +02:00
Eren Gölge	57f6bd1afa	make using different samples for G and D networks optional	2021-04-08 11:26:01 +02:00
rishikksh20	e656e8b108	Remove select size bug	2021-04-08 11:20:33 +02:00
rishikksh20	ef6ff4e95c	Add Exponential LR scheduler check	2021-04-08 11:20:33 +02:00
Eren Gölge	6ad4eba678	gan vocoder train fix in case of restoring models wiht no scheduler is defined	2021-04-06 16:24:50 +02:00
Eren Gölge	b4c2cf80f2	fix eval iter	2021-03-30 14:39:16 +02:00
Eren Gölge	a3a840fd78	linter fixes	2021-03-30 14:39:16 +02:00
Eren Gölge	7a382a5c2b	stowed aligntts commit and small refactoring with feed_forward layers	2021-03-30 14:39:16 +02:00
Eren Gölge	2b3e12ea49	correct imports after refactoring, add AlignTTS (old SSMAS) and some formatting	2021-03-30 14:39:16 +02:00
Eren Gölge	d9c405f0c3	create feedforward folder for SS layers	2021-03-30 14:39:16 +02:00
Eren Gölge	ca2f22cdd7	linter fix	2021-03-30 14:36:12 +02:00
Eren Gölge	d0dcd7d1b8	let the user define outpu.wav file path fix #393	2021-03-30 14:24:31 +02:00
Eren Gölge	3947750dd9	Merge branch 'dev' of https://github.com/coqui-ai/TTS into dev	2021-03-18 14:09:47 +01:00
WeberJulian	596ea2c98a	Add resample script	2021-03-18 13:33:37 +01:00
Eren Gölge	65533f33e9	fix #374	2021-03-18 13:33:00 +01:00
WeberJulian	af96080e17	fix linter issues	2021-03-18 13:33:00 +01:00
WeberJulian	f6cd8e0ecc	test case	2021-03-18 13:33:00 +01:00
WeberJulian	e954e45e57	linter + test	2021-03-18 13:33:00 +01:00
WeberJulian	e598977f3d	Using path.join instead of concat	2021-03-18 13:33:00 +01:00
WeberJulian	c5ef2de73f	Add resample script	2021-03-18 13:33:00 +01:00
Eren Gölge	babc94f63f	fix #374	2021-03-16 19:13:32 +01:00
WeberJulian	11e25a7125	fix linter issues	2021-03-16 19:13:01 +01:00
WeberJulian	b94373afb8	test case	2021-03-16 19:13:01 +01:00
WeberJulian	93fdc0729c	linter + test	2021-03-16 19:13:01 +01:00
WeberJulian	17f197f51e	Using path.join instead of concat	2021-03-16 19:13:01 +01:00
WeberJulian	d6749f030f	Add resample script	2021-03-16 19:13:01 +01:00
Eren Gölge	6c932c8503	print the desc if required parameters are not provided	2021-03-10 15:19:00 +01:00
Eren Gölge	19bb9ba851	fix tts endpoint using list-models argument	2021-03-09 14:06:09 +01:00
Eren Gölge	94805236fb	Merge branch 'dev' of https://github.com/coqui-ai/TTS into dev	2021-03-08 15:21:06 +01:00
Eren Gölge	9a48ba3821	a ton of linter updates	2021-03-08 05:06:54 +01:00
gerazov	2451a813a2	refactored keep_all_best	2021-03-08 02:57:11 +01:00
gerazov	2db40457e8	brushed up printing model load path and best loss path	2021-03-08 02:56:36 +01:00
gerazov	f2e474cd37	loading last checkpoint/best_model works, deleting last best models options added, loading last best_loss added	2021-03-08 02:56:36 +01:00
Eren Gölge	8993120634	do not test server and modelManager until fixing #657	2021-03-08 02:54:47 +01:00
Eren Gölge	39fbf2fe84	Update TTS/bin/find_unique_chars.py Co-authored-by: Jörg Thalheim <Mic92@users.noreply.github.com>	2021-03-08 02:54:47 +01:00
Eren Gölge	ee71eb4eb7	linter fixes	2021-03-08 02:54:47 +01:00
Eren Gölge	62aeacbdd1	save used model characters to the checkpoints	2021-03-08 02:54:47 +01:00
Eren Gölge	c6702b5b9f	find unique characters in a dataset	2021-03-08 02:54:47 +01:00
Eren Gölge	00e0933f43	save_wav with a custom sampling rate	2021-03-08 02:54:47 +01:00
Eren Gölge	8955333e9d	use default vocoder in synthesize.py	2021-03-08 02:54:47 +01:00
Eren Gölge	1c1abb8a9b	docstring update	2021-03-08 02:54:47 +01:00
Eren Gölge	43b951018e	fix the default vocoder name	2021-03-08 02:54:47 +01:00
Eren Gölge	3c961370e7	linter fixes	2021-03-08 02:54:21 +01:00
gerazov	b3c5cc2cdc	final fixes	2021-03-08 02:54:21 +01:00
gerazov	10d5a63d49	updated to current dev	2021-03-08 02:54:21 +01:00
gerazov	6f06e31541	changed train scripts	2021-03-08 02:54:21 +01:00
Branislav Gerazov	b1e3160884	waveRNN fix	2021-03-08 02:54:21 +01:00
Eren Gölge	08581deb61	linter updates	2021-03-08 02:53:02 +01:00
Thorsten Mueller	167901813d	Ups. Added missing ,	2021-03-08 02:53:02 +01:00
Eren Gölge	93a6bdfd6c	linter fixes and version updates for deps	2021-03-08 02:51:10 +01:00
Thorsten Mueller	3eb00e8d93	Set out_path to be required param.	2021-03-08 02:49:15 +01:00
Alexander Korolev	ace430d5e6	fix device mismatch wavegrad training this should fixe the device mismatch as seen here https://github.com/mozilla/TTS/issues/622#issue-789802916	2021-03-08 02:49:15 +01:00
Eren Gölge	83143fbe39	fix #638	2021-03-08 02:48:31 +01:00
Alexander Korolev	b4bc5f6eb1	update fixed stopnet_pos_weight parameter config parameter c.stopnet_pos_weight has currently no effect as it is not used.	2021-03-08 02:48:31 +01:00
Eren Gölge	534e3c67c6	README update, set default models for synthesize.py and server.py. Disable verbose for ap init.	2021-03-08 02:48:31 +01:00
Eren Gölge	d0454461de	Merge branch 'pr/gerazov/650-2' into dev	2021-02-17 13:40:45 +00:00
Eren Gölge	ce0c5eccbd	do not test server and modelManager until fixing #657	2021-02-17 00:35:43 +00:00
gerazov	61c88beb94	refactored keep_all_best	2021-02-15 18:40:17 +01:00
Eren Gölge	3b6ce04332	Update TTS/bin/find_unique_chars.py Co-authored-by: Jörg Thalheim <Mic92@users.noreply.github.com>	2021-02-15 13:02:29 +01:00
Eren Gölge	420901f4c2	linter fixes	2021-02-12 14:41:17 +00:00
Eren Gölge	e774f68aee	save used model characters to the checkpoints	2021-02-12 12:03:42 +00:00
gerazov	310d18325e	brushed up printing model load path and best loss path	2021-02-12 10:55:45 +01:00
Eren Gölge	8b6fd76ad2	find unique characters in a dataset	2021-02-12 09:46:11 +00:00
gerazov	af46727517	loading last checkpoint/best_model works, deleting last best models options added, loading last best_loss added	2021-02-12 02:12:00 +01:00
Eren Gölge	1649ad3431	save_wav with a custom sampling rate	2021-02-11 15:27:20 +00:00
Eren Gölge	0657b38111	use default vocoder in synthesize.py	2021-02-11 15:26:17 +00:00
Eren Gölge	f1799dbd60	docstring update	2021-02-11 11:25:31 +00:00
Eren Gölge	3c2e13ca5c	fix the default vocoder name	2021-02-11 10:36:52 +00:00
Eren Gölge	c619859a3f	linter fixes	2021-02-09 11:43:17 +00:00
gerazov	ad17dc9e76	final fixes	2021-02-06 23:05:01 +01:00
gerazov	8fdd08ea15	updated to current dev	2021-02-06 22:59:52 +01:00
gerazov	2705d27b28	changed train scripts	2021-02-06 22:29:30 +01:00
Eren Gölge	f4f6290eec	Merge branch 'pr/gerazov/641' into dev	2021-02-05 13:14:49 +00:00
Eren Gölge	d49757faaa	linter updates	2021-02-05 13:10:43 +00:00
Branislav Gerazov	cb77aef36c	waveRNN fix	2021-02-04 09:52:03 +01:00
Thorsten Mueller	d74866cb8e	Merge remote-tracking branch 'upstream/dev' into dev Fix for circleci error mentioned in PR https://github.com/mozilla/TTS/pull/637	2021-02-02 19:40:18 +01:00
Thorsten Mueller	a82152eef3	Ups. Added missing ,	2021-02-02 19:29:16 +01:00
Thorsten Mueller	4cb4fcf02c	Set out_path to be required param.	2021-02-02 19:29:16 +01:00
Eren Gölge	5c46543765	linter fixes and version updates for deps	2021-02-01 13:18:56 +00:00
Eren Gölge	5beed0ddcd	Merge branch 'dev' of https://github.com/mozilla/TTS into dev	2021-02-01 11:27:14 +00:00
Eren Gölge	c7407571fa	fix #638	2021-02-01 10:05:55 +00:00
Eren Gölge	dfdac1def9	Merge pull request #636 from thorstenMueller/dev Set out_path to be required param in compute_statistics.py.	2021-01-29 18:08:31 +01:00
Thorsten Mueller	44c4a49745	Set out_path to be required param.	2021-01-29 17:23:38 +01:00
Alexander Korolev	e81ebec7a8	fix device mismatch wavegrad training this should fixe the device mismatch as seen here https://github.com/mozilla/TTS/issues/622#issue-789802916	2021-01-29 15:18:59 +01:00
Alexander Korolev	ca28e05ed7	update fixed stopnet_pos_weight parameter config parameter c.stopnet_pos_weight has currently no effect as it is not used.	2021-01-27 16:33:25 +01:00
Eren Gölge	25c86ca715	README update, set default models for synthesize.py and server.py. Disable verbose for ap init.	2021-01-27 11:47:03 +01:00
Eren Gölge	877f0bbfba	manifest.in update	2021-01-26 02:56:55 +01:00
Eren Gölge	82e029529e	fix manifest file	2021-01-25 13:27:54 +01:00
Eren Gölge	57b668fd86	fixing dome pypi issues	2021-01-25 13:06:12 +01:00
Eren Gölge	60c1bb93d9	fixes before first PyPI release	2021-01-25 11:16:20 +01:00
Eren Gölge	fae10309e4	Merge pull request #624 from SanjaESC/patch-3 Update train_tacotron.py	2021-01-22 13:29:09 +01:00
Eren Gölge	c990b3a59c	linter fixes and test fixes	2021-01-22 02:32:35 +01:00
Alexander Korolev	f251dc8c0e	Update train_tacotron.py When attempting to fine-tune a model with "prenet_type": "bn" that was originally trained with "prenet_type": "original", a RuntimeError is thrown that stops the training. By catching the RuntimeError, the required layers can be partially restored and the training will continue without any problems.	2021-01-21 21:16:30 +01:00
Eren Gölge	0ab2eb2664	use synthesizer in both synthesize.py and server.pu	2021-01-21 15:54:33 +01:00
Eren Gölge	6b6e989fd2	update server readme	2021-01-21 15:29:46 +01:00
root	3d30dae8f3	.models.json and synthesize.py update for interfacing with model manager	2021-01-20 02:08:58 +00:00
root	7beaacc55b	update compute_attention_masks.py	2021-01-13 10:03:57 +00:00
erogol	cc2b1e043d	docstrings for common layers	2021-01-11 15:06:12 +01:00
erogol	d382d759b3	small fixes and test fixes	2021-01-08 15:48:40 +01:00
erogol	f352b3534c	make noise augmentation optional	2021-01-06 13:19:40 +01:00
erogol	d5a0190c4b	update copy_config_file to copy_model_files	2021-01-06 13:19:40 +01:00
erogol	8971c59b2d	plot eval alignment score right	2021-01-06 13:19:40 +01:00
erogol	fede46e96e	pylint and test fixes	2021-01-06 13:19:40 +01:00
erogol	2abe3df153	compute_attention_masks.py	2021-01-06 13:19:40 +01:00
erogol	cf869e8922	add SS files	2021-01-06 13:19:40 +01:00
erogol	29b17c0808	bug fix for gradual training	2021-01-06 13:19:40 +01:00
erogol	6478d552dc	tacotron training bug fix	2021-01-06 13:19:40 +01:00
erogol	1dd086577a	tacotron training bug fix	2021-01-06 13:18:41 +01:00
Thorsten Mueller	f673f8f74d	Added support for npy output from tune-wavegrad	2020-12-19 22:51:22 +01:00
Thorsten Mueller	2aa0354b44	Fix for 'NoneType' object has no attribute 'to'	2020-12-19 22:37:03 +01:00
Thorsten Mueller	28a64221ea	Improve robostness on cpu / gpu model mix	2020-12-19 22:23:28 +01:00
Eren Gölge	2473b2dc62	Merge pull request #559 from krzim/patch-1 Fix import to grab the encoder model save function	2020-12-10 00:19:32 +01:00
erogol	53679b706d	glow-tts distributed fix	2020-12-09 23:39:09 +01:00
erogol	62bc171db5	Merge branch 'dev' of https://github.com/mozilla/TTS into dev	2020-12-09 15:46:57 +01:00
erogol	df180148e9	use noise augmentation in TTSDataset	2020-12-09 15:46:25 +01:00
Thorsten Mueller	e39628ce2f	Limit filenames to 10 chars	2020-12-08 18:44:19 +01:00
erogol	06612ce305	test fixes	2020-12-07 15:57:34 +01:00
erogol	0252a07fa6	Merge branch 'dev' of https://github.com/mozilla/TTS into dev	2020-12-07 11:31:55 +01:00
erogol	482e725752	sync torch calls before logging training results	2020-12-07 11:30:19 +01:00
erogol	affe1c1138	setup training scripts for computing phonemes before training optionally. And define data_loaders before starting training and re-use them instead of re-define for every train and eval calls. This is to enable better instance filtering based on input length.	2020-12-07 11:26:57 +01:00
erogol	7c3cdced1a	make speaker_mapping a global variable to prevent reload. Fix glow-tts training	2020-12-01 03:23:25 +01:00
Thorsten Mueller	06a389bc08	Added option for saving raw spectograms	2020-11-27 15:49:55 +01:00
erogol	4b92ac0f92	tune_wavegrad update	2020-11-25 14:49:48 +01:00
erogol	d8c1b5b73d	print max lengths in tacotron training	2020-11-25 14:49:07 +01:00
erogol	1229554c42	use native amp	2020-11-25 14:48:54 +01:00
erogol	8a820930c6	compute_embedding update	2020-11-25 14:46:08 +01:00
erogol	aa2b31a1b0	use 'enabled' argument to control autocast	2020-11-17 14:22:01 +01:00
Qingping Hou	b0b97d636f	speed up metafile build for voxceleb	2020-11-14 23:45:17 -08:00
erogol	a2a142dc39	Merge branch 'dev' of https://github.com/mozilla/TTS into dev	2020-11-14 13:02:19 +01:00
erogol	c65712426a	change noise scheduling for wavegrad. Compute beta values externally to enable better flexibility	2020-11-14 13:01:10 +01:00
erogol	5a59467f34	scaler fix for wavegrad and wavernn. Save and load scaler	2020-11-14 13:00:35 +01:00
erogol	d8511efa8f	use native amp for tacotron training	2020-11-14 12:59:28 +01:00
Qingping Hou	0cc3650ef6	support loading config in yaml	2020-11-14 00:13:53 -08:00
erogol	a7aefd5c50	use pytorch amp for mixed precision training for Tacotron	2020-11-12 12:51:56 +01:00
erogol	67e2b664e5	compute embeddings and create speakers.json	2020-11-12 12:51:17 +01:00
erogol	f8fd300b3e	bug fix	2020-11-10 12:53:39 +01:00
erogol	016d3503da	compute embeddings with speaker encoder	2020-11-10 12:51:02 +01:00
erogol	21364331d2	Merge branch 'dev' of https://github.com/mozilla/TTS into dev	2020-11-09 13:31:12 +01:00
erogol	c76a617072	linter updates	2020-11-09 13:18:35 +01:00
erogol	c80225544e	tune wavegrad to fine the best noise schedule for inferece	2020-11-06 13:04:46 +01:00
erogol	ef04d7fae7	bug fix for wavernn training	2020-10-30 14:08:41 +01:00
erogol	183fe56d95	Merge branch 'ssim_loss' into dev	2020-10-29 23:49:09 +01:00
krzim	2202e171c5	Fix import to grab the encoder model save function I saw that this was recently changed but I'm not sure if it should have been. This is the correct function given the arguments provided to it in the train loop.	2020-10-29 18:03:11 -04:00
erogol	73581cd94c	renaming train scripts and updating tests	2020-10-29 16:50:07 +01:00
erogol	946a0c0fb9	bug fixes for single speaker glow-tts, enable torch based amp. Make amp optional for wavegrad. Bug fixes for synthesis setup for glow-tts	2020-10-29 15:45:50 +01:00
erogol	14c2381207	weight norm and torch based amp training for wavegrad	2020-10-29 12:31:43 +01:00
erogol	c8a4c771a8	train wavegrad updates	2020-10-29 12:31:43 +01:00
erogol	670f44aa18	enable compute stats by vocoder config	2020-10-29 12:31:43 +01:00
erogol	f79bbbbd00	use Adam for wavegras instead of RAdam	2020-10-29 12:31:43 +01:00
erogol	7bcdb7ac35	wavegrad updates	2020-10-29 12:31:43 +01:00
erogol	a1582a0e12	fix distributed training for train_* scripts	2020-10-29 12:31:43 +01:00
erogol	e02cd6a220	initial wavegrad layers model and trainig script	2020-10-29 12:30:37 +01:00
erogol	e723b99888	handle distributed model as saving	2020-10-29 12:30:37 +01:00
Eren Gölge	26c18b61c9	Merge pull request #553 from Edresson/dev bug fix in the inference with GlowTTS	2020-10-28 18:49:31 +01:00
erogol	9d0ae2bfb4	wavernn dataloader handling for short samples and mixed precision training	2020-10-28 12:31:01 +01:00
Edresson	f01502a9db	bug fix in glowTTS sythesize	2020-10-27 16:30:16 -03:00
Eren Gölge	f4b8170bd1	Merge pull request #545 from Edresson/dev GlowTTS zeroshot TTS support	2020-10-27 15:23:41 +01:00
erogol	0becef4b58	small updates	2020-10-27 12:17:38 +01:00
sanjaesc	2ee47e9568	fix pylint once again	2020-10-27 12:17:38 +01:00
sanjaesc	bef3f2020b	compute audio feat on dataload	2020-10-27 12:17:38 +01:00
sanjaesc	91e5f8b63d	added to device cpu/gpu + formatting	2020-10-27 12:17:38 +01:00
sanjaesc	016a77fcf2	fix formatting + pylint	2020-10-27 12:17:38 +01:00
sanjaesc	e8294cb9db	fixing pylint errors	2020-10-27 12:17:38 +01:00
sanjaesc	878b7c373e	added feature preprocessing if not set in config	2020-10-27 12:17:38 +01:00
sanjaesc	e495e03ea1	some minor changes to wavernn	2020-10-27 12:17:38 +01:00
Alex K	6378fa2b07	add initial wavernn support	2020-10-27 12:17:38 +01:00
Edresson	d9540a5857	add blank token in sequence for encrease glowtts results	2020-10-25 15:08:28 -03:00
Edresson	fbea058c59	add parse speakers function	2020-10-24 16:10:05 -03:00
Edresson	07345099ee	GlowTTS zero-shot TTS Support	2020-10-24 15:58:39 -03:00
Edresson	b7f9ebd32b	add check arguments for GlowTTS and multispeaker training bug fix	2020-10-19 17:17:58 -03:00
erogol	c5074cfd8e	general purpose distribute.py	2020-10-08 01:30:42 +02:00
Edresson	99d5a0ac07	add Speaker Conditional GST support	2020-09-29 16:09:27 -03:00
erogol	154f90bc44	format speaker encoder imports	2020-09-28 11:19:19 +02:00
mueller91	cfeeef7a7f	fix: broken imports and missing files after merging in latest commits from mozilla/dev into mueller91/dev. speaker_encoder's config.json and visuals.py are missing in the current dev branch of MozillaTTS, and some imports are broken.	2020-09-22 20:10:41 +02:00
mueller91	1fe5eb054f	Merge branch 'dev' of https://github.com/mozilla/TTS into dev Conflicts: TTS/bin/train_encoder.py requirements.txt	2020-09-22 19:58:53 +02:00
mueller91	df4caec4b7	add: check_config for speaker_encoder	2020-09-22 19:52:09 +02:00
erogol	10258724d1	linter fixes	2020-09-22 03:54:16 +02:00
erogol	a6df617eb1	Merge branch 'glow-tts-amp-time_depth_conv' into dev	2020-09-21 14:23:45 +02:00
erogol	8150d5727e	Merge branch 'dev' of https://github.com/mozilla/TTS into dev	2020-09-21 14:21:55 +02:00
erogol	e0b9fa887f	glow-tts modules added	2020-09-21 14:15:40 +02:00
mueller	6b0621c794	cleanup	2020-09-17 16:46:43 +02:00
mueller	a273b1a210	add: add random noise to dataset	2020-09-17 14:23:40 +02:00
mueller	e36a3067e4	add: save wavs instead feats to storage. This is done in order to mitigate staleness when caching and loading from data storage	2020-09-17 14:14:30 +02:00
mueller	1511076fde	add: Configurable encoder dataset storage to reduce disk I/O add: Averaged time for data loader to console and Tensorboard output	2020-09-17 12:29:38 +02:00
maxbachmann	60ce862113	use difflib for string matching	2020-09-14 23:55:34 +02:00
erogol	498a3ea36f	fix condition check	2020-09-12 03:39:01 +02:00
erogol	15e6ab3912	glow-tts module renaming updates	2020-09-12 03:33:36 +02:00
erogol	f9001a4bdd	refactor and fix compat issues for speaker encoder	2020-09-11 17:17:07 +02:00
erogol	df19428ec6	rename the project to old TTS	2020-09-09 12:27:23 +02:00

... 3 4 5 6 7 ...

407 Commits