coqui-tts

Commit Graph

Author	SHA1	Message	Date
Eren Gölge	fae10309e4	Merge pull request #624 from SanjaESC/patch-3 Update train_tacotron.py	2021-01-22 13:29:09 +01:00
Eren Gölge	5ee73c2bae	Merge branch 'dev' of https://github.com/mozilla/TTS into dev	2021-01-22 13:26:27 +01:00
Eren Gölge	5fb611ef40	static image for server index.html	2021-01-22 03:01:53 +01:00
Eren Gölge	ca647cf222	Model Manager to download released models	2021-01-22 02:35:43 +01:00
Eren Gölge	ca8ad9c21e	rename audio._normalize to audio.normalize	2021-01-22 02:33:19 +01:00
Eren Gölge	c990b3a59c	linter fixes and test fixes	2021-01-22 02:32:35 +01:00
Alexander Korolev	f251dc8c0e	Update train_tacotron.py: When attempting to fine-tune a model with "prenet_type": "bn" that was originally trained with "prenet_type": "original", a RuntimeError is thrown that stops the training. By catching the RuntimeError, the required layers can be partially restored and the training will continue without any problems.	2021-01-21 21:16:30 +01:00
Eren Gölge	0ab2eb2664	use synthesizer in both synthesize.py and server.py	2021-01-21 15:54:33 +01:00
Eren Gölge	9addfabc43	wavernn load_checkpoint function	2021-01-21 15:31:13 +01:00
Eren Gölge	50fee59a2c	update synthesizer.py for better interfacing to different models	2021-01-21 15:30:49 +01:00
Eren Gölge	007a4d7139	remove 3rd party wavernn support from server.py and add ModelManager arguments	2021-01-21 15:30:16 +01:00
Eren Gölge	6b6e989fd2	update server readme	2021-01-21 15:29:46 +01:00
Thorsten Mueller	e414582be6	Added option for server ui details page.	2021-01-20 21:56:40 +01:00
root	1bc8fbbd3c	set eval mode when loading models	2021-01-20 02:14:18 +00:00
root	5bd7238153	interpolate spectrogram in vocoder generic utils for matching sample rates	2021-01-20 02:13:01 +00:00
root	ca3743539a	load_checkpoint func for vocoder models	2021-01-20 02:12:29 +00:00
root	ea39715305	read_json_with_comments	2021-01-20 02:11:55 +00:00
root	563bc921d8	optional verbose for audio.py init	2021-01-20 02:11:24 +00:00
root	1faf565e3a	add load_checkpoint func to tts models	2021-01-20 02:10:56 +00:00
root	5c87753e88	glow-tts fix for saving inverse weight	2021-01-20 02:09:42 +00:00
root	3d30dae8f3	.models.json and synthesize.py update for interfacing with model manager	2021-01-20 02:08:58 +00:00
gerazov	b2b4828f17	set requires_grad=False	2021-01-16 19:46:04 +01:00
gerazov	c96f7a2614	TorchSTFT to device fix	2021-01-16 12:21:16 +01:00
root	7beaacc55b	update compute_attention_masks.py	2021-01-13 10:03:57 +00:00
erogol	428c224b88	comment update	2021-01-12 17:31:04 +01:00
erogol	bbc8d665a1	move attention layers to a separate file	2021-01-11 17:27:30 +01:00
erogol	79c841ccd3	mass refactoring and update	2021-01-11 17:26:58 +01:00
erogol	1d961d6f8a	class renaming	2021-01-11 17:26:11 +01:00
erogol	c0a2aa68d3	formatting	2021-01-11 17:25:39 +01:00
erogol	b206162d11	more docstrings	2021-01-11 17:25:04 +01:00
erogol	6e9043c5d2	rename convbnblocks and handle none mask	2021-01-11 17:22:34 +01:00
erogol	921fa5db92	remove attentions from common layers	2021-01-11 15:06:42 +01:00
erogol	cc2b1e043d	docstrings for common layers	2021-01-11 15:06:12 +01:00
erogol	a6f40fef2e	stage missing files	2021-01-08 16:02:56 +01:00
erogol	d382d759b3	small fixes and test fixes	2021-01-08 15:48:40 +01:00
erogol	a6259041d3	docstring for speedyspeech	2021-01-07 14:35:22 +01:00
erogol	de2a542f83	glow-tts bug fix	2021-01-07 13:40:32 +01:00
erogol	14d33662ea	input shapes for tacotron models	2021-01-06 13:19:40 +01:00
erogol	f288e9a260	docstrings for tacotron models	2021-01-06 13:19:40 +01:00
erogol	5a45af48f1	fix	2021-01-06 13:19:40 +01:00
erogol	e7fad928e7	doc strings for the all glow-tts layers	2021-01-06 13:19:40 +01:00
erogol	d3b7284be4	glow-tts comments and refactoring	2021-01-06 13:19:40 +01:00
erogol	7586fbc4de	SS refactoring	2021-01-06 13:19:40 +01:00
erogol	e82d31b6ac	glow-tts refactoring	2021-01-06 13:19:40 +01:00
erogol	29f4329d7f	update glow-tts layers and add some comments	2021-01-06 13:19:40 +01:00
erogol	29cf933831	update SS config	2021-01-06 13:19:40 +01:00
erogol	228ada04b5	update glow-tts ljspeech config	2021-01-06 13:19:40 +01:00
erogol	f352b3534c	make noise augmentation optional	2021-01-06 13:19:40 +01:00
erogol	71c382be14	copy model scale stats file with config.json to the training folder, fixed for model inits	2021-01-06 13:19:40 +01:00
erogol	aa40fe1aa0	SS model refactoring for multi speaker	2021-01-06 13:19:40 +01:00
erogol	eb555855e4	small fixes	2021-01-06 13:19:40 +01:00
erogol	5901a00576	argument rename	2021-01-06 13:19:40 +01:00
erogol	4ef083f0f1	select decoder type for SS	2021-01-06 13:19:40 +01:00
erogol	d5a0190c4b	update copy_config_file to copy_model_files	2021-01-06 13:19:40 +01:00
erogol	8971c59b2d	plot eval alignment score right	2021-01-06 13:19:40 +01:00
erogol	3fa408a5ea	change order BN + ReLU to ReLU + BN for SS	2021-01-06 13:19:40 +01:00
erogol	ac5c9217d1	positional encoding masking for SS	2021-01-06 13:19:40 +01:00
erogol	fede46e96e	pylint and test fixes	2021-01-06 13:19:40 +01:00
erogol	2abe3df153	compute_attention_masks.py	2021-01-06 13:19:40 +01:00
erogol	cf869e8922	add SS files	2021-01-06 13:19:40 +01:00
erogol	e4680e1b99	plot float16 alignments	2021-01-06 13:19:40 +01:00
erogol	13c6665c92	inference for SS	2021-01-06 13:19:40 +01:00
erogol	30788960a8	check SS model parameters	2021-01-06 13:19:40 +01:00
erogol	5cae2c5742	make optional position encoding for speedyspeech	2021-01-06 13:19:40 +01:00
erogol	dc4a16d62e	speedy speech losses	2021-01-06 13:19:40 +01:00
erogol	d62cac7252	fix glow-tts prenet bug fix	2021-01-06 13:19:40 +01:00
erogol	a1d5a9ddda	config update to use noise for augmentation	2021-01-06 13:19:40 +01:00
erogol	022af74d74	update prompt msg	2021-01-06 13:19:40 +01:00
erogol	57ef53bef3	update argument check for non tacotron models	2021-01-06 13:19:40 +01:00
erogol	27a75de15f	update processors for loading attention maps	2021-01-06 13:19:40 +01:00
erogol	fa6907fa0e	update glow-tts parameters and fix rel-attn-win size	2021-01-06 13:19:40 +01:00
erogol	7b20d8cbd3	implement residual BN convolution and add it as an alternative encoder for glow-tts. also generic layers to layers/generic	2021-01-06 13:19:40 +01:00
erogol	973754d893	fix for init glow-tts	2021-01-06 13:19:40 +01:00
erogol	f81af4eb0d	config update disable guided attention for dynamic conv attention	2021-01-06 13:19:40 +01:00
erogol	29b17c0808	bug fix for gradual training	2021-01-06 13:19:40 +01:00
erogol	5c50e104d6	config update	2021-01-06 13:19:40 +01:00
erogol	6478d552dc	tacotron training bug fix	2021-01-06 13:19:40 +01:00
erogol	1dd086577a	tacotron training bug fix	2021-01-06 13:18:41 +01:00
erogol	fa20638083	config for ljspeech dynamic conv attention	2021-01-06 13:18:41 +01:00
erogol	070146e143	add monotonic dynamic convolution attention	2021-01-06 13:18:41 +01:00
erogol	18392bc13a	Merge branch 'dev' of https://github.com/mozilla/TTS into dev	2021-01-06 13:18:08 +01:00
Thorsten Mueller	f673f8f74d	Added support for npy output from tune-wavegrad	2020-12-19 22:51:22 +01:00
Thorsten Mueller	2aa0354b44	Fix for 'NoneType' object has no attribute 'to'	2020-12-19 22:37:03 +01:00
Thorsten Mueller	28a64221ea	Improve robustness on cpu / gpu model mix	2020-12-19 22:23:28 +01:00
erogol	8293751a38	remove mozilla from server page	2020-12-17 12:28:28 +01:00
erogol	639fa29261	update speaker id casting for glow-tts	2020-12-14 16:58:47 +01:00
erogol	999120ecdf	Merge branch 'dev' of https://github.com/mozilla/TTS into dev	2020-12-12 18:50:14 +01:00
erogol	f611e6ac01	Merge branch 'dev' of https://github.com/mozilla/TTS into dev	2020-12-12 18:47:59 +01:00
Jörg Thalheim	62fd4ca70d	inflect negative numbers correctly	2020-12-10 16:47:51 +01:00
Jörg Thalheim	6646682650	cleaners: expand english time	2020-12-10 14:53:20 +01:00
Jörg Thalheim	76138687d3	expand more currencies	2020-12-10 14:53:20 +01:00
erogol	a2859b7ddc	update config args checks	2020-12-10 13:52:57 +01:00
erogol	788cd6f902	fix multi-speaker glow-tts inference	2020-12-10 02:05:48 +01:00
erogol	3d5066e2b8	Merge branch 'dev' of https://github.com/mozilla/TTS into dev	2020-12-10 00:31:03 +01:00
erogol	92cc9630d7	fix glow-tts synthesis for DPP	2020-12-10 00:30:34 +01:00
Eren Gölge	2473b2dc62	Merge pull request #559 from krzim/patch-1 Fix import to grab the encoder model save function	2020-12-10 00:19:32 +01:00
erogol	53679b706d	glow-tts distributed fix	2020-12-09 23:39:09 +01:00
erogol	62bc171db5	Merge branch 'dev' of https://github.com/mozilla/TTS into dev	2020-12-09 15:46:57 +01:00
erogol	df180148e9	use noise augmentation in TTSDataset	2020-12-09 15:46:25 +01:00
Thorsten Mueller	e39628ce2f	Limit filenames to 10 chars	2020-12-08 18:44:19 +01:00

303 Commits