coqui-tts

Commit Graph

Author	SHA1	Message	Date
Eren Gölge	93a6bdfd6c	linter fixes and version updates for deps	2021-03-08 02:51:10 +01:00
Eren Gölge	a30a231566	unpin cython version and commentout pyworld in audio.py causing dep issues	2021-03-08 02:50:15 +01:00
Thorsten Mueller	3eb00e8d93	Set out_path to be required param.	2021-03-08 02:49:15 +01:00
Alexander Korolev	ace430d5e6	fix device mismatch wavegrad training this should fixe the device mismatch as seen here https://github.com/mozilla/TTS/issues/622#issue-789802916	2021-03-08 02:49:15 +01:00
Eren Gölge	83143fbe39	fix #638	2021-03-08 02:48:31 +01:00
Eren Gölge	30c3bef3f9	move hubconf	2021-03-08 02:48:31 +01:00
Eren Gölge	bbea6a0884	hubconf.py and load .models.json from the defualt location by mange.py	2021-03-08 02:48:31 +01:00
Eren Gölge	90d4f08d6c	reorder imports	2021-03-08 02:48:31 +01:00
Eren Gölge	db231c83fc	distill import statement, check python version in setup.py	2021-03-08 02:48:31 +01:00
Thorsten Mueller	915ec1faac	Added info if model already downloaded in --list_models	2021-03-08 02:48:31 +01:00
Alexander Korolev	b4bc5f6eb1	update fixed stopnet_pos_weight parameter config parameter c.stopnet_pos_weight has currently no effect as it is not used.	2021-03-08 02:48:31 +01:00
Eren Gölge	534e3c67c6	README update, set default models for synthesize.py and server.py. Disable verbose for ap init.	2021-03-08 02:48:31 +01:00
kirianguiller	7f36d91131	update chinese model	2021-03-01 14:55:05 +01:00
Eren Gölge	547bfc4ce9	bug fix	2021-02-18 18:24:03 +00:00
Eren Gölge	adaeec57ec	Merge branch 'dev' of https://github.com/mozilla/TTS into dev	2021-02-18 17:21:09 +00:00
Eren Gölge	5b70c8ba4f	enable backward compat for loading the best model	2021-02-18 17:20:36 +00:00
Eren Gölge	e4f81d6856	Merge pull request #654 from kirianguiller/chinese-implementation Chinese implementation (merge into dev)	2021-02-18 17:15:32 +01:00
kirianguiller	22a6bbfa80	remove gst handling in synthetizer.py class	2021-02-17 20:53:56 +01:00
kirianguiller	3911b87e54	remove re.Match typing in '_number_replace()'	2021-02-17 20:53:56 +01:00
kirianguiller	fb0655d1e7	modify according to PR reviews	2021-02-17 20:53:56 +01:00
kirianguiller	c4c7bc1b88	<add> Chinese mandarin implementation (tacotron2)	2021-02-17 20:53:56 +01:00
Eren Gölge	d0454461de	Merge branch 'pr/gerazov/650-2' into dev	2021-02-17 13:40:45 +00:00
Eren Gölge	a8ea0ea6ce	Docstrings for audioprocessor	2021-02-17 13:35:41 +00:00
Eren Gölge	f6e6314910	add RUSLAN dataset preprocessor	2021-02-17 13:35:23 +00:00
Eren Gölge	ce0c5eccbd	do not test server and modelManager until fixing #657	2021-02-17 00:35:43 +00:00
gerazov	61c88beb94	refactored keep_all_best	2021-02-15 18:40:17 +01:00
Eren Gölge	eb543c027e	Merge branch 'dev' of https://github.com/mozilla/TTS into dev	2021-02-15 17:06:40 +00:00
Eren Gölge	8a106e0527	fix #655	2021-02-15 17:06:03 +00:00
Eren Gölge	216945e653	Merge pull request #647 from adonispujols/patch-1 Easy Fix for #454 (which was somehow deleted?)	2021-02-15 13:17:17 +01:00
Eren Gölge	06a3ba2fe2	linter update	2021-02-15 12:10:19 +00:00
Eren Gölge	7f58fa365b	Merge branch 'save_characters' into dev	2021-02-15 12:07:28 +00:00
Eren Gölge	ff218e2370	remove redundancy	2021-02-15 12:07:02 +00:00
Eren Gölge	80af8ca5e1	Update TTS/utils/arguments.py Co-authored-by: Jörg Thalheim <Mic92@users.noreply.github.com>	2021-02-15 13:03:59 +01:00
Eren Gölge	3b6ce04332	Update TTS/bin/find_unique_chars.py Co-authored-by: Jörg Thalheim <Mic92@users.noreply.github.com>	2021-02-15 13:02:29 +01:00
Eren Gölge	dc3596dad4	model_manager tests	2021-02-15 11:29:22 +00:00
Eren Gölge	77e630348e	author , license and contact info in .model.json	2021-02-15 11:02:21 +00:00
Eren Gölge	e1bc823e44	Merge branch 'pr/nmstoker/652' into dev	2021-02-15 10:57:12 +00:00
nmstoker	33bcdc6ff8	Updating models list to include EK1 TTS/vocoder	2021-02-14 23:44:05 +00:00
Eren Gölge	420901f4c2	linter fixes	2021-02-12 14:41:17 +00:00
Eren Gölge	4244096ccb	update test_text_processing for espeak-ng	2021-02-12 14:07:26 +00:00
Eren Gölge	b28c724c04	remove _phoneme_punctuations	2021-02-12 12:10:57 +00:00
Eren Gölge	7ab527d17e	save default model chars to the training config file	2021-02-12 12:06:46 +00:00
Eren Gölge	593cedee14	parse_characters function	2021-02-12 12:05:56 +00:00
Eren Gölge	2abfff17f9	enable saving model characters in io.py	2021-02-12 12:04:41 +00:00
Eren Gölge	918f007a11	docstring update	2021-02-12 12:04:07 +00:00
Eren Gölge	e774f68aee	save used model characters to the checkpoints	2021-02-12 12:03:42 +00:00
gerazov	0e78e31dbf	reformated docstrings in arguments.py	2021-02-12 11:36:01 +01:00
gerazov	310d18325e	brushed up printing model load path and best loss path	2021-02-12 10:55:45 +01:00
Eren Gölge	8b6fd76ad2	find unique characters in a dataset	2021-02-12 09:46:11 +00:00
gerazov	af46727517	loading last checkpoint/best_model works, deleting last best models options added, loading last best_loss added	2021-02-12 02:12:00 +01:00
Eren Gölge	a1e595790d	use default vocoders in server.pu	2021-02-11 15:31:39 +00:00
Eren Gölge	8aa6a0decb	set an output_sample_rate in synthesizer and use it for writing the wav file	2021-02-11 15:28:07 +00:00
Eren Gölge	0c52d27d65	return the json entry of the downloaded model	2021-02-11 15:27:41 +00:00
Eren Gölge	1649ad3431	save_wav with a custom sampling rate	2021-02-11 15:27:20 +00:00
Eren Gölge	43f54d2dce	fix make_symbols	2021-02-11 15:26:52 +00:00
Eren Gölge	0657b38111	use default vocoder in synthesize.py	2021-02-11 15:26:17 +00:00
Eren Gölge	2043a9b5f5	define default vocoders	2021-02-11 15:25:55 +00:00
Eren Gölge	ff27690ca7	bug fix	2021-02-11 13:43:29 +00:00
Eren Gölge	bc131208be	fix spelling of a def argument and parse phonemes from config.json if use_phonemes is True	2021-02-11 13:04:47 +00:00
Eren Gölge	f1799dbd60	docstring update	2021-02-11 11:25:31 +00:00
Eren Gölge	3baec4ea96	add missing phonemes to test_config.json	2021-02-11 11:14:39 +00:00
Eren Gölge	a3d1e65b34	Merge branch 'pr/adonispujols/646' into dev	2021-02-11 10:37:29 +00:00
Eren Gölge	3c2e13ca5c	fix the default vocoder name	2021-02-11 10:36:52 +00:00
Adonis Pujols	48011a8b58	add encoding="utf-8"	2021-02-11 05:26:06 -05:00
Adonis Pujols	b29a7e9645	spelling error. should be multiband not mulitband	2021-02-11 04:49:28 -05:00
Adonis Pujols	6c824a6629	spelling error. should be multiband not mulitband	2021-02-11 04:48:53 -05:00
Eren Gölge	b08b8ca2a1	add russian phoneme char	2021-02-10 13:30:59 +00:00
Eren Gölge	9cad435288	css10 dataset preprocessor	2021-02-09 15:11:26 +00:00
Eren Gölge	cea5e517f2	download github model releases by model manager	2021-02-09 14:24:14 +00:00
Eren Gölge	c619859a3f	linter fixes	2021-02-09 11:43:17 +00:00
gerazov	e507373b55	final final fixes	2021-02-06 23:08:47 +01:00
gerazov	ad17dc9e76	final fixes	2021-02-06 23:05:01 +01:00
gerazov	8fdd08ea15	updated to current dev	2021-02-06 22:59:52 +01:00
gerazov	2705d27b28	changed train scripts	2021-02-06 22:29:30 +01:00
gerazov	4f8f274d6e	restructured arg parsing and processing to utils	2021-02-06 22:25:56 +01:00
Eren Gölge	e7e880f514	fix gdown	2021-02-05 13:42:24 +00:00
Eren Gölge	f4f6290eec	Merge branch 'pr/gerazov/641' into dev	2021-02-05 13:14:49 +00:00
Eren Gölge	d49757faaa	linter updates	2021-02-05 13:10:43 +00:00
Branislav Gerazov	f063545325	improve robustness of defining wavernn in config file	2021-02-05 13:26:33 +01:00
Branislav Gerazov	24ffa9e9f6	update wavernn test config, delete cap=True	2021-02-05 13:10:02 +01:00
Branislav Gerazov	cb77aef36c	waveRNN fix	2021-02-04 09:52:03 +01:00
Thorsten Mueller	d74866cb8e	Merge remote-tracking branch 'upstream/dev' into dev Fix for circleci error mentioned in PR https://github.com/mozilla/TTS/pull/637	2021-02-02 19:40:18 +01:00
Thorsten Mueller	a82152eef3	Ups. Added missing ,	2021-02-02 19:29:16 +01:00
Thorsten Mueller	4cb4fcf02c	Set out_path to be required param.	2021-02-02 19:29:16 +01:00
Thorsten Mueller	c75ea74914	Added info if model already downloaded in --list_models	2021-02-02 19:29:16 +01:00
Eren Gölge	2edab4b3f9	disable pw in audio that causes numpy issue	2021-02-01 17:05:03 +00:00
Eren Gölge	5c46543765	linter fixes and version updates for deps	2021-02-01 13:18:56 +00:00
Eren Gölge	8774e37444	unpin cython version and commentout pyworld in audio.py causing dep issues	2021-02-01 11:34:05 +00:00
Eren Gölge	5beed0ddcd	Merge branch 'dev' of https://github.com/mozilla/TTS into dev	2021-02-01 11:27:14 +00:00
Eren Gölge	c7407571fa	fix #638	2021-02-01 10:05:55 +00:00
Eren Gölge	dfdac1def9	Merge pull request #636 from thorstenMueller/dev Set out_path to be required param in compute_statistics.py.	2021-01-29 18:08:31 +01:00
Thorsten Mueller	44c4a49745	Set out_path to be required param.	2021-01-29 17:23:38 +01:00
Eren Gölge	536366dc0a	Merge pull request #635 from SanjaESC/patch-1 fix device mismatch wavegrad training	2021-01-29 16:42:25 +01:00
Eren Gölge	0354b6f35e	move hubconf	2021-01-29 15:28:32 +00:00
Eren Gölge	aa5f24608a	hubconf.py and load .models.json from the defualt location by mange.py	2021-01-29 15:28:26 +00:00
Alexander Korolev	e81ebec7a8	fix device mismatch wavegrad training this should fixe the device mismatch as seen here https://github.com/mozilla/TTS/issues/622#issue-789802916	2021-01-29 15:18:59 +01:00
Eren Gölge	a926aa106d	reorder imports	2021-01-29 01:36:21 +01:00
Eren Gölge	8a6eee7fec	distill import statement, check python version in setup.py	2021-01-28 17:04:08 +01:00
Eren Gölge	131a163c95	Merge pull request #628 from thorstenMueller/dev Added info if model already downloaded in --list_models	2021-01-28 13:10:06 +01:00
Alexander Korolev	ca28e05ed7	update fixed stopnet_pos_weight parameter config parameter c.stopnet_pos_weight has currently no effect as it is not used.	2021-01-27 16:33:25 +01:00
Thorsten Mueller	ccbd542eb0	Added info if model already downloaded in --list_models	2021-01-27 16:19:02 +01:00
Eren Gölge	25c86ca715	README update, set default models for synthesize.py and server.py. Disable verbose for ap init.	2021-01-27 11:47:03 +01:00
Eren Gölge	4f32e77006	platform indep. way to fetch user data folder	2021-01-26 17:32:43 +01:00
Eren Gölge	0117c811a9	add a button to index.html to see the model details	2021-01-26 12:33:27 +01:00
Eren Gölge	a3adcaccdb	Merge branch 'pr/thorstenMueller/623' into dev	2021-01-26 12:19:39 +01:00
Eren Gölge	b464cab9b8	setup.py update and pylint fixes	2021-01-26 02:57:50 +01:00
Eren Gölge	660d61aeeb	maximum_path_numpy and CYTHON adabtable import	2021-01-26 02:57:07 +01:00
Eren Gölge	877f0bbfba	manifest.in update	2021-01-26 02:56:55 +01:00
Eren Gölge	82e029529e	fix manifest file	2021-01-25 13:27:54 +01:00
Eren Gölge	57b668fd86	fixing dome pypi issues	2021-01-25 13:06:12 +01:00
Eren Gölge	60c1bb93d9	fixes before first PyPI release	2021-01-25 11:16:20 +01:00
Thorsten Mueller	afb7db2a1d	Removed unneeded check and removed specific taco2 model name.	2021-01-22 16:22:50 +01:00
Eren Gölge	fae10309e4	Merge pull request #624 from SanjaESC/patch-3 Update train_tacotron.py	2021-01-22 13:29:09 +01:00
Eren Gölge	5ee73c2bae	Merge branch 'dev' of https://github.com/mozilla/TTS into dev	2021-01-22 13:26:27 +01:00
Eren Gölge	5fb611ef40	static image for server index.html	2021-01-22 03:01:53 +01:00
Eren Gölge	ca647cf222	Model Manager to download released models	2021-01-22 02:35:43 +01:00
Eren Gölge	ca8ad9c21e	rename audio._normalize to audio.normalize	2021-01-22 02:33:19 +01:00
Eren Gölge	c990b3a59c	linter fixes and test fixes	2021-01-22 02:32:35 +01:00
Alexander Korolev	f251dc8c0e	Update train_tacotron.py When attempting to fine-tune a model with "prenet_type": "bn" that was originally trained with "prenet_type": "original", a RuntimeError is thrown that stops the training. By catching the RuntimeError, the required layers can be partially restored and the training will continue without any problems.	2021-01-21 21:16:30 +01:00
Eren Gölge	0ab2eb2664	use synthesizer in both synthesize.py and server.pu	2021-01-21 15:54:33 +01:00
Eren Gölge	9addfabc43	wavernn load_checkpoint function	2021-01-21 15:31:13 +01:00
Eren Gölge	50fee59a2c	update synthesizer.py for better interfacing to different models	2021-01-21 15:30:49 +01:00
Eren Gölge	007a4d7139	remove 3rd paty wavernn support from server.py and add ModelManager arguments	2021-01-21 15:30:16 +01:00
Eren Gölge	6b6e989fd2	update server readme	2021-01-21 15:29:46 +01:00
Thorsten Mueller	e414582be6	Added option for server ui details page.	2021-01-20 21:56:40 +01:00
root	1bc8fbbd3c	set eval mode whe nloading models	2021-01-20 02:14:18 +00:00
root	5bd7238153	interpolate spectrogram in vocoder generic utils for matching sample rates	2021-01-20 02:13:01 +00:00
root	ca3743539a	load_checkpoint func for vocoder models	2021-01-20 02:12:29 +00:00
root	ea39715305	read_json_with_comments	2021-01-20 02:11:55 +00:00
root	563bc921d8	optional verbose for audio.py init	2021-01-20 02:11:24 +00:00
root	1faf565e3a	add load_checkpoint func to tts models	2021-01-20 02:10:56 +00:00
root	5c87753e88	glow-tts fix for saving inverse weight	2021-01-20 02:09:42 +00:00
root	3d30dae8f3	.models.json and synthesize.py update for interfacing with model manager	2021-01-20 02:08:58 +00:00
gerazov	b2b4828f17	set requires_grad=False	2021-01-16 19:46:04 +01:00
gerazov	c96f7a2614	TorchSTFT to device fix	2021-01-16 12:21:16 +01:00
root	7beaacc55b	update compute_attention_masks.py	2021-01-13 10:03:57 +00:00
erogol	428c224b88	commet update	2021-01-12 17:31:04 +01:00
erogol	bbc8d665a1	move attention layers to a sperate file	2021-01-11 17:27:30 +01:00
erogol	79c841ccd3	mass refactoring and update	2021-01-11 17:26:58 +01:00
erogol	1d961d6f8a	cladd renaming	2021-01-11 17:26:11 +01:00
erogol	c0a2aa68d3	formatting	2021-01-11 17:25:39 +01:00
erogol	b206162d11	more docstrings	2021-01-11 17:25:04 +01:00
erogol	6e9043c5d2	rename convbnblocks and handle none mask	2021-01-11 17:22:34 +01:00
erogol	921fa5db92	remove attentions from common layers	2021-01-11 15:06:42 +01:00
erogol	cc2b1e043d	docstrings for common layers	2021-01-11 15:06:12 +01:00
erogol	a6f40fef2e	stage missing files	2021-01-08 16:02:56 +01:00
erogol	d382d759b3	small fixes and test fixes	2021-01-08 15:48:40 +01:00
erogol	a6259041d3	docstring for speedyspeech	2021-01-07 14:35:22 +01:00
erogol	de2a542f83	glow-tts bug fix	2021-01-07 13:40:32 +01:00
erogol	14d33662ea	input shapes for tacotron models	2021-01-06 13:19:40 +01:00
erogol	f288e9a260	docstrings for taoctron models	2021-01-06 13:19:40 +01:00
erogol	5a45af48f1	fix	2021-01-06 13:19:40 +01:00
erogol	e7fad928e7	doc strings for the all glow-tts layers	2021-01-06 13:19:40 +01:00
erogol	d3b7284be4	glow-tts comments and refactoring	2021-01-06 13:19:40 +01:00
erogol	7586fbc4de	SS refactoring	2021-01-06 13:19:40 +01:00
erogol	e82d31b6ac	glow ttss refactoring	2021-01-06 13:19:40 +01:00
erogol	29f4329d7f	update glow-tts layers and add some comments	2021-01-06 13:19:40 +01:00
erogol	29cf933831	update SS condif	2021-01-06 13:19:40 +01:00
erogol	228ada04b5	update glow-tts ljspeech config	2021-01-06 13:19:40 +01:00
erogol	f352b3534c	make noise augmentation optional	2021-01-06 13:19:40 +01:00
erogol	71c382be14	copy model scale stats file with config.json to the trianing folder, fixed for model inits	2021-01-06 13:19:40 +01:00
erogol	aa40fe1aa0	SS model refacotring for multi speaker	2021-01-06 13:19:40 +01:00
erogol	eb555855e4	small fixes	2021-01-06 13:19:40 +01:00
erogol	5901a00576	argument rename	2021-01-06 13:19:40 +01:00
erogol	4ef083f0f1	select decoder type for SS	2021-01-06 13:19:40 +01:00
erogol	d5a0190c4b	update copy_config_file to copy_model_files	2021-01-06 13:19:40 +01:00
erogol	8971c59b2d	plot eval alignment score right	2021-01-06 13:19:40 +01:00
erogol	3fa408a5ea	change order BN + ReLU to ReLU + BN for SS	2021-01-06 13:19:40 +01:00
erogol	ac5c9217d1	positional encoding masking for SS	2021-01-06 13:19:40 +01:00
erogol	fede46e96e	pylint and test fixes	2021-01-06 13:19:40 +01:00
erogol	2abe3df153	compute_attention_masks.py	2021-01-06 13:19:40 +01:00
erogol	cf869e8922	add SS files	2021-01-06 13:19:40 +01:00
erogol	e4680e1b99	plot float16 alignments	2021-01-06 13:19:40 +01:00
erogol	13c6665c92	inference for SS	2021-01-06 13:19:40 +01:00
erogol	30788960a8	check SS model parameters	2021-01-06 13:19:40 +01:00
erogol	5cae2c5742	make optional position encoding for speedyspeech	2021-01-06 13:19:40 +01:00
erogol	dc4a16d62e	speedy speehc losses	2021-01-06 13:19:40 +01:00
erogol	d62cac7252	fix glow-tts prenet bug fix	2021-01-06 13:19:40 +01:00
erogol	a1d5a9ddda	config update tyo use noise for augmentation	2021-01-06 13:19:40 +01:00
erogol	022af74d74	update prompt msg	2021-01-06 13:19:40 +01:00
erogol	57ef53bef3	update argumnet check for non tacotron models	2021-01-06 13:19:40 +01:00
erogol	27a75de15f	update processors for loading attention maps	2021-01-06 13:19:40 +01:00
erogol	fa6907fa0e	update glow-tts parameters and fix rel-attn-win size	2021-01-06 13:19:40 +01:00
erogol	7b20d8cbd3	implement residual BN convolution and add it as an alternative encoder for glow-tts. also generic layers to layers/generic	2021-01-06 13:19:40 +01:00
erogol	973754d893	fix for init glow-tts	2021-01-06 13:19:40 +01:00
erogol	f81af4eb0d	config update disable guided attention for dynamic conv attention	2021-01-06 13:19:40 +01:00
erogol	29b17c0808	bug fix for gradual training	2021-01-06 13:19:40 +01:00
erogol	5c50e104d6	config update	2021-01-06 13:19:40 +01:00
erogol	6478d552dc	tacotron training bug fix	2021-01-06 13:19:40 +01:00
erogol	1dd086577a	tacotron training bug fix	2021-01-06 13:18:41 +01:00
erogol	fa20638083	config for ljspeech dynamic conv attention	2021-01-06 13:18:41 +01:00
erogol	070146e143	add monotonic dynamic convolution attention	2021-01-06 13:18:41 +01:00
erogol	18392bc13a	Merge branch 'dev' of https://github.com/mozilla/TTS into dev	2021-01-06 13:18:08 +01:00
Thorsten Mueller	f673f8f74d	Added support for npy output from tune-wavegrad	2020-12-19 22:51:22 +01:00
Thorsten Mueller	2aa0354b44	Fix for 'NoneType' object has no attribute 'to'	2020-12-19 22:37:03 +01:00
Thorsten Mueller	28a64221ea	Improve robostness on cpu / gpu model mix	2020-12-19 22:23:28 +01:00
erogol	8293751a38	remove mozilla from server page	2020-12-17 12:28:28 +01:00
erogol	639fa29261	update speaker id casting for glow-tts	2020-12-14 16:58:47 +01:00
erogol	999120ecdf	Merge branch 'dev' of https://github.com/mozilla/TTS into dev	2020-12-12 18:50:14 +01:00
erogol	f611e6ac01	Merge branch 'dev' of https://github.com/mozilla/TTS into dev	2020-12-12 18:47:59 +01:00

... 2 3 4 5 6 ...

515 Commits