coqui-tts

Commit Graph

Author	SHA1	Message	Date
Eren Gölge	34f8a74e4d	remove `truncated` from synthesizer	2021-06-28 17:03:19 +02:00
Eren Gölge	178eccbc16	update console logger	2021-06-28 17:03:19 +02:00
Eren Gölge	a20a1c7d06	rename preprocess.py -> formatters.py	2021-06-28 17:03:19 +02:00
Eren Gölge	8def3c87af	trainer-API updates	2021-06-28 17:03:19 +02:00
Michael Hansen	67869e77f9	Use gruut for phonemization	2021-06-25 14:41:05 +02:00
Eren Gölge	d0ab0382fc	linter fixes	2021-06-01 09:15:32 +02:00
Eren Gölge	d9f1268f99	init tb_logger None for rank > 0 processes	2021-05-31 15:47:07 +02:00
Eren Gölge	8a7c40736c	set use_phonemes false	2021-05-19 01:27:26 +02:00
Eren Gölge	ccfaa6b1d5	add `needs_phonemizer` field to models.json. If set true these models are only compatible with v0.0.13 or below.	2021-05-18 17:57:28 +02:00
Eren Gölge	c2c7dff805	use relaxted coqpit parser	2021-05-18 14:49:47 +02:00
Eren Gölge	715b0a65a0	update main.yml for python x64 fix test	2021-05-12 00:57:29 +02:00
Eren Gölge	843d1b3d98	linter fixes	2021-05-11 11:30:00 +02:00
Eren Gölge	19fb1d743d	style update	2021-05-11 11:30:00 +02:00
Eren Gölge	6e980b49c4	fix synthesizer.py for Coqpit	2021-05-11 11:29:18 +02:00
Eren Gölge	db14dcd95a	remove old load_config	2021-05-11 11:29:18 +02:00
Eren Gölge	a21ac883dd	add get_cuda()	2021-05-11 11:29:18 +02:00
Eren Gölge	21dd4d7960	fix load_config imports for Coqpit	2021-05-11 11:29:18 +02:00
Eren Gölge	9ee70af9bb	code styling	2021-05-11 11:29:18 +02:00
Eren Gölge	757e90b1cc	load_config function to initialize the right Coqpit for the given model	2021-05-11 11:29:18 +02:00
Eren Gölge	35341d5482	move bash script based tests to python with coqpit	2021-05-11 11:29:17 +02:00
Eren Gölge	647163397d	coqpit refactoring	2021-05-11 11:29:17 +02:00
Eren Gölge	9c18e40f64	black formatting	2021-05-11 11:29:17 +02:00
Eren Gölge	79d7215142	config refactor #5 WIP	2021-05-11 11:29:17 +02:00
Eren Gölge	dc50f5f0b0	config refactor #4 WIP	2021-05-11 11:28:35 +02:00
Eren Gölge	97bd5f9734	[ci skip] config update #3 WIP	2021-05-11 11:28:35 +02:00
Eren Gölge	e092ae40dc	config update WIP	2021-05-11 11:28:35 +02:00
Eren Gölge	06f80a4806	update check argument	2021-05-11 11:28:35 +02:00
Eren Gölge	8cb27267a4	formatting	2021-05-03 14:26:35 +02:00
Eren Gölge	87d674a038	bumpup librosa version to 0.8.0	2021-05-03 14:25:09 +02:00
Eren Gölge	4719414f2e	remove imports	2021-04-27 11:25:17 +02:00
Eren Gölge	add97cddc1	move function and remove import	2021-04-27 11:22:56 +02:00
Eren Gölge	734e6a515c	bug fix	2021-04-27 10:27:45 +02:00
Eren Gölge	2f0716073e	enable multi-speaker CoquiTTS models for synthesize.py	2021-04-26 19:36:53 +02:00
Eren Gölge	f37b488876	Merge branch 'speaker-manager' of https://github.com/coqui-ai/TTS into speaker-manager	2021-04-26 15:25:25 +02:00
Eren Gölge	b82daa5e86	style and linter fixes	2021-04-26 15:22:24 +02:00
Eren Gölge	4cf211348d	styling and linting	2021-04-23 18:04:37 +02:00
Eren Gölge	7eb0c60d2e	let synthesizer to pass speaker encoder file paths to speaker manager	2021-04-23 18:04:37 +02:00
Eren Gölge	f9f3d04d14	remove moved function	2021-04-23 18:04:37 +02:00
Eren Gölge	6d0f5e0459	use SpeakerManager in Synthesizer	2021-04-23 18:04:37 +02:00
Eren Gölge	3ace2440fa	fix a mistake from rebase	2021-04-23 18:04:37 +02:00
Eren Gölge	aadb2106ec	code styling	2021-04-23 18:04:37 +02:00
Eren Gölge	af7baa3387	refactoring to allow defining the speaker file externally	2021-04-23 18:04:37 +02:00
kirianguiller	7dccbfdcd5	handle multi speaker and gst in Synthetizer class	2021-04-23 18:04:37 +02:00
WeberJulian	4205284f92	Change name of the functions	2021-04-23 10:09:55 +02:00
WeberJulian	a26498181b	Change back the default value	2021-04-22 16:10:17 +02:00
Julian Weber	355e1f47ab	fix dumb mistake	2021-04-22 15:50:29 +02:00
Julian Weber	c125b71f36	fix windows support	2021-04-22 15:14:24 +02:00
Eren Gölge	e1d960da9e	use SpeakerManager in Synthesizer	2021-04-21 13:13:27 +02:00
Eren Gölge	1038fd420d	fix a mistake from rebase	2021-04-16 19:39:47 +02:00
Eren Gölge	47e356cb48	code styling	2021-04-16 16:01:40 +02:00
Eren Gölge	25328aad00	refactoring to allow defining the speaker file externally	2021-04-16 15:59:57 +02:00
kirianguiller	48ae52a9a3	handle multi speaker and gst in Synthetizer class	2021-04-16 15:54:49 +02:00
Eren Gölge	7cada1a949	remove noise	2021-04-15 15:30:45 +02:00
Eren Gölge	a7f6045644	Merge branch 'reformat' into hifigan-reformat	2021-04-12 12:00:17 +02:00
Eren Gölge	f519012dea	reformatting and styling	2021-04-12 11:47:39 +02:00
Eren Gölge	18d9ec8036	format with black	2021-04-09 00:54:59 +02:00
Eren Gölge	e5b9607bc3	isort all imports	2021-04-09 00:45:20 +02:00
Eren Gölge	0e79fa86ad	format with black and pylint 2.7.3	2021-04-09 00:38:08 +02:00
Eren Gölge	6ee211c137	remove stft params causing warning	2021-04-08 11:28:30 +02:00
Eren Gölge	7726dfca99	change the upper bound in sound normalization	2021-04-08 11:26:01 +02:00
Eren Gölge	e0e3b12b26	pass all parameters explicity to _istft	2021-04-08 11:23:20 +02:00
Eren Gölge	d57f416957	small fixes	2021-04-08 11:22:30 +02:00
Eren Gölge	f890454de3	linter fixes	2021-04-07 12:36:03 +02:00
Eren Gölge	9782d9ea5d	[ci skip] implement #418	2021-04-06 16:24:50 +02:00
Eren Gölge	f46a275b22	update docstring 2	2021-04-06 16:24:50 +02:00
Eren Gölge	ec94ff3691	update docstring	2021-04-06 16:24:50 +02:00
Eren Gölge	2048095e9a	audio.py fix	2021-04-06 16:24:50 +02:00
Eren Gölge	e0b3008c31	allow choosing the log function used for amptodb conversion	2021-04-06 16:24:50 +02:00
Eren Gölge	e3c052382b	fix loading always best_model when continue	2021-04-01 03:41:15 +02:00
Eren Gölge	7a382a5c2b	stowed aligntts commit and small refactoring with feed_forward layers	2021-03-30 14:39:16 +02:00
Eren Gölge	1ac99ce0d0	if git is not available set git has 'unknown'	2021-03-30 14:39:16 +02:00
Guy Elsmore-Paddock	15459627cc	Fix `UnicodeEncodeError` on Windows Platforms Prevents the following error from appearing when running training on Windows platforms: ``` UnicodeEncodeError: 'charmap' codec can't encode characters in position: character maps to <undefined> ```	2021-03-20 17:30:00 -04:00
Eren Gölge	6e68637f48	bug fix	2021-03-18 13:33:23 +01:00
Eren Gölge	aeb4f82233	bug fix	2021-03-18 13:33:23 +01:00
Eren Gölge	f06603a0db	force utf8	2021-03-18 13:33:23 +01:00
Eren Gölge	e5bb317242	fix model manager	2021-03-10 17:01:19 +01:00
Eren Gölge	d260fb03a2	fix handling scale_stats.npy for models downloaded from Github rls	2021-03-10 16:40:30 +01:00
Eren Gölge	4aba4e5b1e	linter fx	2021-03-10 15:33:11 +01:00
Eren Gölge	6c932c8503	print the desc if required parameters are not provided	2021-03-10 15:19:00 +01:00
Eren Gölge	9e84c8a623	do not copy scale_stats if exist in the output folder	2021-03-10 15:13:55 +01:00
Eren Gölge	7782034e7e	fix #369	2021-03-10 15:13:21 +01:00
Eren Gölge	599149a7e5	downloading models from github releases	2021-03-10 11:09:01 +01:00
Eren Gölge	9a48ba3821	a ton of linter updates	2021-03-08 05:06:54 +01:00
Eren Gölge	e03a426378	bug fix	2021-03-08 02:59:48 +01:00
kirianguiller	628afe5cb0	remove gst handling in synthetizer.py class	2021-03-08 02:59:48 +01:00
kirianguiller	9ab07f94e2	modify according to PR reviews	2021-03-08 02:59:48 +01:00
kirianguiller	42ba30eb8f	<add> Chinese mandarin implementation (tacotron2)	2021-03-08 02:59:24 +01:00
kirianguiller	49665783a6	remove gst handling in synthetizer.py class	2021-03-08 02:57:11 +01:00
kirianguiller	0d4525322c	modify according to PR reviews	2021-03-08 02:57:11 +01:00
kirianguiller	e6fd118cf8	<add> Chinese mandarin implementation (tacotron2)	2021-03-08 02:57:11 +01:00
Eren Gölge	e3102e753c	enable backward compat for loading the best model	2021-03-08 02:57:11 +01:00
gerazov	2451a813a2	refactored keep_all_best	2021-03-08 02:57:11 +01:00
gerazov	8cefa76bae	reformated docstrings in arguments.py	2021-03-08 02:57:11 +01:00
gerazov	2db40457e8	brushed up printing model load path and best loss path	2021-03-08 02:56:36 +01:00
gerazov	f2e474cd37	loading last checkpoint/best_model works, deleting last best models options added, loading last best_loss added	2021-03-08 02:56:36 +01:00
Eren Gölge	4111df6769	Docstrings for audioprocessor	2021-03-08 02:54:47 +01:00
Adonis Pujols	89b7f01534	add encoding="utf-8"	2021-03-08 02:54:47 +01:00
Eren Gölge	ffceccb021	fix #655	2021-03-08 02:54:47 +01:00
Eren Gölge	534c341f16	linter update	2021-03-08 02:54:47 +01:00
Eren Gölge	93a83c0068	Update TTS/utils/arguments.py Co-authored-by: Jörg Thalheim <Mic92@users.noreply.github.com>	2021-03-08 02:54:47 +01:00
Eren Gölge	ee71eb4eb7	linter fixes	2021-03-08 02:54:47 +01:00
Eren Gölge	194f82de51	save default model chars to the training config file	2021-03-08 02:54:47 +01:00
Eren Gölge	e06c93fe81	model_manager tests	2021-03-08 02:54:47 +01:00
Eren Gölge	d30608ab17	set an output_sample_rate in synthesizer and use it for writing the wav file	2021-03-08 02:54:47 +01:00
Eren Gölge	3ccb015cd8	return the json entry of the downloaded model	2021-03-08 02:54:47 +01:00
Eren Gölge	00e0933f43	save_wav with a custom sampling rate	2021-03-08 02:54:47 +01:00
Eren Gölge	6bd8485d10	bug fix	2021-03-08 02:54:47 +01:00
Eren Gölge	49771f2541	download github model releases by model manager	2021-03-08 02:54:21 +01:00
Eren Gölge	3c961370e7	linter fixes	2021-03-08 02:54:21 +01:00
gerazov	2b5cb24db7	final final fixes	2021-03-08 02:54:21 +01:00
gerazov	2daca15802	restructured arg parsing and processing to utils	2021-03-08 02:54:21 +01:00
Eren Gölge	2fbe4a1b8a	fix gdown	2021-03-08 02:54:21 +01:00
Eren Gölge	08581deb61	linter updates	2021-03-08 02:53:02 +01:00
Eren Gölge	a30a231566	unpin cython version and commentout pyworld in audio.py causing dep issues	2021-03-08 02:50:15 +01:00
Eren Gölge	bbea6a0884	hubconf.py and load .models.json from the defualt location by mange.py	2021-03-08 02:48:31 +01:00
Eren Gölge	db231c83fc	distill import statement, check python version in setup.py	2021-03-08 02:48:31 +01:00
Thorsten Mueller	915ec1faac	Added info if model already downloaded in --list_models	2021-03-08 02:48:31 +01:00
Eren Gölge	534e3c67c6	README update, set default models for synthesize.py and server.py. Disable verbose for ap init.	2021-03-08 02:48:31 +01:00
Eren Gölge	2edab4b3f9	disable pw in audio that causes numpy issue	2021-02-01 17:05:03 +00:00
Eren Gölge	4f32e77006	platform indep. way to fetch user data folder	2021-01-26 17:32:43 +01:00
Eren Gölge	b464cab9b8	setup.py update and pylint fixes	2021-01-26 02:57:50 +01:00
Eren Gölge	ca647cf222	Model Manager to download released models	2021-01-22 02:35:43 +01:00
Eren Gölge	ca8ad9c21e	rename audio._normalize to audio.normalize	2021-01-22 02:33:19 +01:00
Eren Gölge	c990b3a59c	linter fixes and test fixes	2021-01-22 02:32:35 +01:00
Eren Gölge	0ab2eb2664	use synthesizer in both synthesize.py and server.pu	2021-01-21 15:54:33 +01:00
root	ea39715305	read_json_with_comments	2021-01-20 02:11:55 +00:00
root	563bc921d8	optional verbose for audio.py init	2021-01-20 02:11:24 +00:00
erogol	7586fbc4de	SS refactoring	2021-01-06 13:19:40 +01:00
erogol	71c382be14	copy model scale stats file with config.json to the trianing folder, fixed for model inits	2021-01-06 13:19:40 +01:00
erogol	7b0a93d2f8	fix	2020-11-26 11:44:52 +01:00
erogol	0c6f7e4c77	resample audio if flag set true	2020-11-26 11:30:48 +01:00
erogol	e3b7157146	remove contextlib	2020-11-25 15:22:01 +01:00
erogol	1229554c42	use native amp	2020-11-25 14:48:54 +01:00
erogol	8b0e0846a3	temporary travis check	2020-11-17 14:17:03 +01:00
Qingping Hou	0cc3650ef6	support loading config in yaml	2020-11-14 00:13:53 -08:00
erogol	6cc464ead6	fix ton of tesnting bugs	2020-11-12 16:33:29 +01:00
erogol	ea976b0543	python compat update for contextlib	2020-11-06 13:34:11 +01:00
erogol	c80225544e	tune wavegrad to fine the best noise schedule for inferece	2020-11-06 13:04:46 +01:00
erogol	946a0c0fb9	bug fixes for single speaker glow-tts, enable torch based amp. Make amp optional for wavegrad. Bug fixes for synthesis setup for glow-tts	2020-10-29 15:45:50 +01:00
erogol	e723b99888	handle distributed model as saving	2020-10-29 12:30:37 +01:00
WeberJulian	3c212be5a8	fix: fixing the RenamingUnpickler fix	2020-09-22 17:36:05 +02:00
erogol	10258724d1	linter fixes	2020-09-22 03:54:16 +02:00
erogol	c008003506	do not check sample rate as loading stats file for normalization to enable interpolation for different sample rate vocoder	2020-09-18 12:52:19 +02:00
erogol	15e6ab3912	glow-tts module renaming updates	2020-09-12 03:33:36 +02:00
erogol	540d811dd5	solve pickling models after module name change	2020-09-11 12:03:39 +02:00
erogol	df19428ec6	rename the project to old TTS	2020-09-09 12:27:23 +02:00

... 3 4 5 6 7

346 Commits