coqui-tts

Commit Graph

Author	SHA1	Message	Date
Eren Gölge	d42748082a	update argument name external_speaker_embedding_dim -> speaker_embedding_dim add inference_noise_scale argument to glow-tts	2021-04-23 18:04:37 +02:00
Eren Gölge	2da81f5bb6	add load_chekpoint to speaker encoder	2021-04-23 18:04:37 +02:00
Eren Gölge	1229ccbf07	update argument name in server.py	2021-04-23 18:04:37 +02:00
Eren Gölge	af2d36faeb	update synthesize.py for multi-speaker setting	2021-04-23 18:04:37 +02:00
Eren Gölge	99dc07a7dd	add ```unique``` param to keep scglow models compatible (they are duplicate symbols ins the character set)	2021-04-23 18:04:37 +02:00
Eren Gölge	c955a12428	set the default layer size compatible with scglow	2021-04-23 18:04:37 +02:00
Eren Gölge	3ace2440fa	fix a mistake from rebase	2021-04-23 18:04:37 +02:00
Eren Gölge	aadb2106ec	code styling	2021-04-23 18:04:37 +02:00
Eren Gölge	af7baa3387	refactoring to allow defining the speaker file externally	2021-04-23 18:04:37 +02:00
kirianguiller	7dccbfdcd5	handle multi speaker and gst in Synthetizer class	2021-04-23 18:04:37 +02:00
Edresson	d2b6326b8b	change optimizer initialization for compatibility with Hifi-GAN official implementation	2021-04-23 07:54:39 -03:00
WeberJulian	4205284f92	Change name of the functions	2021-04-23 10:09:55 +02:00
WeberJulian	a26498181b	Change back the default value	2021-04-22 16:10:17 +02:00
Julian Weber	355e1f47ab	fix dumb mistake	2021-04-22 15:50:29 +02:00
Julian Weber	c125b71f36	fix windows support	2021-04-22 15:14:24 +02:00
Jörg Thalheim	f5fd7f78d4	server: also listen to ipv6 The [::] address will listen to both ipv4/ipv6 addresses.	2021-04-22 12:38:55 +02:00
Eren Gölge	ef37633cb3	[ci skip] use prenet_dropout by default with Tacotron models	2021-04-22 12:38:55 +02:00
Eren Gölge	e1d960da9e	use SpeakerManager in Synthesizer	2021-04-21 13:13:27 +02:00
Eren Gölge	04b6881b66	add ```unique``` argument to make_symbols to fix the incompat. issue of the SC-Glow models	2021-04-21 13:12:35 +02:00
Eren Gölge	790946faec	formating speakers.py	2021-04-21 13:12:11 +02:00
Eren Gölge	ab313814de	initial SpeakerManager implementation	2021-04-21 13:11:46 +02:00
Eren Gölge	09890c7421	fix the glow-tts in setup_model	2021-04-21 13:10:40 +02:00
Eren Gölge	8764d02eb2	update argument name external_speaker_embedding_dim -> speaker_embedding_dim add inference_noise_scale argument to glow-tts	2021-04-21 13:09:44 +02:00
Eren Gölge	8b40720977	add load_chekpoint to speaker encoder	2021-04-21 13:09:04 +02:00
Eren Gölge	37cad38c27	update argument name in server.py	2021-04-21 13:08:45 +02:00
Eren Gölge	9bccee9da8	update synthesize.py for multi-speaker setting	2021-04-21 13:08:25 +02:00
Eren Gölge	d2fa8add1f	add ```unique``` param to keep scglow models compatible (they are duplicate symbols ins the character set)	2021-04-16 19:40:13 +02:00
Eren Gölge	d9612a4351	set the default layer size compatible with scglow	2021-04-16 19:40:13 +02:00
Eren Gölge	1038fd420d	fix a mistake from rebase	2021-04-16 19:39:47 +02:00
Eren Gölge	47e356cb48	code styling	2021-04-16 16:01:40 +02:00
Eren Gölge	25328aad00	refactoring to allow defining the speaker file externally	2021-04-16 15:59:57 +02:00
kirianguiller	48ae52a9a3	handle multi speaker and gst in Synthetizer class	2021-04-16 15:54:49 +02:00
Eren Gölge	a53958ae3a	fix urls for the new models	2021-04-15 17:05:00 +02:00
Eren Gölge	9cc17be53a	formatting and a small bug fix in Tacotron model	2021-04-15 16:36:51 +02:00
Eren Gölge	1ad838bc83	add newly released models under .model.json	2021-04-15 16:06:10 +02:00
Eren Gölge	7cada1a949	remove noise	2021-04-15 15:30:45 +02:00
Eren Gölge	d60a8d7211	show the real waveform on TB too for GAN vocoder training.	2021-04-15 15:30:06 +02:00
Eren Gölge	5fbe926429	change the default TTS model to TacotronDDC	2021-04-15 15:29:44 +02:00
Eren Gölge	3de5a89154	optionally enable prenet dropout at inference time for tacotron models	2021-04-13 13:24:56 +02:00
Eren Gölge	28a2fed8a3	update hifigan in .model.json	2021-04-12 16:48:05 +02:00
Eren Gölge	abaf36861a	aligntts model .model.json placeholder	2021-04-12 16:43:52 +02:00
Eren Gölge	480e2f7888	docstring update and better handling make_symbols	2021-04-12 16:40:49 +02:00
Eren Gölge	b735076bb4	linter fixes	2021-04-12 13:14:11 +02:00
Eren Gölge	b11d1cb845	small fixes	2021-04-12 12:40:55 +02:00
Eren Gölge	a7f6045644	Merge branch 'reformat' into hifigan-reformat	2021-04-12 12:00:17 +02:00
Eren Gölge	f519012dea	reformatting and styling	2021-04-12 11:47:39 +02:00
Eren Gölge	9011dddf77	tacotron DDC placeholder in models.json	2021-04-12 04:06:27 +02:00
Eren Gölge	d295d5de97	remove torch.no_grad from TorchSTFT	2021-04-10 19:43:57 +02:00
Eren Gölge	5b70da2e3f	restore schedulers only if training is continuing a previous training inherit nn.Module for TorchSTFT	2021-04-09 19:31:28 +02:00
Eren Gölge	2c71c6d8cd	[ci skip]update gan vocoder configs to reflect the recent changes	2021-04-09 17:15:32 +02:00
Eren Gölge	2b529f60c8	update default hifigan config	2021-04-09 11:40:06 +02:00
Eren Gölge	105e0b4d62	vocoder gan training fixes	2021-04-09 11:38:04 +02:00
Eren Gölge	87ee6ceb57	style update #3	2021-04-09 01:17:15 +02:00
Eren Gölge	18d9ec8036	format with black	2021-04-09 00:54:59 +02:00
Eren Gölge	e5b9607bc3	isort all imports	2021-04-09 00:45:20 +02:00
Eren Gölge	0e79fa86ad	format with black and pylint 2.7.3	2021-04-09 00:38:08 +02:00
Eren Gölge	cd69da4868	linter fixes #2	2021-04-08 16:57:46 +02:00
Eren Gölge	4d3e1e9d9a	linter fix	2021-04-08 14:57:46 +02:00
Eren Gölge	53f54898bc	small fixes	2021-04-08 14:22:47 +02:00
Eren Gölge	006b1d3aaa	bug fix	2021-04-08 13:17:45 +02:00
Eren Gölge	3f0993aebe	remove junk	2021-04-08 12:17:02 +02:00
Eren Gölge	0ee0458309	remove redundant imports	2021-04-08 11:29:15 +02:00
Eren Gölge	773f1db6fa	refactor HifiGAN discriminator	2021-04-08 11:28:30 +02:00
Eren Gölge	15f362d5b1	formatting	2021-04-08 11:28:30 +02:00
Eren Gölge	aee24b0704	set different seed in gan_dataset when it is multi-workers	2021-04-08 11:28:30 +02:00
Eren Gölge	6ee211c137	remove stft params causing warning	2021-04-08 11:28:30 +02:00
Eren Gölge	4998ece8d8	allow configuration of optimziers from the config file	2021-04-08 11:28:30 +02:00
Eren Gölge	8daf407652	cache empty	2021-04-08 11:28:30 +02:00
Eren Gölge	3fb78c004a	move scheduler updates to the end of the epoch	2021-04-08 11:28:30 +02:00
Eren Gölge	2a872c98aa	don't call os.exit as it leaves the process resources standing	2021-04-08 11:27:40 +02:00
Eren Gölge	7cecd2fb2e	add hifigan D	2021-04-08 11:27:40 +02:00
Eren Gölge	13dca6e6b6	revert some of Hifigan generator updates	2021-04-08 11:27:40 +02:00
Eren Gölge	02bc776c35	prevenet grad in TorchSTFT	2021-04-08 11:27:40 +02:00
Eren Gölge	cf44624df8	more docstring	2021-04-08 11:27:40 +02:00
Eren Gölge	d95b1458e8	Linter fixes and docstrings for HiFiGAN	2021-04-08 11:27:40 +02:00
Eren Gölge	bd7a1c177b	fix #419	2021-04-08 11:26:41 +02:00
Eren Gölge	7726dfca99	change the upper bound in sound normalization	2021-04-08 11:26:01 +02:00
Eren Gölge	57f6bd1afa	make using different samples for G and D networks optional	2021-04-08 11:26:01 +02:00
Eren Gölge	67f8248492	placeholder for finetuned sam hifigan model	2021-04-08 11:25:29 +02:00
Eren Gölge	241e968df1	load_checkpoint for hifigan and no_grad for inference	2021-04-08 11:25:29 +02:00
Eren Gölge	de3a04f104	some commeting for Generator loss and check if the argument is defines in the config file	2021-04-08 11:25:29 +02:00
Eren Gölge	ff07c5f5e3	update TorchSTFT to enable melspec	2021-04-08 11:25:29 +02:00
Eren Gölge	4a5b1d4ac2	update hifigan config	2021-04-08 11:24:21 +02:00
Eren Gölge	e0e3b12b26	pass all parameters explicity to _istft	2021-04-08 11:23:20 +02:00
Eren Gölge	f0e76ee135	initial models.json entry for universal hifigan	2021-04-08 11:23:20 +02:00
Eren Gölge	d57f416957	small fixes	2021-04-08 11:22:30 +02:00
Eren Gölge	8c9e1c9e58	hifigan implementation update	2021-04-08 11:21:43 +02:00
Eren Gölge	a14d7bc5db	hifigan config update	2021-04-08 11:20:33 +02:00
Eren Gölge	8d4fd79cd7	update hifigan config	2021-04-08 11:20:33 +02:00
rishikksh20	e656e8b108	Remove select size bug	2021-04-08 11:20:33 +02:00
rishikksh20	b533474e3b	Remove minor bugs and make code trainable	2021-04-08 11:20:33 +02:00
rishikksh20	ef6ff4e95c	Add Exponential LR scheduler check	2021-04-08 11:20:33 +02:00
rishikksh20	1535777f64	1) Add ExponentialLR	2021-04-08 11:18:36 +02:00
rishikksh20	c20a6b1185	* Format the model definition * Update code and integrate training code	2021-04-08 11:18:36 +02:00
rishikksh20	39b5845810	1) Add hifigan json files 2) Rename MPD disc 3) Re-format remove weight norm generator	2021-04-08 11:14:39 +02:00
rishikksh20	7b7c5d635f	1) Combine MSD with Multi-Period disc 2) Add remove weight norm layer on Generator	2021-04-08 11:14:39 +02:00
rishikksh20	4493feb95c	Add HiFi-GAN v1 generator and discriminator classes	2021-04-08 11:14:39 +02:00
Eren Gölge	c86c559349	docstring and optional padding in TorchSTFT	2021-04-07 12:36:15 +02:00
Eren Gölge	f890454de3	linter fixes	2021-04-07 12:36:03 +02:00
Eren Gölge	9782d9ea5d	[ci skip] implement #418	2021-04-06 16:24:50 +02:00
Eren Gölge	f46a275b22	update docstring 2	2021-04-06 16:24:50 +02:00
Eren Gölge	ec94ff3691	update docstring	2021-04-06 16:24:50 +02:00
Eren Gölge	2048095e9a	audio.py fix	2021-04-06 16:24:50 +02:00
Eren Gölge	e0b3008c31	allow choosing the log function used for amptodb conversion	2021-04-06 16:24:50 +02:00
Eren Gölge	44b4cb5ba5	DCA comment	2021-04-06 16:24:50 +02:00
Eren Gölge	b86e7fb2e8	pad short samples when loading precomputed features in vocoder trainign	2021-04-06 16:24:50 +02:00
Eren Gölge	6ad4eba678	gan vocoder train fix in case of restoring models wiht no scheduler is defined	2021-04-06 16:24:50 +02:00
Eren Gölge	e3ccfe37ea	add DE more urls	2021-04-02 14:54:41 +02:00
Eren Gölge	e84f120a04	sam-accenture model preprocessor	2021-04-01 03:41:41 +02:00
Eren Gölge	e3c052382b	fix loading always best_model when continue	2021-04-01 03:41:15 +02:00
Eren Gölge	48ea20e69f	example aligntts config	2021-03-30 14:41:00 +02:00
Eren Gölge	b4c2cf80f2	fix eval iter	2021-03-30 14:39:16 +02:00
Eren Gölge	a3a840fd78	linter fixes	2021-03-30 14:39:16 +02:00
Eren Gölge	6b2e13bf62	compute normalized logp using torch primitives	2021-03-30 14:39:16 +02:00
Eren Gölge	7a382a5c2b	stowed aligntts commit and small refactoring with feed_forward layers	2021-03-30 14:39:16 +02:00
Eren Gölge	d542a50818	fix losses for alignTTS	2021-03-30 14:39:16 +02:00
Eren Gölge	18cc7b95ec	update l1 and huber to mse loss	2021-03-30 14:39:16 +02:00
Eren Gölge	896d33ed49	update losses to hande alingtts phases	2021-03-30 14:39:16 +02:00
Eren Gölge	aec0b78aff	duration predictor fix 2	2021-03-30 14:39:16 +02:00
Eren Gölge	07269e639b	fix duration predictor in AlignTTS	2021-03-30 14:39:16 +02:00
Eren Gölge	c2d29e5cd4	FFTransformer encoder for aligntts	2021-03-30 14:39:16 +02:00
Eren Gölge	460a2d3e26	FFTransformer Decoder for AlignTTS	2021-03-30 14:39:16 +02:00
Eren Gölge	844e8e0ed4	adapt align_tts and model name handling	2021-03-30 14:39:16 +02:00
Eren Gölge	aa29f5b199	aligntts loss	2021-03-30 14:39:16 +02:00
Eren Gölge	a831468cab	align tts MDN layer	2021-03-30 14:39:16 +02:00
Eren Gölge	4396f8e2da	continue refactoring	2021-03-30 14:39:16 +02:00
Eren Gölge	892c3c3623	use torch for AngleProtoLoss	2021-03-30 14:39:16 +02:00
Eren Gölge	2b3e12ea49	correct imports after refactoring, add AlignTTS (old SSMAS) and some formatting	2021-03-30 14:39:16 +02:00
Eren Gölge	ecb6b0d6ad	rename GlowTtts as GlowTTS	2021-03-30 14:39:16 +02:00
Eren Gölge	e8cf8cb00e	restructure TF tacotron files	2021-03-30 14:39:16 +02:00
Eren Gölge	1ac99ce0d0	if git is not available set git has 'unknown'	2021-03-30 14:39:16 +02:00
Eren Gölge	d9c405f0c3	create feedforward folder for SS layers	2021-03-30 14:39:16 +02:00
Eren Gölge	a8cf1ae6b4	fix wavenet running with no input mask	2021-03-30 14:39:16 +02:00
Eren Gölge	1c1949d348	utf-8 encoding for certain preprocessors	2021-03-30 14:39:16 +02:00
Eren Gölge	ca2f22cdd7	linter fix	2021-03-30 14:36:12 +02:00
Eren Gölge	d0dcd7d1b8	let the user define outpu.wav file path fix #393	2021-03-30 14:24:31 +02:00
Eren Gölge	25654233d5	[ci skip]initial commit for the new DE models and stale ot update	2021-03-29 03:23:57 +02:00
Guy Elsmore-Paddock	15459627cc	Fix `UnicodeEncodeError` on Windows Platforms Prevents the following error from appearing when running training on Windows platforms: ``` UnicodeEncodeError: 'charmap' codec can't encode characters in position: character maps to <undefined> ```	2021-03-20 17:30:00 -04:00
Eren Gölge	3947750dd9	Merge branch 'dev' of https://github.com/coqui-ai/TTS into dev	2021-03-18 14:09:47 +01:00
WeberJulian	4a9d2e4309	fix french_cleaners	2021-03-18 13:35:29 +01:00
WeberJulian	596ea2c98a	Add resample script	2021-03-18 13:33:37 +01:00
Eren Gölge	6e68637f48	bug fix	2021-03-18 13:33:23 +01:00
Eren Gölge	f3e5ddfaaf	bug fix in preprocessor	2021-03-18 13:33:23 +01:00
Eren Gölge	aeb4f82233	bug fix	2021-03-18 13:33:23 +01:00
Eren Gölge	0514330869	fix mozilla/TTS#685	2021-03-18 13:33:23 +01:00
Eren Gölge	f06603a0db	force utf8	2021-03-18 13:33:23 +01:00
Eren Gölge	32e8b56c45	linter fix	2021-03-18 13:33:23 +01:00
Eren Gölge	65533f33e9	fix #374	2021-03-18 13:33:00 +01:00
Eren Gölge	d790d2fccb	linter fix	2021-03-18 13:33:00 +01:00
WeberJulian	af96080e17	fix linter issues	2021-03-18 13:33:00 +01:00

1 2 3 4 5 ...

722 Commits