coqui-tts

Commit Graph

Author	SHA1	Message	Date
Edresson	1496f271dc	update Compute embeddings script	2021-05-27 00:45:18 -03:00
Edresson	bc5307caa0	add unit tests for SoftmaxAngleProtoLoss and ResnetSpeakerEncoder and bugfix	2021-05-26 20:35:58 -03:00
Edresson	c90037c2e9	solve merge problems	2021-05-26 16:01:30 -03:00
Katsuya Iida	f921a05bdb	Fixed lint errors	2021-05-26 19:02:16 +09:00
Edresson Casanova	f89cb6aec2	Merge branch 'dev' into dev	2021-05-25 17:30:25 -03:00
Edresson	d570c2d790	pylint fix and data loader bug fix	2021-05-26 01:11:37 -03:00
Katsuya Iida	0536aa6d0f	Japanese Tacotron 2 model	2021-05-22 17:12:19 +09:00
Eren Gölge	5482a0f62d	type def for gradual_training	2021-05-19 14:03:26 +02:00
Eren Gölge	df6a98d0c3	type def for gradual_training	2021-05-19 14:00:44 +02:00
Eren Gölge	16576d6408	bump version number	2021-05-19 12:35:10 +02:00
Eren Gölge	8a7c40736c	set use_phonemes false	2021-05-19 01:27:26 +02:00
Eren Gölge	ccfaa6b1d5	add `needs_phonemizer` field to models.json. If set true these models are only compatible with v0.0.13 or below.	2021-05-18 17:57:28 +02:00
Eren Gölge	a14fcf2a13	remove text_processing test	2021-05-18 17:57:28 +02:00
Eren Gölge	d7fae3f515	remove all espeaker and phonemizer deps	2021-05-18 17:57:28 +02:00
Eren Gölge	ced05e812a	move chinese phonemizer	2021-05-18 17:57:28 +02:00
Eren Gölge	218af1d9a2	change `list` to `List` in config	2021-05-18 17:30:27 +02:00
Eren Gölge	4df31f7fbd	unused_speakers argument for ignoring speaker ids in multi-speaker training	2021-05-18 14:50:03 +02:00
Eren Gölge	c2c7dff805	use relaxted coqpit parser	2021-05-18 14:49:47 +02:00
Edresson	856ea19758	bug fix in dataloader and update inference	2021-05-18 03:43:16 -03:00
Eren Gölge	d1b469935d	tacotron DDC LJSpeech recipe	2021-05-17 11:42:14 +02:00
Eren Gölge	34a42d379f	update tacotron_config.py for checking `r` and the docstring	2021-05-17 11:35:30 +02:00
Eren Gölge	12722501bb	styling	2021-05-15 23:48:31 +02:00
Eren Gölge	8b1014d188	add docstrings with default value fixes	2021-05-15 23:45:10 +02:00
Eren Gölge	da49089a72	update melgan training test batch size	2021-05-12 10:12:11 +02:00
Edresson	3433c2f348	add compute embedding for the new speaker encoder	2021-05-12 03:06:46 -03:00
Eren Gölge	0213e1cbf4	update configs for tts models to match the field typed with the expected values	2021-05-12 00:57:38 +02:00
Eren Gölge	715b0a65a0	update main.yml for python x64 fix test	2021-05-12 00:57:29 +02:00
Edresson	3fcc748b2e	implement the Speaker Encoder H/ASP	2021-05-11 16:27:05 -03:00
Eren Gölge	843d1b3d98	linter fixes	2021-05-11 11:30:00 +02:00
Eren Gölge	19fb1d743d	style update	2021-05-11 11:30:00 +02:00
Eren Gölge	6e980b49c4	fix synthesizer.py for Coqpit	2021-05-11 11:29:18 +02:00
Eren Gölge	db14dcd95a	remove old load_config	2021-05-11 11:29:18 +02:00
Eren Gölge	a21ac883dd	add get_cuda()	2021-05-11 11:29:18 +02:00
Eren Gölge	21dd4d7960	fix load_config imports for Coqpit	2021-05-11 11:29:18 +02:00
Eren Gölge	c57f0b46bb	reintro use_gst for backwars compat	2021-05-11 11:29:18 +02:00
Eren Gölge	18e76a2309	fix speaker encoder model initialization	2021-05-11 11:29:18 +02:00
Eren Gölge	10de40bba1	make num_workers mandatory config field	2021-05-11 11:29:18 +02:00
Eren Gölge	df1ddd3539	allow read_json_with_comments for backward compat	2021-05-11 11:29:18 +02:00
Eren Gölge	9f7599e3c3	fix train_encoder for coqpit	2021-05-11 11:29:18 +02:00
Eren Gölge	f8e52965dd	add speaker encoder coqpit	2021-05-11 11:29:18 +02:00
Eren Gölge	ce2bba543e	remove extra from utils and move funcs to io.py	2021-05-11 11:29:18 +02:00
Eren Gölge	812dbc2b06	rm config.json	2021-05-11 11:29:18 +02:00
Eren Gölge	3fde2001b1	train_encoder refactoring for coqpit	2021-05-11 11:29:18 +02:00
Eren Gölge	9ee70af9bb	code styling	2021-05-11 11:29:18 +02:00
Eren Gölge	10db2baa06	global shared Coqpit configs	2021-05-11 11:29:18 +02:00
Eren Gölge	3dec62b183	add Coqpits for the vocoder models	2021-05-11 11:29:18 +02:00
Eren Gölge	6f4eed94f5	remove *.json vocoder configs	2021-05-11 11:29:18 +02:00
Eren Gölge	78b3825d0b	update train scripts for coqpit	2021-05-11 11:29:18 +02:00
Eren Gölge	757e90b1cc	load_config function to initialize the right Coqpit for the given model	2021-05-11 11:29:18 +02:00
Eren Gölge	e6f45b9eb7	update train_vocoder_gan.py for coqpit	2021-05-11 11:29:18 +02:00
Eren Gölge	bcebd69d09	remove bash tts training tests	2021-05-11 11:29:17 +02:00
Eren Gölge	7663bc63c1	add Coqpit configs for the TTS models	2021-05-11 11:29:17 +02:00
Eren Gölge	7227e8f1d2	update train_align_tts.py for coqpit	2021-05-11 11:29:17 +02:00
Eren Gölge	51a7e06945	glow_tts_config.py and train test on python	2021-05-11 11:29:17 +02:00
Eren Gölge	720fe13056	update glow_tts modules and training script for coqpit use	2021-05-11 11:29:17 +02:00
Eren Gölge	816e7ee698	remove default configs.json as replacing with Coqpit configs	2021-05-11 11:29:17 +02:00
Eren Gölge	35341d5482	move bash script based tests to python with coqpit	2021-05-11 11:29:17 +02:00
Eren Gölge	647163397d	coqpit refactoring	2021-05-11 11:29:17 +02:00
Eren Gölge	eaa130e813	fix tacotron for coqpit	2021-05-11 11:29:17 +02:00
Eren Gölge	65d7ad4250	refactor train_speedy_speech.py for coqpit	2021-05-11 11:29:17 +02:00
Eren Gölge	4a58fdfd59	comment out check-arguments before copying fields to the configs	2021-05-11 11:29:17 +02:00
Eren Gölge	05d9543ed8	init GST module using gst config in Tacotron models	2021-05-11 11:29:17 +02:00
Eren Gölge	93a00373f6	move split_dataset	2021-05-11 11:29:17 +02:00
Eren Gölge	9c18e40f64	black formatting	2021-05-11 11:29:17 +02:00
Eren Gölge	c34c8137d7	update compute_statistics for coqpit	2021-05-11 11:29:17 +02:00
Eren Gölge	79d7215142	config refactor #5 WIP	2021-05-11 11:29:17 +02:00
Eren Gölge	dc50f5f0b0	config refactor #4 WIP	2021-05-11 11:28:35 +02:00
Eren Gölge	97bd5f9734	[ci skip] config update #3 WIP	2021-05-11 11:28:35 +02:00
Eren Gölge	a21c0b5585	config update 2 WIP	2021-05-11 11:28:35 +02:00
Eren Gölge	e092ae40dc	config update WIP	2021-05-11 11:28:35 +02:00
Eren Gölge	06f80a4806	update check argument	2021-05-11 11:28:35 +02:00
Eren Gölge	bf7ddfa542	Merge pull request #481 from chmodsss/main Accessing __version__ command	2021-05-11 10:20:48 +02:00
Edresson	85ccad7e0a	add Audio data augamentation Addtive and RIR	2021-05-11 00:59:57 -03:00
Edresson	77d85c6cc5	add softmaxproto loss and bug fix in data loader	2021-05-10 17:08:38 -03:00
chmodsss	607d5cf377	[#480 ] Adding version variable	2021-05-10 19:46:34 +02:00
Adam Froghyar	7ddc885f37	deleted a line the broke GravesAttention	2021-05-10 15:42:59 +02:00
Edresson	78bad25f2b	update voxceleb download link	2021-05-07 23:45:15 -03:00
Eren Gölge	f7582107da	Merge pull request #453 from Edresson/dev Script for spectrogram extraction using teacher forcing and Glow-TTS inference with MAS.	2021-05-06 17:53:28 +02:00
Edresson	501c8e0302	remove unused vars on extract tts spectrograms script	2021-05-04 19:04:13 -03:00
Eren Gölge	0325c58862	Merge pull request #468 from shaun95/patch-1 Update losses.py	2021-05-03 14:45:24 +02:00
Eren Gölge	8cb27267a4	formatting	2021-05-03 14:26:35 +02:00
Eren Gölge	87d674a038	bumpup librosa version to 0.8.0	2021-05-03 14:25:09 +02:00
shaun	7d0ec62bf1	Update losses.py The block of code for use_l1_spec_loss is repeated which doubles the amount of L1 loss when enabled. The weight for L1 loss in hifigan_ljspeech configutation will likely need to be doubled to compensate (l1_spec_loss_weight)	2021-05-02 14:14:24 +02:00
Edresson	3ecd556bbe	add unit test for extract tts spectrograms script	2021-05-01 13:41:56 -03:00
Edresson	446b1da936	create inference function	2021-04-29 18:18:37 -03:00
Eren Gölge	f02f0338c2	fix .models.json and add testing to check released models availability	2021-04-29 09:32:36 +02:00
Eren Gölge	fd95e9b8a4	[ci skip] Add sam models	2021-04-28 21:57:31 +02:00
Agrin Hilmkil	351d0ed6ae	Remove unnecessary fsspec usage	2021-04-28 11:21:08 +02:00
Agrin Hilmkil	167f86417e	Move dev, tf, notebook dependencies to extras	2021-04-28 11:20:06 +02:00
Eren Gölge	1235e54738	test for synthesize.py	2021-04-27 14:17:38 +02:00
Eren Gölge	4719414f2e	remove imports	2021-04-27 11:25:17 +02:00
Eren Gölge	add97cddc1	move function and remove import	2021-04-27 11:22:56 +02:00
Eren Gölge	734e6a515c	bug fix	2021-04-27 10:27:45 +02:00
Eren Gölge	6bdd81667e	place holders for sc-glow and hifigan models	2021-04-26 19:53:12 +02:00
Eren Gölge	2f0716073e	enable multi-speaker CoquiTTS models for synthesize.py	2021-04-26 19:36:53 +02:00
Eren Gölge	b531fa699c	remove conflicy noise	2021-04-26 15:27:52 +02:00
Eren Gölge	f37b488876	Merge branch 'speaker-manager' of https://github.com/coqui-ai/TTS into speaker-manager	2021-04-26 15:25:25 +02:00
Eren Gölge	b82daa5e86	style and linter fixes	2021-04-26 15:22:24 +02:00
Edresson	20e42a3381	add save audio option	2021-04-23 15:00:00 -03:00
Edresson	8228091f92	add script for extraction of tts spectrograms	2021-04-23 14:17:46 -03:00
Eren Gölge	4cf211348d	styling and linting	2021-04-23 18:04:37 +02:00
Eren Gölge	7eb0c60d2e	let synthesizer to pass speaker encoder file paths to speaker manager	2021-04-23 18:04:37 +02:00
Eren Gölge	f69195739e	let speaker manager compute mean x_vector from multiple wav files	2021-04-23 18:04:37 +02:00
Eren Gölge	179722e3a7	new arguments to synthesize.py for loading speaker encoder and speaker wavs	2021-04-23 18:04:37 +02:00
Eren Gölge	dfa415a8b8	small refactor in server.py	2021-04-23 18:04:37 +02:00
Eren Gölge	c80d21f311	load speaker_encoder_ap and compute x_vector directly from the input file in speaker manager	2021-04-23 18:04:37 +02:00
Eren Gölge	ad047c8195	html formatting, enable multi-speaker model on the server with a dropdown menu to select the speaker	2021-04-23 18:04:37 +02:00
Eren Gölge	f9f3d04d14	remove moved function	2021-04-23 18:04:37 +02:00
Eren Gölge	10c988ac8c	update server.py	2021-04-23 18:04:37 +02:00
Eren Gölge	6d0f5e0459	use SpeakerManager in Synthesizer	2021-04-23 18:04:37 +02:00
Eren Gölge	e97126314c	add ```unique``` argument to make_symbols to fix the incompat. issue of the SC-Glow models	2021-04-23 18:04:37 +02:00
Eren Gölge	d08888e603	formating speakers.py	2021-04-23 18:04:37 +02:00
Eren Gölge	df422223a3	initial SpeakerManager implementation	2021-04-23 18:04:37 +02:00
Eren Gölge	7a7aeb35f5	fix the glow-tts in setup_model	2021-04-23 18:04:37 +02:00
Eren Gölge	d42748082a	update argument name external_speaker_embedding_dim -> speaker_embedding_dim add inference_noise_scale argument to glow-tts	2021-04-23 18:04:37 +02:00
Eren Gölge	2da81f5bb6	add load_chekpoint to speaker encoder	2021-04-23 18:04:37 +02:00
Eren Gölge	1229ccbf07	update argument name in server.py	2021-04-23 18:04:37 +02:00
Eren Gölge	af2d36faeb	update synthesize.py for multi-speaker setting	2021-04-23 18:04:37 +02:00
Eren Gölge	99dc07a7dd	add ```unique``` param to keep scglow models compatible (they are duplicate symbols ins the character set)	2021-04-23 18:04:37 +02:00
Eren Gölge	c955a12428	set the default layer size compatible with scglow	2021-04-23 18:04:37 +02:00
Eren Gölge	3ace2440fa	fix a mistake from rebase	2021-04-23 18:04:37 +02:00
Eren Gölge	aadb2106ec	code styling	2021-04-23 18:04:37 +02:00
Eren Gölge	af7baa3387	refactoring to allow defining the speaker file externally	2021-04-23 18:04:37 +02:00
kirianguiller	7dccbfdcd5	handle multi speaker and gst in Synthetizer class	2021-04-23 18:04:37 +02:00
Edresson	d2b6326b8b	change optimizer initialization for compatibility with Hifi-GAN official implementation	2021-04-23 07:54:39 -03:00
WeberJulian	4205284f92	Change name of the functions	2021-04-23 10:09:55 +02:00
WeberJulian	a26498181b	Change back the default value	2021-04-22 16:10:17 +02:00
Julian Weber	355e1f47ab	fix dumb mistake	2021-04-22 15:50:29 +02:00
Julian Weber	c125b71f36	fix windows support	2021-04-22 15:14:24 +02:00
Jörg Thalheim	f5fd7f78d4	server: also listen to ipv6 The [::] address will listen to both ipv4/ipv6 addresses.	2021-04-22 12:38:55 +02:00
Eren Gölge	ef37633cb3	[ci skip] use prenet_dropout by default with Tacotron models	2021-04-22 12:38:55 +02:00
Eren Gölge	e1d960da9e	use SpeakerManager in Synthesizer	2021-04-21 13:13:27 +02:00
Eren Gölge	04b6881b66	add ```unique``` argument to make_symbols to fix the incompat. issue of the SC-Glow models	2021-04-21 13:12:35 +02:00
Eren Gölge	790946faec	formating speakers.py	2021-04-21 13:12:11 +02:00
Eren Gölge	ab313814de	initial SpeakerManager implementation	2021-04-21 13:11:46 +02:00
Eren Gölge	09890c7421	fix the glow-tts in setup_model	2021-04-21 13:10:40 +02:00
Eren Gölge	8764d02eb2	update argument name external_speaker_embedding_dim -> speaker_embedding_dim add inference_noise_scale argument to glow-tts	2021-04-21 13:09:44 +02:00
Eren Gölge	8b40720977	add load_chekpoint to speaker encoder	2021-04-21 13:09:04 +02:00
Eren Gölge	37cad38c27	update argument name in server.py	2021-04-21 13:08:45 +02:00
Eren Gölge	9bccee9da8	update synthesize.py for multi-speaker setting	2021-04-21 13:08:25 +02:00
Eren Gölge	d2fa8add1f	add ```unique``` param to keep scglow models compatible (they are duplicate symbols ins the character set)	2021-04-16 19:40:13 +02:00
Eren Gölge	d9612a4351	set the default layer size compatible with scglow	2021-04-16 19:40:13 +02:00
Eren Gölge	1038fd420d	fix a mistake from rebase	2021-04-16 19:39:47 +02:00
Eren Gölge	47e356cb48	code styling	2021-04-16 16:01:40 +02:00
Eren Gölge	25328aad00	refactoring to allow defining the speaker file externally	2021-04-16 15:59:57 +02:00
kirianguiller	48ae52a9a3	handle multi speaker and gst in Synthetizer class	2021-04-16 15:54:49 +02:00
Eren Gölge	a53958ae3a	fix urls for the new models	2021-04-15 17:05:00 +02:00
Eren Gölge	9cc17be53a	formatting and a small bug fix in Tacotron model	2021-04-15 16:36:51 +02:00
Eren Gölge	1ad838bc83	add newly released models under .model.json	2021-04-15 16:06:10 +02:00
Eren Gölge	7cada1a949	remove noise	2021-04-15 15:30:45 +02:00
Eren Gölge	d60a8d7211	show the real waveform on TB too for GAN vocoder training.	2021-04-15 15:30:06 +02:00
Eren Gölge	5fbe926429	change the default TTS model to TacotronDDC	2021-04-15 15:29:44 +02:00
Eren Gölge	3de5a89154	optionally enable prenet dropout at inference time for tacotron models	2021-04-13 13:24:56 +02:00
Eren Gölge	28a2fed8a3	update hifigan in .model.json	2021-04-12 16:48:05 +02:00
Eren Gölge	abaf36861a	aligntts model .model.json placeholder	2021-04-12 16:43:52 +02:00
Eren Gölge	480e2f7888	docstring update and better handling make_symbols	2021-04-12 16:40:49 +02:00
Eren Gölge	b735076bb4	linter fixes	2021-04-12 13:14:11 +02:00
Eren Gölge	b11d1cb845	small fixes	2021-04-12 12:40:55 +02:00
Eren Gölge	a7f6045644	Merge branch 'reformat' into hifigan-reformat	2021-04-12 12:00:17 +02:00
Eren Gölge	f519012dea	reformatting and styling	2021-04-12 11:47:39 +02:00
Eren Gölge	9011dddf77	tacotron DDC placeholder in models.json	2021-04-12 04:06:27 +02:00
Eren Gölge	d295d5de97	remove torch.no_grad from TorchSTFT	2021-04-10 19:43:57 +02:00
Eren Gölge	5b70da2e3f	restore schedulers only if training is continuing a previous training inherit nn.Module for TorchSTFT	2021-04-09 19:31:28 +02:00
Eren Gölge	2c71c6d8cd	[ci skip]update gan vocoder configs to reflect the recent changes	2021-04-09 17:15:32 +02:00
Eren Gölge	2b529f60c8	update default hifigan config	2021-04-09 11:40:06 +02:00
Eren Gölge	105e0b4d62	vocoder gan training fixes	2021-04-09 11:38:04 +02:00
Eren Gölge	87ee6ceb57	style update #3	2021-04-09 01:17:15 +02:00
Eren Gölge	18d9ec8036	format with black	2021-04-09 00:54:59 +02:00
Eren Gölge	e5b9607bc3	isort all imports	2021-04-09 00:45:20 +02:00
Eren Gölge	0e79fa86ad	format with black and pylint 2.7.3	2021-04-09 00:38:08 +02:00
Eren Gölge	cd69da4868	linter fixes #2	2021-04-08 16:57:46 +02:00
Eren Gölge	4d3e1e9d9a	linter fix	2021-04-08 14:57:46 +02:00
Eren Gölge	53f54898bc	small fixes	2021-04-08 14:22:47 +02:00
Eren Gölge	006b1d3aaa	bug fix	2021-04-08 13:17:45 +02:00
Eren Gölge	3f0993aebe	remove junk	2021-04-08 12:17:02 +02:00
Eren Gölge	0ee0458309	remove redundant imports	2021-04-08 11:29:15 +02:00
Eren Gölge	773f1db6fa	refactor HifiGAN discriminator	2021-04-08 11:28:30 +02:00
Eren Gölge	15f362d5b1	formatting	2021-04-08 11:28:30 +02:00
Eren Gölge	aee24b0704	set different seed in gan_dataset when it is multi-workers	2021-04-08 11:28:30 +02:00
Eren Gölge	6ee211c137	remove stft params causing warning	2021-04-08 11:28:30 +02:00
Eren Gölge	4998ece8d8	allow configuration of optimziers from the config file	2021-04-08 11:28:30 +02:00
Eren Gölge	8daf407652	cache empty	2021-04-08 11:28:30 +02:00
Eren Gölge	3fb78c004a	move scheduler updates to the end of the epoch	2021-04-08 11:28:30 +02:00
Eren Gölge	2a872c98aa	don't call os.exit as it leaves the process resources standing	2021-04-08 11:27:40 +02:00
Eren Gölge	7cecd2fb2e	add hifigan D	2021-04-08 11:27:40 +02:00
Eren Gölge	13dca6e6b6	revert some of Hifigan generator updates	2021-04-08 11:27:40 +02:00
Eren Gölge	02bc776c35	prevenet grad in TorchSTFT	2021-04-08 11:27:40 +02:00
Eren Gölge	cf44624df8	more docstring	2021-04-08 11:27:40 +02:00
Eren Gölge	d95b1458e8	Linter fixes and docstrings for HiFiGAN	2021-04-08 11:27:40 +02:00
Eren Gölge	bd7a1c177b	fix #419	2021-04-08 11:26:41 +02:00
Eren Gölge	7726dfca99	change the upper bound in sound normalization	2021-04-08 11:26:01 +02:00
Eren Gölge	57f6bd1afa	make using different samples for G and D networks optional	2021-04-08 11:26:01 +02:00
Eren Gölge	67f8248492	placeholder for finetuned sam hifigan model	2021-04-08 11:25:29 +02:00
Eren Gölge	241e968df1	load_checkpoint for hifigan and no_grad for inference	2021-04-08 11:25:29 +02:00
Eren Gölge	de3a04f104	some commeting for Generator loss and check if the argument is defines in the config file	2021-04-08 11:25:29 +02:00
Eren Gölge	ff07c5f5e3	update TorchSTFT to enable melspec	2021-04-08 11:25:29 +02:00
Eren Gölge	4a5b1d4ac2	update hifigan config	2021-04-08 11:24:21 +02:00
Eren Gölge	e0e3b12b26	pass all parameters explicity to _istft	2021-04-08 11:23:20 +02:00
Eren Gölge	f0e76ee135	initial models.json entry for universal hifigan	2021-04-08 11:23:20 +02:00
Eren Gölge	d57f416957	small fixes	2021-04-08 11:22:30 +02:00
Eren Gölge	8c9e1c9e58	hifigan implementation update	2021-04-08 11:21:43 +02:00
Eren Gölge	a14d7bc5db	hifigan config update	2021-04-08 11:20:33 +02:00
Eren Gölge	8d4fd79cd7	update hifigan config	2021-04-08 11:20:33 +02:00
rishikksh20	e656e8b108	Remove select size bug	2021-04-08 11:20:33 +02:00
rishikksh20	b533474e3b	Remove minor bugs and make code trainable	2021-04-08 11:20:33 +02:00
rishikksh20	ef6ff4e95c	Add Exponential LR scheduler check	2021-04-08 11:20:33 +02:00
rishikksh20	1535777f64	1) Add ExponentialLR	2021-04-08 11:18:36 +02:00
rishikksh20	c20a6b1185	* Format the model definition * Update code and integrate training code	2021-04-08 11:18:36 +02:00
rishikksh20	39b5845810	1) Add hifigan json files 2) Rename MPD disc 3) Re-format remove weight norm generator	2021-04-08 11:14:39 +02:00
rishikksh20	7b7c5d635f	1) Combine MSD with Multi-Period disc 2) Add remove weight norm layer on Generator	2021-04-08 11:14:39 +02:00
rishikksh20	4493feb95c	Add HiFi-GAN v1 generator and discriminator classes	2021-04-08 11:14:39 +02:00
Eren Gölge	c86c559349	docstring and optional padding in TorchSTFT	2021-04-07 12:36:15 +02:00
Eren Gölge	f890454de3	linter fixes	2021-04-07 12:36:03 +02:00
Eren Gölge	9782d9ea5d	[ci skip] implement #418	2021-04-06 16:24:50 +02:00
Eren Gölge	f46a275b22	update docstring 2	2021-04-06 16:24:50 +02:00
Eren Gölge	ec94ff3691	update docstring	2021-04-06 16:24:50 +02:00
Eren Gölge	2048095e9a	audio.py fix	2021-04-06 16:24:50 +02:00
Eren Gölge	e0b3008c31	allow choosing the log function used for amptodb conversion	2021-04-06 16:24:50 +02:00
Eren Gölge	44b4cb5ba5	DCA comment	2021-04-06 16:24:50 +02:00
Eren Gölge	b86e7fb2e8	pad short samples when loading precomputed features in vocoder trainign	2021-04-06 16:24:50 +02:00
Eren Gölge	6ad4eba678	gan vocoder train fix in case of restoring models wiht no scheduler is defined	2021-04-06 16:24:50 +02:00
Eren Gölge	e3ccfe37ea	add DE more urls	2021-04-02 14:54:41 +02:00
Eren Gölge	e84f120a04	sam-accenture model preprocessor	2021-04-01 03:41:41 +02:00
Eren Gölge	e3c052382b	fix loading always best_model when continue	2021-04-01 03:41:15 +02:00
Eren Gölge	48ea20e69f	example aligntts config	2021-03-30 14:41:00 +02:00
Eren Gölge	b4c2cf80f2	fix eval iter	2021-03-30 14:39:16 +02:00
Eren Gölge	a3a840fd78	linter fixes	2021-03-30 14:39:16 +02:00
Eren Gölge	6b2e13bf62	compute normalized logp using torch primitives	2021-03-30 14:39:16 +02:00
Eren Gölge	7a382a5c2b	stowed aligntts commit and small refactoring with feed_forward layers	2021-03-30 14:39:16 +02:00
Eren Gölge	d542a50818	fix losses for alignTTS	2021-03-30 14:39:16 +02:00
Eren Gölge	18cc7b95ec	update l1 and huber to mse loss	2021-03-30 14:39:16 +02:00
Eren Gölge	896d33ed49	update losses to hande alingtts phases	2021-03-30 14:39:16 +02:00
Eren Gölge	aec0b78aff	duration predictor fix 2	2021-03-30 14:39:16 +02:00
Eren Gölge	07269e639b	fix duration predictor in AlignTTS	2021-03-30 14:39:16 +02:00
Eren Gölge	c2d29e5cd4	FFTransformer encoder for aligntts	2021-03-30 14:39:16 +02:00
Eren Gölge	460a2d3e26	FFTransformer Decoder for AlignTTS	2021-03-30 14:39:16 +02:00
Eren Gölge	844e8e0ed4	adapt align_tts and model name handling	2021-03-30 14:39:16 +02:00
Eren Gölge	aa29f5b199	aligntts loss	2021-03-30 14:39:16 +02:00
Eren Gölge	a831468cab	align tts MDN layer	2021-03-30 14:39:16 +02:00
Eren Gölge	4396f8e2da	continue refactoring	2021-03-30 14:39:16 +02:00
Eren Gölge	892c3c3623	use torch for AngleProtoLoss	2021-03-30 14:39:16 +02:00
Eren Gölge	2b3e12ea49	correct imports after refactoring, add AlignTTS (old SSMAS) and some formatting	2021-03-30 14:39:16 +02:00
Eren Gölge	ecb6b0d6ad	rename GlowTtts as GlowTTS	2021-03-30 14:39:16 +02:00
Eren Gölge	e8cf8cb00e	restructure TF tacotron files	2021-03-30 14:39:16 +02:00
Eren Gölge	1ac99ce0d0	if git is not available set git has 'unknown'	2021-03-30 14:39:16 +02:00
Eren Gölge	d9c405f0c3	create feedforward folder for SS layers	2021-03-30 14:39:16 +02:00
Eren Gölge	a8cf1ae6b4	fix wavenet running with no input mask	2021-03-30 14:39:16 +02:00
Eren Gölge	1c1949d348	utf-8 encoding for certain preprocessors	2021-03-30 14:39:16 +02:00
Eren Gölge	ca2f22cdd7	linter fix	2021-03-30 14:36:12 +02:00
Eren Gölge	d0dcd7d1b8	let the user define outpu.wav file path fix #393	2021-03-30 14:24:31 +02:00
Eren Gölge	25654233d5	[ci skip]initial commit for the new DE models and stale ot update	2021-03-29 03:23:57 +02:00
Guy Elsmore-Paddock	15459627cc	Fix `UnicodeEncodeError` on Windows Platforms Prevents the following error from appearing when running training on Windows platforms: ``` UnicodeEncodeError: 'charmap' codec can't encode characters in position: character maps to <undefined> ```	2021-03-20 17:30:00 -04:00
Eren Gölge	3947750dd9	Merge branch 'dev' of https://github.com/coqui-ai/TTS into dev	2021-03-18 14:09:47 +01:00
WeberJulian	4a9d2e4309	fix french_cleaners	2021-03-18 13:35:29 +01:00
WeberJulian	596ea2c98a	Add resample script	2021-03-18 13:33:37 +01:00
Eren Gölge	6e68637f48	bug fix	2021-03-18 13:33:23 +01:00
Eren Gölge	f3e5ddfaaf	bug fix in preprocessor	2021-03-18 13:33:23 +01:00
Eren Gölge	aeb4f82233	bug fix	2021-03-18 13:33:23 +01:00
Eren Gölge	0514330869	fix mozilla/TTS#685	2021-03-18 13:33:23 +01:00
Eren Gölge	f06603a0db	force utf8	2021-03-18 13:33:23 +01:00
Eren Gölge	32e8b56c45	linter fix	2021-03-18 13:33:23 +01:00
Eren Gölge	65533f33e9	fix #374	2021-03-18 13:33:00 +01:00
Eren Gölge	d790d2fccb	linter fix	2021-03-18 13:33:00 +01:00
WeberJulian	af96080e17	fix linter issues	2021-03-18 13:33:00 +01:00
WeberJulian	bf04383e74	fix french_cleaners	2021-03-18 13:33:00 +01:00
WeberJulian	f6cd8e0ecc	test case	2021-03-18 13:33:00 +01:00
WeberJulian	e954e45e57	linter + test	2021-03-18 13:33:00 +01:00
WeberJulian	e598977f3d	Using path.join instead of concat	2021-03-18 13:33:00 +01:00
WeberJulian	c5ef2de73f	Add resample script	2021-03-18 13:33:00 +01:00
Eren Gölge	2690ab2ee5	bug fix	2021-03-16 19:15:28 +01:00
Eren Gölge	4c1aed4a9c	bug fix in preprocessor	2021-03-16 19:13:32 +01:00
Eren Gölge	01e35e06c4	bug fix	2021-03-16 19:13:32 +01:00
Eren Gölge	aa8bb815a7	fix mozilla/TTS#685	2021-03-16 19:13:32 +01:00
Eren Gölge	a8c348ffb2	force utf8	2021-03-16 19:13:32 +01:00
Eren Gölge	bf0caba0bc	linter fix	2021-03-16 19:13:32 +01:00
Eren Gölge	babc94f63f	fix #374	2021-03-16 19:13:32 +01:00
Eren Gölge	bdfd1f8a89	linter fix	2021-03-16 19:13:32 +01:00
WeberJulian	11e25a7125	fix linter issues	2021-03-16 19:13:01 +01:00
WeberJulian	1574d8dd39	fix french_cleaners	2021-03-16 19:13:01 +01:00
WeberJulian	b94373afb8	test case	2021-03-16 19:13:01 +01:00
WeberJulian	93fdc0729c	linter + test	2021-03-16 19:13:01 +01:00
WeberJulian	17f197f51e	Using path.join instead of concat	2021-03-16 19:13:01 +01:00
WeberJulian	d6749f030f	Add resample script	2021-03-16 19:13:01 +01:00
Eren Gölge	838ebd6ad5	add the missing russian model	2021-03-16 18:38:35 +01:00
Eren Gölge	5c657715f2	fix #382	2021-03-16 17:31:48 +01:00
Eren Gölge	38a29ce1c9	move all models to github rls	2021-03-10 18:19:32 +01:00
Eren Gölge	e5bb317242	fix model manager	2021-03-10 17:01:19 +01:00
Eren Gölge	d260fb03a2	fix handling scale_stats.npy for models downloaded from Github rls	2021-03-10 16:40:30 +01:00
Eren Gölge	4aba4e5b1e	linter fx	2021-03-10 15:33:11 +01:00
Eren Gölge	6c932c8503	print the desc if required parameters are not provided	2021-03-10 15:19:00 +01:00
Eren Gölge	9e84c8a623	do not copy scale_stats if exist in the output folder	2021-03-10 15:13:55 +01:00
Eren Gölge	7782034e7e	fix #369	2021-03-10 15:13:21 +01:00
Eren Gölge	4337e9ff87	pad_mode in torch_stft	2021-03-10 14:41:00 +01:00
Eren Gölge	599149a7e5	downloading models from github releases	2021-03-10 11:09:01 +01:00
Eren Gölge	fc19411ac6	update some of the models to github releases	2021-03-10 11:08:15 +01:00
Eren Gölge	19bb9ba851	fix tts endpoint using list-models argument	2021-03-09 14:06:09 +01:00
Eren Gölge	43379eecef	fix the nl model and add the vocoder	2021-03-09 14:05:56 +01:00
r-dh	8a4dcd152f	Add Dutch model	2021-03-09 13:22:19 +01:00
Eren Gölge	94805236fb	Merge branch 'dev' of https://github.com/coqui-ai/TTS into dev	2021-03-08 15:21:06 +01:00
Eren Gölge	5dcc4be560	rebrand demo server	2021-03-08 14:51:04 +01:00
Eren Gölge	947e3d6a93	rename test	2021-03-08 14:50:54 +01:00
Eren Gölge	a519ed52f2	deprecate embedding models to the wheel	2021-03-08 14:06:15 +01:00
Eren Gölge	c16ad38930	update server rEADME	2021-03-08 14:05:59 +01:00
Eren Gölge	594d8d8f09	linter fixes	2021-03-08 11:22:59 +01:00
Eren Gölge	00b5090974	linter fix	2021-03-08 11:05:30 +01:00
Eren Gölge	e15734c3fc	linter fix	2021-03-08 05:29:43 +01:00
Eren Gölge	9a48ba3821	a ton of linter updates	2021-03-08 05:06:54 +01:00
Eren Gölge	e03a426378	bug fix	2021-03-08 02:59:48 +01:00
kirianguiller	628afe5cb0	remove gst handling in synthetizer.py class	2021-03-08 02:59:48 +01:00
kirianguiller	557239db7f	remove re.Match typing in '_number_replace()'	2021-03-08 02:59:48 +01:00
kirianguiller	9ab07f94e2	modify according to PR reviews	2021-03-08 02:59:48 +01:00
kirianguiller	42ba30eb8f	<add> Chinese mandarin implementation (tacotron2)	2021-03-08 02:59:24 +01:00
kirianguiller	49665783a6	remove gst handling in synthetizer.py class	2021-03-08 02:57:11 +01:00
kirianguiller	e85658ac2b	remove re.Match typing in '_number_replace()'	2021-03-08 02:57:11 +01:00
kirianguiller	0d4525322c	modify according to PR reviews	2021-03-08 02:57:11 +01:00
kirianguiller	e6fd118cf8	<add> Chinese mandarin implementation (tacotron2)	2021-03-08 02:57:11 +01:00
Eren Gölge	e3102e753c	enable backward compat for loading the best model	2021-03-08 02:57:11 +01:00
gerazov	2451a813a2	refactored keep_all_best	2021-03-08 02:57:11 +01:00
gerazov	8cefa76bae	reformated docstrings in arguments.py	2021-03-08 02:57:11 +01:00
gerazov	2db40457e8	brushed up printing model load path and best loss path	2021-03-08 02:56:36 +01:00
gerazov	f2e474cd37	loading last checkpoint/best_model works, deleting last best models options added, loading last best_loss added	2021-03-08 02:56:36 +01:00
Eren Gölge	4111df6769	Docstrings for audioprocessor	2021-03-08 02:54:47 +01:00
Eren Gölge	2ca74b8ab3	add RUSLAN dataset preprocessor	2021-03-08 02:54:47 +01:00
Eren Gölge	8993120634	do not test server and modelManager until fixing #657	2021-03-08 02:54:47 +01:00
Adonis Pujols	89b7f01534	add encoding="utf-8"	2021-03-08 02:54:47 +01:00
Eren Gölge	ffceccb021	fix #655	2021-03-08 02:54:47 +01:00
Eren Gölge	534c341f16	linter update	2021-03-08 02:54:47 +01:00
Eren Gölge	0e1e60bef0	remove redundancy	2021-03-08 02:54:47 +01:00
Eren Gölge	93a83c0068	Update TTS/utils/arguments.py Co-authored-by: Jörg Thalheim <Mic92@users.noreply.github.com>	2021-03-08 02:54:47 +01:00
Eren Gölge	39fbf2fe84	Update TTS/bin/find_unique_chars.py Co-authored-by: Jörg Thalheim <Mic92@users.noreply.github.com>	2021-03-08 02:54:47 +01:00
Eren Gölge	ee71eb4eb7	linter fixes	2021-03-08 02:54:47 +01:00
Eren Gölge	55fc50b26d	update test_text_processing for espeak-ng	2021-03-08 02:54:47 +01:00
Eren Gölge	5b8a6736a7	remove _phoneme_punctuations	2021-03-08 02:54:47 +01:00
Eren Gölge	194f82de51	save default model chars to the training config file	2021-03-08 02:54:47 +01:00
Eren Gölge	62a8eba3b2	parse_characters function	2021-03-08 02:54:47 +01:00
Eren Gölge	0b33acdcca	enable saving model characters in io.py	2021-03-08 02:54:47 +01:00
Eren Gölge	f9fe167537	docstring update	2021-03-08 02:54:47 +01:00
Eren Gölge	62aeacbdd1	save used model characters to the checkpoints	2021-03-08 02:54:47 +01:00
Eren Gölge	e06c93fe81	model_manager tests	2021-03-08 02:54:47 +01:00
Eren Gölge	fe41084eb3	author , license and contact info in .model.json	2021-03-08 02:54:47 +01:00
nmstoker	ae0d54ddae	Updating models list to include EK1 TTS/vocoder	2021-03-08 02:54:47 +01:00
Eren Gölge	c6702b5b9f	find unique characters in a dataset	2021-03-08 02:54:47 +01:00
Eren Gölge	dad3565379	use default vocoders in server.pu	2021-03-08 02:54:47 +01:00
Eren Gölge	d30608ab17	set an output_sample_rate in synthesizer and use it for writing the wav file	2021-03-08 02:54:47 +01:00
Eren Gölge	3ccb015cd8	return the json entry of the downloaded model	2021-03-08 02:54:47 +01:00
Eren Gölge	00e0933f43	save_wav with a custom sampling rate	2021-03-08 02:54:47 +01:00
Eren Gölge	9fefc79f0c	fix make_symbols	2021-03-08 02:54:47 +01:00
Eren Gölge	8955333e9d	use default vocoder in synthesize.py	2021-03-08 02:54:47 +01:00
Eren Gölge	23b282f600	define default vocoders	2021-03-08 02:54:47 +01:00
Eren Gölge	6bd8485d10	bug fix	2021-03-08 02:54:47 +01:00
Eren Gölge	5f1018abee	fix spelling of a def argument and parse phonemes from config.json if use_phonemes is True	2021-03-08 02:54:47 +01:00
Eren Gölge	1c1abb8a9b	docstring update	2021-03-08 02:54:47 +01:00
Eren Gölge	6cd642c2e1	add missing phonemes to test_config.json	2021-03-08 02:54:47 +01:00
Eren Gölge	43b951018e	fix the default vocoder name	2021-03-08 02:54:47 +01:00
Adonis Pujols	81b145c321	spelling error. should be multiband not mulitband	2021-03-08 02:54:47 +01:00
Adonis Pujols	59b1b13e07	spelling error. should be multiband not mulitband	2021-03-08 02:54:47 +01:00
Eren Gölge	ee58ff2d38	add russian phoneme char	2021-03-08 02:54:47 +01:00
Eren Gölge	29d928d531	css10 dataset preprocessor	2021-03-08 02:54:47 +01:00
Eren Gölge	49771f2541	download github model releases by model manager	2021-03-08 02:54:21 +01:00
Eren Gölge	3c961370e7	linter fixes	2021-03-08 02:54:21 +01:00
gerazov	2b5cb24db7	final final fixes	2021-03-08 02:54:21 +01:00
gerazov	b3c5cc2cdc	final fixes	2021-03-08 02:54:21 +01:00
gerazov	10d5a63d49	updated to current dev	2021-03-08 02:54:21 +01:00
gerazov	6f06e31541	changed train scripts	2021-03-08 02:54:21 +01:00
gerazov	2daca15802	restructured arg parsing and processing to utils	2021-03-08 02:54:21 +01:00
Eren Gölge	2fbe4a1b8a	fix gdown	2021-03-08 02:54:21 +01:00
Branislav Gerazov	ed56944c4a	improve robustness of defining wavernn in config file	2021-03-08 02:54:21 +01:00
Branislav Gerazov	5e2bc8c99f	update wavernn test config, delete cap=True	2021-03-08 02:54:21 +01:00
Branislav Gerazov	b1e3160884	waveRNN fix	2021-03-08 02:54:21 +01:00
Eren Gölge	08581deb61	linter updates	2021-03-08 02:53:02 +01:00
Thorsten Mueller	167901813d	Ups. Added missing ,	2021-03-08 02:53:02 +01:00
Eren Gölge	93a6bdfd6c	linter fixes and version updates for deps	2021-03-08 02:51:10 +01:00
Eren Gölge	a30a231566	unpin cython version and commentout pyworld in audio.py causing dep issues	2021-03-08 02:50:15 +01:00
Thorsten Mueller	3eb00e8d93	Set out_path to be required param.	2021-03-08 02:49:15 +01:00
Alexander Korolev	ace430d5e6	fix device mismatch wavegrad training this should fixe the device mismatch as seen here https://github.com/mozilla/TTS/issues/622#issue-789802916	2021-03-08 02:49:15 +01:00
Eren Gölge	83143fbe39	fix #638	2021-03-08 02:48:31 +01:00
Eren Gölge	30c3bef3f9	move hubconf	2021-03-08 02:48:31 +01:00
Eren Gölge	bbea6a0884	hubconf.py and load .models.json from the defualt location by mange.py	2021-03-08 02:48:31 +01:00
Eren Gölge	90d4f08d6c	reorder imports	2021-03-08 02:48:31 +01:00
Eren Gölge	db231c83fc	distill import statement, check python version in setup.py	2021-03-08 02:48:31 +01:00
Thorsten Mueller	915ec1faac	Added info if model already downloaded in --list_models	2021-03-08 02:48:31 +01:00
Alexander Korolev	b4bc5f6eb1	update fixed stopnet_pos_weight parameter config parameter c.stopnet_pos_weight has currently no effect as it is not used.	2021-03-08 02:48:31 +01:00
Eren Gölge	534e3c67c6	README update, set default models for synthesize.py and server.py. Disable verbose for ap init.	2021-03-08 02:48:31 +01:00
kirianguiller	7f36d91131	update chinese model	2021-03-01 14:55:05 +01:00
Eren Gölge	547bfc4ce9	bug fix	2021-02-18 18:24:03 +00:00
Eren Gölge	adaeec57ec	Merge branch 'dev' of https://github.com/mozilla/TTS into dev	2021-02-18 17:21:09 +00:00
Eren Gölge	5b70c8ba4f	enable backward compat for loading the best model	2021-02-18 17:20:36 +00:00
Eren Gölge	e4f81d6856	Merge pull request #654 from kirianguiller/chinese-implementation Chinese implementation (merge into dev)	2021-02-18 17:15:32 +01:00
kirianguiller	22a6bbfa80	remove gst handling in synthetizer.py class	2021-02-17 20:53:56 +01:00
kirianguiller	3911b87e54	remove re.Match typing in '_number_replace()'	2021-02-17 20:53:56 +01:00
kirianguiller	fb0655d1e7	modify according to PR reviews	2021-02-17 20:53:56 +01:00
kirianguiller	c4c7bc1b88	<add> Chinese mandarin implementation (tacotron2)	2021-02-17 20:53:56 +01:00
Eren Gölge	d0454461de	Merge branch 'pr/gerazov/650-2' into dev	2021-02-17 13:40:45 +00:00
Eren Gölge	a8ea0ea6ce	Docstrings for audioprocessor	2021-02-17 13:35:41 +00:00
Eren Gölge	f6e6314910	add RUSLAN dataset preprocessor	2021-02-17 13:35:23 +00:00
Eren Gölge	ce0c5eccbd	do not test server and modelManager until fixing #657	2021-02-17 00:35:43 +00:00
gerazov	61c88beb94	refactored keep_all_best	2021-02-15 18:40:17 +01:00
Eren Gölge	eb543c027e	Merge branch 'dev' of https://github.com/mozilla/TTS into dev	2021-02-15 17:06:40 +00:00
Eren Gölge	8a106e0527	fix #655	2021-02-15 17:06:03 +00:00
Eren Gölge	216945e653	Merge pull request #647 from adonispujols/patch-1 Easy Fix for #454 (which was somehow deleted?)	2021-02-15 13:17:17 +01:00
Eren Gölge	06a3ba2fe2	linter update	2021-02-15 12:10:19 +00:00
Eren Gölge	7f58fa365b	Merge branch 'save_characters' into dev	2021-02-15 12:07:28 +00:00
Eren Gölge	ff218e2370	remove redundancy	2021-02-15 12:07:02 +00:00
Eren Gölge	80af8ca5e1	Update TTS/utils/arguments.py Co-authored-by: Jörg Thalheim <Mic92@users.noreply.github.com>	2021-02-15 13:03:59 +01:00
Eren Gölge	3b6ce04332	Update TTS/bin/find_unique_chars.py Co-authored-by: Jörg Thalheim <Mic92@users.noreply.github.com>	2021-02-15 13:02:29 +01:00
Eren Gölge	dc3596dad4	model_manager tests	2021-02-15 11:29:22 +00:00
Eren Gölge	77e630348e	author , license and contact info in .model.json	2021-02-15 11:02:21 +00:00
Eren Gölge	e1bc823e44	Merge branch 'pr/nmstoker/652' into dev	2021-02-15 10:57:12 +00:00
nmstoker	33bcdc6ff8	Updating models list to include EK1 TTS/vocoder	2021-02-14 23:44:05 +00:00
Eren Gölge	420901f4c2	linter fixes	2021-02-12 14:41:17 +00:00
Eren Gölge	4244096ccb	update test_text_processing for espeak-ng	2021-02-12 14:07:26 +00:00
Eren Gölge	b28c724c04	remove _phoneme_punctuations	2021-02-12 12:10:57 +00:00
Eren Gölge	7ab527d17e	save default model chars to the training config file	2021-02-12 12:06:46 +00:00
Eren Gölge	593cedee14	parse_characters function	2021-02-12 12:05:56 +00:00
Eren Gölge	2abfff17f9	enable saving model characters in io.py	2021-02-12 12:04:41 +00:00
Eren Gölge	918f007a11	docstring update	2021-02-12 12:04:07 +00:00
Eren Gölge	e774f68aee	save used model characters to the checkpoints	2021-02-12 12:03:42 +00:00
gerazov	0e78e31dbf	reformated docstrings in arguments.py	2021-02-12 11:36:01 +01:00
gerazov	310d18325e	brushed up printing model load path and best loss path	2021-02-12 10:55:45 +01:00
Eren Gölge	8b6fd76ad2	find unique characters in a dataset	2021-02-12 09:46:11 +00:00
gerazov	af46727517	loading last checkpoint/best_model works, deleting last best models options added, loading last best_loss added	2021-02-12 02:12:00 +01:00
Eren Gölge	a1e595790d	use default vocoders in server.pu	2021-02-11 15:31:39 +00:00
Eren Gölge	8aa6a0decb	set an output_sample_rate in synthesizer and use it for writing the wav file	2021-02-11 15:28:07 +00:00
Eren Gölge	0c52d27d65	return the json entry of the downloaded model	2021-02-11 15:27:41 +00:00
Eren Gölge	1649ad3431	save_wav with a custom sampling rate	2021-02-11 15:27:20 +00:00
Eren Gölge	43f54d2dce	fix make_symbols	2021-02-11 15:26:52 +00:00
Eren Gölge	0657b38111	use default vocoder in synthesize.py	2021-02-11 15:26:17 +00:00
Eren Gölge	2043a9b5f5	define default vocoders	2021-02-11 15:25:55 +00:00
Eren Gölge	ff27690ca7	bug fix	2021-02-11 13:43:29 +00:00
Eren Gölge	bc131208be	fix spelling of a def argument and parse phonemes from config.json if use_phonemes is True	2021-02-11 13:04:47 +00:00
Eren Gölge	f1799dbd60	docstring update	2021-02-11 11:25:31 +00:00
Eren Gölge	3baec4ea96	add missing phonemes to test_config.json	2021-02-11 11:14:39 +00:00
Eren Gölge	a3d1e65b34	Merge branch 'pr/adonispujols/646' into dev	2021-02-11 10:37:29 +00:00
Eren Gölge	3c2e13ca5c	fix the default vocoder name	2021-02-11 10:36:52 +00:00
Adonis Pujols	48011a8b58	add encoding="utf-8"	2021-02-11 05:26:06 -05:00
Adonis Pujols	b29a7e9645	spelling error. should be multiband not mulitband	2021-02-11 04:49:28 -05:00
Adonis Pujols	6c824a6629	spelling error. should be multiband not mulitband	2021-02-11 04:48:53 -05:00
Eren Gölge	b08b8ca2a1	add russian phoneme char	2021-02-10 13:30:59 +00:00
Eren Gölge	9cad435288	css10 dataset preprocessor	2021-02-09 15:11:26 +00:00
Eren Gölge	cea5e517f2	download github model releases by model manager	2021-02-09 14:24:14 +00:00
Eren Gölge	c619859a3f	linter fixes	2021-02-09 11:43:17 +00:00
gerazov	e507373b55	final final fixes	2021-02-06 23:08:47 +01:00
gerazov	ad17dc9e76	final fixes	2021-02-06 23:05:01 +01:00
gerazov	8fdd08ea15	updated to current dev	2021-02-06 22:59:52 +01:00
gerazov	2705d27b28	changed train scripts	2021-02-06 22:29:30 +01:00
gerazov	4f8f274d6e	restructured arg parsing and processing to utils	2021-02-06 22:25:56 +01:00
Eren Gölge	e7e880f514	fix gdown	2021-02-05 13:42:24 +00:00
Eren Gölge	f4f6290eec	Merge branch 'pr/gerazov/641' into dev	2021-02-05 13:14:49 +00:00
Eren Gölge	d49757faaa	linter updates	2021-02-05 13:10:43 +00:00
Branislav Gerazov	f063545325	improve robustness of defining wavernn in config file	2021-02-05 13:26:33 +01:00
Branislav Gerazov	24ffa9e9f6	update wavernn test config, delete cap=True	2021-02-05 13:10:02 +01:00
Branislav Gerazov	cb77aef36c	waveRNN fix	2021-02-04 09:52:03 +01:00
Thorsten Mueller	d74866cb8e	Merge remote-tracking branch 'upstream/dev' into dev Fix for circleci error mentioned in PR https://github.com/mozilla/TTS/pull/637	2021-02-02 19:40:18 +01:00
Thorsten Mueller	a82152eef3	Ups. Added missing ,	2021-02-02 19:29:16 +01:00
Thorsten Mueller	4cb4fcf02c	Set out_path to be required param.	2021-02-02 19:29:16 +01:00
Thorsten Mueller	c75ea74914	Added info if model already downloaded in --list_models	2021-02-02 19:29:16 +01:00
Eren Gölge	2edab4b3f9	disable pw in audio that causes numpy issue	2021-02-01 17:05:03 +00:00
Eren Gölge	5c46543765	linter fixes and version updates for deps	2021-02-01 13:18:56 +00:00
Eren Gölge	8774e37444	unpin cython version and commentout pyworld in audio.py causing dep issues	2021-02-01 11:34:05 +00:00
Eren Gölge	5beed0ddcd	Merge branch 'dev' of https://github.com/mozilla/TTS into dev	2021-02-01 11:27:14 +00:00
Eren Gölge	c7407571fa	fix #638	2021-02-01 10:05:55 +00:00
Eren Gölge	dfdac1def9	Merge pull request #636 from thorstenMueller/dev Set out_path to be required param in compute_statistics.py.	2021-01-29 18:08:31 +01:00
Thorsten Mueller	44c4a49745	Set out_path to be required param.	2021-01-29 17:23:38 +01:00
Eren Gölge	536366dc0a	Merge pull request #635 from SanjaESC/patch-1 fix device mismatch wavegrad training	2021-01-29 16:42:25 +01:00
Eren Gölge	0354b6f35e	move hubconf	2021-01-29 15:28:32 +00:00
Eren Gölge	aa5f24608a	hubconf.py and load .models.json from the defualt location by mange.py	2021-01-29 15:28:26 +00:00
Alexander Korolev	e81ebec7a8	fix device mismatch wavegrad training this should fixe the device mismatch as seen here https://github.com/mozilla/TTS/issues/622#issue-789802916	2021-01-29 15:18:59 +01:00
Eren Gölge	a926aa106d	reorder imports	2021-01-29 01:36:21 +01:00
Eren Gölge	8a6eee7fec	distill import statement, check python version in setup.py	2021-01-28 17:04:08 +01:00
Eren Gölge	131a163c95	Merge pull request #628 from thorstenMueller/dev Added info if model already downloaded in --list_models	2021-01-28 13:10:06 +01:00
Alexander Korolev	ca28e05ed7	update fixed stopnet_pos_weight parameter config parameter c.stopnet_pos_weight has currently no effect as it is not used.	2021-01-27 16:33:25 +01:00
Thorsten Mueller	ccbd542eb0	Added info if model already downloaded in --list_models	2021-01-27 16:19:02 +01:00
Eren Gölge	25c86ca715	README update, set default models for synthesize.py and server.py. Disable verbose for ap init.	2021-01-27 11:47:03 +01:00
Eren Gölge	4f32e77006	platform indep. way to fetch user data folder	2021-01-26 17:32:43 +01:00
Eren Gölge	0117c811a9	add a button to index.html to see the model details	2021-01-26 12:33:27 +01:00
Eren Gölge	a3adcaccdb	Merge branch 'pr/thorstenMueller/623' into dev	2021-01-26 12:19:39 +01:00
Eren Gölge	b464cab9b8	setup.py update and pylint fixes	2021-01-26 02:57:50 +01:00
Eren Gölge	660d61aeeb	maximum_path_numpy and CYTHON adabtable import	2021-01-26 02:57:07 +01:00
Eren Gölge	877f0bbfba	manifest.in update	2021-01-26 02:56:55 +01:00
Eren Gölge	82e029529e	fix manifest file	2021-01-25 13:27:54 +01:00
Eren Gölge	57b668fd86	fixing dome pypi issues	2021-01-25 13:06:12 +01:00
Eren Gölge	60c1bb93d9	fixes before first PyPI release	2021-01-25 11:16:20 +01:00
Thorsten Mueller	afb7db2a1d	Removed unneeded check and removed specific taco2 model name.	2021-01-22 16:22:50 +01:00
Eren Gölge	fae10309e4	Merge pull request #624 from SanjaESC/patch-3 Update train_tacotron.py	2021-01-22 13:29:09 +01:00
Eren Gölge	5ee73c2bae	Merge branch 'dev' of https://github.com/mozilla/TTS into dev	2021-01-22 13:26:27 +01:00
Eren Gölge	5fb611ef40	static image for server index.html	2021-01-22 03:01:53 +01:00
Eren Gölge	ca647cf222	Model Manager to download released models	2021-01-22 02:35:43 +01:00
Eren Gölge	ca8ad9c21e	rename audio._normalize to audio.normalize	2021-01-22 02:33:19 +01:00
Eren Gölge	c990b3a59c	linter fixes and test fixes	2021-01-22 02:32:35 +01:00
Alexander Korolev	f251dc8c0e	Update train_tacotron.py When attempting to fine-tune a model with "prenet_type": "bn" that was originally trained with "prenet_type": "original", a RuntimeError is thrown that stops the training. By catching the RuntimeError, the required layers can be partially restored and the training will continue without any problems.	2021-01-21 21:16:30 +01:00
Eren Gölge	0ab2eb2664	use synthesizer in both synthesize.py and server.pu	2021-01-21 15:54:33 +01:00
Eren Gölge	9addfabc43	wavernn load_checkpoint function	2021-01-21 15:31:13 +01:00
Eren Gölge	50fee59a2c	update synthesizer.py for better interfacing to different models	2021-01-21 15:30:49 +01:00
Eren Gölge	007a4d7139	remove 3rd paty wavernn support from server.py and add ModelManager arguments	2021-01-21 15:30:16 +01:00
Eren Gölge	6b6e989fd2	update server readme	2021-01-21 15:29:46 +01:00
Thorsten Mueller	e414582be6	Added option for server ui details page.	2021-01-20 21:56:40 +01:00
root	1bc8fbbd3c	set eval mode whe nloading models	2021-01-20 02:14:18 +00:00
root	5bd7238153	interpolate spectrogram in vocoder generic utils for matching sample rates	2021-01-20 02:13:01 +00:00
root	ca3743539a	load_checkpoint func for vocoder models	2021-01-20 02:12:29 +00:00
root	ea39715305	read_json_with_comments	2021-01-20 02:11:55 +00:00

... 8 9 10 11 12 ...

1186 Commits