coqui-tts

Commit Graph

Author	SHA1	Message	Date
erogol	5a59467f34	scaler fix for wavegrad and wavernn. Save and load scaler	2020-11-14 13:00:35 +01:00
erogol	d8511efa8f	use native amp for tacotron training	2020-11-14 12:59:28 +01:00
Qingping Hou	0cc3650ef6	support loading config in yaml	2020-11-14 00:13:53 -08:00
erogol	a7aefd5c50	use pytorch amp for mixed precision training for Tacotron	2020-11-12 12:51:56 +01:00
erogol	67e2b664e5	compute embeddings and create speakers.json	2020-11-12 12:51:17 +01:00
erogol	f8fd300b3e	bug fix	2020-11-10 12:53:39 +01:00
erogol	016d3503da	compute embeddings with speaker encoder	2020-11-10 12:51:02 +01:00
erogol	21364331d2	Merge branch 'dev' of https://github.com/mozilla/TTS into dev	2020-11-09 13:31:12 +01:00
erogol	c76a617072	linter updates	2020-11-09 13:18:35 +01:00
erogol	c80225544e	tune wavegrad to fine the best noise schedule for inferece	2020-11-06 13:04:46 +01:00
erogol	ef04d7fae7	bug fix for wavernn training	2020-10-30 14:08:41 +01:00
erogol	183fe56d95	Merge branch 'ssim_loss' into dev	2020-10-29 23:49:09 +01:00
krzim	2202e171c5	Fix import to grab the encoder model save function I saw that this was recently changed but I'm not sure if it should have been. This is the correct function given the arguments provided to it in the train loop.	2020-10-29 18:03:11 -04:00
erogol	73581cd94c	renaming train scripts and updating tests	2020-10-29 16:50:07 +01:00
erogol	946a0c0fb9	bug fixes for single speaker glow-tts, enable torch based amp. Make amp optional for wavegrad. Bug fixes for synthesis setup for glow-tts	2020-10-29 15:45:50 +01:00
erogol	14c2381207	weight norm and torch based amp training for wavegrad	2020-10-29 12:31:43 +01:00
erogol	c8a4c771a8	train wavegrad updates	2020-10-29 12:31:43 +01:00
erogol	670f44aa18	enable compute stats by vocoder config	2020-10-29 12:31:43 +01:00
erogol	f79bbbbd00	use Adam for wavegras instead of RAdam	2020-10-29 12:31:43 +01:00
erogol	7bcdb7ac35	wavegrad updates	2020-10-29 12:31:43 +01:00
erogol	a1582a0e12	fix distributed training for train_* scripts	2020-10-29 12:31:43 +01:00
erogol	e02cd6a220	initial wavegrad layers model and trainig script	2020-10-29 12:30:37 +01:00
erogol	e723b99888	handle distributed model as saving	2020-10-29 12:30:37 +01:00
Eren Gölge	26c18b61c9	Merge pull request #553 from Edresson/dev bug fix in the inference with GlowTTS	2020-10-28 18:49:31 +01:00
erogol	9d0ae2bfb4	wavernn dataloader handling for short samples and mixed precision training	2020-10-28 12:31:01 +01:00
Edresson	f01502a9db	bug fix in glowTTS sythesize	2020-10-27 16:30:16 -03:00
Eren Gölge	f4b8170bd1	Merge pull request #545 from Edresson/dev GlowTTS zeroshot TTS support	2020-10-27 15:23:41 +01:00
erogol	0becef4b58	small updates	2020-10-27 12:17:38 +01:00
sanjaesc	2ee47e9568	fix pylint once again	2020-10-27 12:17:38 +01:00
sanjaesc	bef3f2020b	compute audio feat on dataload	2020-10-27 12:17:38 +01:00
sanjaesc	91e5f8b63d	added to device cpu/gpu + formatting	2020-10-27 12:17:38 +01:00
sanjaesc	016a77fcf2	fix formatting + pylint	2020-10-27 12:17:38 +01:00
sanjaesc	e8294cb9db	fixing pylint errors	2020-10-27 12:17:38 +01:00
sanjaesc	878b7c373e	added feature preprocessing if not set in config	2020-10-27 12:17:38 +01:00
sanjaesc	e495e03ea1	some minor changes to wavernn	2020-10-27 12:17:38 +01:00
Alex K	6378fa2b07	add initial wavernn support	2020-10-27 12:17:38 +01:00
Edresson	d9540a5857	add blank token in sequence for encrease glowtts results	2020-10-25 15:08:28 -03:00
Edresson	fbea058c59	add parse speakers function	2020-10-24 16:10:05 -03:00
Edresson	07345099ee	GlowTTS zero-shot TTS Support	2020-10-24 15:58:39 -03:00
Edresson	b7f9ebd32b	add check arguments for GlowTTS and multispeaker training bug fix	2020-10-19 17:17:58 -03:00
erogol	c5074cfd8e	general purpose distribute.py	2020-10-08 01:30:42 +02:00
Edresson	99d5a0ac07	add Speaker Conditional GST support	2020-09-29 16:09:27 -03:00
erogol	154f90bc44	format speaker encoder imports	2020-09-28 11:19:19 +02:00
mueller91	cfeeef7a7f	fix: broken imports and missing files after merging in latest commits from mozilla/dev into mueller91/dev. speaker_encoder's config.json and visuals.py are missing in the current dev branch of MozillaTTS, and some imports are broken.	2020-09-22 20:10:41 +02:00
mueller91	1fe5eb054f	Merge branch 'dev' of https://github.com/mozilla/TTS into dev Conflicts: TTS/bin/train_encoder.py requirements.txt	2020-09-22 19:58:53 +02:00
mueller91	df4caec4b7	add: check_config for speaker_encoder	2020-09-22 19:52:09 +02:00
erogol	10258724d1	linter fixes	2020-09-22 03:54:16 +02:00
erogol	a6df617eb1	Merge branch 'glow-tts-amp-time_depth_conv' into dev	2020-09-21 14:23:45 +02:00
erogol	8150d5727e	Merge branch 'dev' of https://github.com/mozilla/TTS into dev	2020-09-21 14:21:55 +02:00
erogol	e0b9fa887f	glow-tts modules added	2020-09-21 14:15:40 +02:00
mueller	6b0621c794	cleanup	2020-09-17 16:46:43 +02:00
mueller	a273b1a210	add: add random noise to dataset	2020-09-17 14:23:40 +02:00
mueller	e36a3067e4	add: save wavs instead feats to storage. This is done in order to mitigate staleness when caching and loading from data storage	2020-09-17 14:14:30 +02:00
mueller	1511076fde	add: Configurable encoder dataset storage to reduce disk I/O add: Averaged time for data loader to console and Tensorboard output	2020-09-17 12:29:38 +02:00
maxbachmann	60ce862113	use difflib for string matching	2020-09-14 23:55:34 +02:00
erogol	498a3ea36f	fix condition check	2020-09-12 03:39:01 +02:00
erogol	15e6ab3912	glow-tts module renaming updates	2020-09-12 03:33:36 +02:00
erogol	f9001a4bdd	refactor and fix compat issues for speaker encoder	2020-09-11 17:17:07 +02:00
erogol	df19428ec6	rename the project to old TTS	2020-09-09 12:27:23 +02:00

... 5 6 7 8 9

409 Commits