coqui-tts

Commit Graph

Author	SHA1	Message	Date
Edresson	77d85c6cc5	add softmaxproto loss and bug fix in data loader	2021-05-10 17:08:38 -03:00
Edresson	78bad25f2b	update voxceleb download link	2021-05-07 23:45:15 -03:00
Eren Gölge	f7582107da	Merge pull request #453 from Edresson/dev Script for spectrogram extraction using teacher forcing and Glow-TTS inference with MAS.	2021-05-06 17:53:28 +02:00
Edresson	501c8e0302	remove unused vars on extract tts spectrograms script	2021-05-04 19:04:13 -03:00
Eren Gölge	0325c58862	Merge pull request #468 from shaun95/patch-1 Update losses.py	2021-05-03 14:45:24 +02:00
Eren Gölge	8cb27267a4	formatting	2021-05-03 14:26:35 +02:00
Eren Gölge	87d674a038	bumpup librosa version to 0.8.0	2021-05-03 14:25:09 +02:00
shaun	7d0ec62bf1	Update losses.py The block of code for use_l1_spec_loss is repeated which doubles the amount of L1 loss when enabled. The weight for L1 loss in hifigan_ljspeech configutation will likely need to be doubled to compensate (l1_spec_loss_weight)	2021-05-02 14:14:24 +02:00
Edresson	3ecd556bbe	add unit test for extract tts spectrograms script	2021-05-01 13:41:56 -03:00
Edresson	446b1da936	create inference function	2021-04-29 18:18:37 -03:00
Eren Gölge	f02f0338c2	fix .models.json and add testing to check released models availability	2021-04-29 09:32:36 +02:00
Eren Gölge	fd95e9b8a4	[ci skip] Add sam models	2021-04-28 21:57:31 +02:00
Agrin Hilmkil	351d0ed6ae	Remove unnecessary fsspec usage	2021-04-28 11:21:08 +02:00
Agrin Hilmkil	167f86417e	Move dev, tf, notebook dependencies to extras	2021-04-28 11:20:06 +02:00
Eren Gölge	1235e54738	test for synthesize.py	2021-04-27 14:17:38 +02:00
Eren Gölge	4719414f2e	remove imports	2021-04-27 11:25:17 +02:00
Eren Gölge	add97cddc1	move function and remove import	2021-04-27 11:22:56 +02:00
Eren Gölge	734e6a515c	bug fix	2021-04-27 10:27:45 +02:00
Eren Gölge	6bdd81667e	place holders for sc-glow and hifigan models	2021-04-26 19:53:12 +02:00
Eren Gölge	2f0716073e	enable multi-speaker CoquiTTS models for synthesize.py	2021-04-26 19:36:53 +02:00
Eren Gölge	b531fa699c	remove conflicy noise	2021-04-26 15:27:52 +02:00
Eren Gölge	f37b488876	Merge branch 'speaker-manager' of https://github.com/coqui-ai/TTS into speaker-manager	2021-04-26 15:25:25 +02:00
Eren Gölge	b82daa5e86	style and linter fixes	2021-04-26 15:22:24 +02:00
Edresson	20e42a3381	add save audio option	2021-04-23 15:00:00 -03:00
Edresson	8228091f92	add script for extraction of tts spectrograms	2021-04-23 14:17:46 -03:00
Eren Gölge	4cf211348d	styling and linting	2021-04-23 18:04:37 +02:00
Eren Gölge	7eb0c60d2e	let synthesizer to pass speaker encoder file paths to speaker manager	2021-04-23 18:04:37 +02:00
Eren Gölge	f69195739e	let speaker manager compute mean x_vector from multiple wav files	2021-04-23 18:04:37 +02:00
Eren Gölge	179722e3a7	new arguments to synthesize.py for loading speaker encoder and speaker wavs	2021-04-23 18:04:37 +02:00
Eren Gölge	dfa415a8b8	small refactor in server.py	2021-04-23 18:04:37 +02:00
Eren Gölge	c80d21f311	load speaker_encoder_ap and compute x_vector directly from the input file in speaker manager	2021-04-23 18:04:37 +02:00
Eren Gölge	ad047c8195	html formatting, enable multi-speaker model on the server with a dropdown menu to select the speaker	2021-04-23 18:04:37 +02:00
Eren Gölge	f9f3d04d14	remove moved function	2021-04-23 18:04:37 +02:00
Eren Gölge	10c988ac8c	update server.py	2021-04-23 18:04:37 +02:00
Eren Gölge	6d0f5e0459	use SpeakerManager in Synthesizer	2021-04-23 18:04:37 +02:00
Eren Gölge	e97126314c	add ```unique``` argument to make_symbols to fix the incompat. issue of the SC-Glow models	2021-04-23 18:04:37 +02:00
Eren Gölge	d08888e603	formating speakers.py	2021-04-23 18:04:37 +02:00
Eren Gölge	df422223a3	initial SpeakerManager implementation	2021-04-23 18:04:37 +02:00
Eren Gölge	7a7aeb35f5	fix the glow-tts in setup_model	2021-04-23 18:04:37 +02:00
Eren Gölge	d42748082a	update argument name external_speaker_embedding_dim -> speaker_embedding_dim add inference_noise_scale argument to glow-tts	2021-04-23 18:04:37 +02:00
Eren Gölge	2da81f5bb6	add load_chekpoint to speaker encoder	2021-04-23 18:04:37 +02:00
Eren Gölge	1229ccbf07	update argument name in server.py	2021-04-23 18:04:37 +02:00
Eren Gölge	af2d36faeb	update synthesize.py for multi-speaker setting	2021-04-23 18:04:37 +02:00
Eren Gölge	99dc07a7dd	add ```unique``` param to keep scglow models compatible (they are duplicate symbols ins the character set)	2021-04-23 18:04:37 +02:00
Eren Gölge	c955a12428	set the default layer size compatible with scglow	2021-04-23 18:04:37 +02:00
Eren Gölge	3ace2440fa	fix a mistake from rebase	2021-04-23 18:04:37 +02:00
Eren Gölge	aadb2106ec	code styling	2021-04-23 18:04:37 +02:00
Eren Gölge	af7baa3387	refactoring to allow defining the speaker file externally	2021-04-23 18:04:37 +02:00
kirianguiller	7dccbfdcd5	handle multi speaker and gst in Synthetizer class	2021-04-23 18:04:37 +02:00
Edresson	d2b6326b8b	change optimizer initialization for compatibility with Hifi-GAN official implementation	2021-04-23 07:54:39 -03:00

1 2 3 4 5 ...

661 Commits