coqui-tts

Commit Graph

Author	SHA1	Message	Date
Eren Gölge	da49089a72	update melgan training test batch size	2021-05-12 10:12:11 +02:00
Edresson	3433c2f348	add compute embedding for the new speaker encoder	2021-05-12 03:06:46 -03:00
Eren Gölge	0213e1cbf4	update configs for tts models to match the field typed with the expected values	2021-05-12 00:57:38 +02:00
Eren Gölge	715b0a65a0	update main.yml for python x64 fix test	2021-05-12 00:57:29 +02:00
Edresson	3fcc748b2e	implement the Speaker Encoder H/ASP	2021-05-11 16:27:05 -03:00
Eren Gölge	843d1b3d98	linter fixes	2021-05-11 11:30:00 +02:00
Eren Gölge	19fb1d743d	style update	2021-05-11 11:30:00 +02:00
Eren Gölge	6e980b49c4	fix synthesizer.py for Coqpit	2021-05-11 11:29:18 +02:00
Eren Gölge	db14dcd95a	remove old load_config	2021-05-11 11:29:18 +02:00
Eren Gölge	a21ac883dd	add get_cuda()	2021-05-11 11:29:18 +02:00
Eren Gölge	21dd4d7960	fix load_config imports for Coqpit	2021-05-11 11:29:18 +02:00
Eren Gölge	c57f0b46bb	reintro use_gst for backwars compat	2021-05-11 11:29:18 +02:00
Eren Gölge	18e76a2309	fix speaker encoder model initialization	2021-05-11 11:29:18 +02:00
Eren Gölge	10de40bba1	make num_workers mandatory config field	2021-05-11 11:29:18 +02:00
Eren Gölge	df1ddd3539	allow read_json_with_comments for backward compat	2021-05-11 11:29:18 +02:00
Eren Gölge	9f7599e3c3	fix train_encoder for coqpit	2021-05-11 11:29:18 +02:00
Eren Gölge	f8e52965dd	add speaker encoder coqpit	2021-05-11 11:29:18 +02:00
Eren Gölge	ce2bba543e	remove extra from utils and move funcs to io.py	2021-05-11 11:29:18 +02:00
Eren Gölge	812dbc2b06	rm config.json	2021-05-11 11:29:18 +02:00
Eren Gölge	3fde2001b1	train_encoder refactoring for coqpit	2021-05-11 11:29:18 +02:00
Eren Gölge	9ee70af9bb	code styling	2021-05-11 11:29:18 +02:00
Eren Gölge	10db2baa06	global shared Coqpit configs	2021-05-11 11:29:18 +02:00
Eren Gölge	3dec62b183	add Coqpits for the vocoder models	2021-05-11 11:29:18 +02:00
Eren Gölge	6f4eed94f5	remove *.json vocoder configs	2021-05-11 11:29:18 +02:00
Eren Gölge	78b3825d0b	update train scripts for coqpit	2021-05-11 11:29:18 +02:00
Eren Gölge	757e90b1cc	load_config function to initialize the right Coqpit for the given model	2021-05-11 11:29:18 +02:00
Eren Gölge	e6f45b9eb7	update train_vocoder_gan.py for coqpit	2021-05-11 11:29:18 +02:00
Eren Gölge	bcebd69d09	remove bash tts training tests	2021-05-11 11:29:17 +02:00
Eren Gölge	7663bc63c1	add Coqpit configs for the TTS models	2021-05-11 11:29:17 +02:00
Eren Gölge	7227e8f1d2	update train_align_tts.py for coqpit	2021-05-11 11:29:17 +02:00
Eren Gölge	51a7e06945	glow_tts_config.py and train test on python	2021-05-11 11:29:17 +02:00
Eren Gölge	720fe13056	update glow_tts modules and training script for coqpit use	2021-05-11 11:29:17 +02:00
Eren Gölge	816e7ee698	remove default configs.json as replacing with Coqpit configs	2021-05-11 11:29:17 +02:00
Eren Gölge	35341d5482	move bash script based tests to python with coqpit	2021-05-11 11:29:17 +02:00
Eren Gölge	647163397d	coqpit refactoring	2021-05-11 11:29:17 +02:00
Eren Gölge	eaa130e813	fix tacotron for coqpit	2021-05-11 11:29:17 +02:00
Eren Gölge	65d7ad4250	refactor train_speedy_speech.py for coqpit	2021-05-11 11:29:17 +02:00
Eren Gölge	4a58fdfd59	comment out check-arguments before copying fields to the configs	2021-05-11 11:29:17 +02:00
Eren Gölge	05d9543ed8	init GST module using gst config in Tacotron models	2021-05-11 11:29:17 +02:00
Eren Gölge	93a00373f6	move split_dataset	2021-05-11 11:29:17 +02:00
Eren Gölge	9c18e40f64	black formatting	2021-05-11 11:29:17 +02:00
Eren Gölge	c34c8137d7	update compute_statistics for coqpit	2021-05-11 11:29:17 +02:00
Eren Gölge	79d7215142	config refactor #5 WIP	2021-05-11 11:29:17 +02:00
Eren Gölge	dc50f5f0b0	config refactor #4 WIP	2021-05-11 11:28:35 +02:00
Eren Gölge	97bd5f9734	[ci skip] config update #3 WIP	2021-05-11 11:28:35 +02:00
Eren Gölge	a21c0b5585	config update 2 WIP	2021-05-11 11:28:35 +02:00
Eren Gölge	e092ae40dc	config update WIP	2021-05-11 11:28:35 +02:00
Eren Gölge	06f80a4806	update check argument	2021-05-11 11:28:35 +02:00
Eren Gölge	bf7ddfa542	Merge pull request #481 from chmodsss/main Accessing __version__ command	2021-05-11 10:20:48 +02:00
Edresson	85ccad7e0a	add Audio data augamentation Addtive and RIR	2021-05-11 00:59:57 -03:00
Edresson	77d85c6cc5	add softmaxproto loss and bug fix in data loader	2021-05-10 17:08:38 -03:00
chmodsss	607d5cf377	[#480 ] Adding version variable	2021-05-10 19:46:34 +02:00
Adam Froghyar	7ddc885f37	deleted a line the broke GravesAttention	2021-05-10 15:42:59 +02:00
Edresson	78bad25f2b	update voxceleb download link	2021-05-07 23:45:15 -03:00
Eren Gölge	f7582107da	Merge pull request #453 from Edresson/dev Script for spectrogram extraction using teacher forcing and Glow-TTS inference with MAS.	2021-05-06 17:53:28 +02:00
Edresson	501c8e0302	remove unused vars on extract tts spectrograms script	2021-05-04 19:04:13 -03:00
Eren Gölge	0325c58862	Merge pull request #468 from shaun95/patch-1 Update losses.py	2021-05-03 14:45:24 +02:00
Eren Gölge	8cb27267a4	formatting	2021-05-03 14:26:35 +02:00
Eren Gölge	87d674a038	bumpup librosa version to 0.8.0	2021-05-03 14:25:09 +02:00
shaun	7d0ec62bf1	Update losses.py The block of code for use_l1_spec_loss is repeated which doubles the amount of L1 loss when enabled. The weight for L1 loss in hifigan_ljspeech configutation will likely need to be doubled to compensate (l1_spec_loss_weight)	2021-05-02 14:14:24 +02:00
Edresson	3ecd556bbe	add unit test for extract tts spectrograms script	2021-05-01 13:41:56 -03:00
Edresson	446b1da936	create inference function	2021-04-29 18:18:37 -03:00
Eren Gölge	f02f0338c2	fix .models.json and add testing to check released models availability	2021-04-29 09:32:36 +02:00
Eren Gölge	fd95e9b8a4	[ci skip] Add sam models	2021-04-28 21:57:31 +02:00
Agrin Hilmkil	351d0ed6ae	Remove unnecessary fsspec usage	2021-04-28 11:21:08 +02:00
Agrin Hilmkil	167f86417e	Move dev, tf, notebook dependencies to extras	2021-04-28 11:20:06 +02:00
Eren Gölge	1235e54738	test for synthesize.py	2021-04-27 14:17:38 +02:00
Eren Gölge	4719414f2e	remove imports	2021-04-27 11:25:17 +02:00
Eren Gölge	add97cddc1	move function and remove import	2021-04-27 11:22:56 +02:00
Eren Gölge	734e6a515c	bug fix	2021-04-27 10:27:45 +02:00
Eren Gölge	6bdd81667e	place holders for sc-glow and hifigan models	2021-04-26 19:53:12 +02:00
Eren Gölge	2f0716073e	enable multi-speaker CoquiTTS models for synthesize.py	2021-04-26 19:36:53 +02:00
Eren Gölge	b531fa699c	remove conflicy noise	2021-04-26 15:27:52 +02:00
Eren Gölge	f37b488876	Merge branch 'speaker-manager' of https://github.com/coqui-ai/TTS into speaker-manager	2021-04-26 15:25:25 +02:00
Eren Gölge	b82daa5e86	style and linter fixes	2021-04-26 15:22:24 +02:00
Edresson	20e42a3381	add save audio option	2021-04-23 15:00:00 -03:00
Edresson	8228091f92	add script for extraction of tts spectrograms	2021-04-23 14:17:46 -03:00
Eren Gölge	4cf211348d	styling and linting	2021-04-23 18:04:37 +02:00
Eren Gölge	7eb0c60d2e	let synthesizer to pass speaker encoder file paths to speaker manager	2021-04-23 18:04:37 +02:00
Eren Gölge	f69195739e	let speaker manager compute mean x_vector from multiple wav files	2021-04-23 18:04:37 +02:00
Eren Gölge	179722e3a7	new arguments to synthesize.py for loading speaker encoder and speaker wavs	2021-04-23 18:04:37 +02:00
Eren Gölge	dfa415a8b8	small refactor in server.py	2021-04-23 18:04:37 +02:00
Eren Gölge	c80d21f311	load speaker_encoder_ap and compute x_vector directly from the input file in speaker manager	2021-04-23 18:04:37 +02:00
Eren Gölge	ad047c8195	html formatting, enable multi-speaker model on the server with a dropdown menu to select the speaker	2021-04-23 18:04:37 +02:00
Eren Gölge	f9f3d04d14	remove moved function	2021-04-23 18:04:37 +02:00
Eren Gölge	10c988ac8c	update server.py	2021-04-23 18:04:37 +02:00
Eren Gölge	6d0f5e0459	use SpeakerManager in Synthesizer	2021-04-23 18:04:37 +02:00
Eren Gölge	e97126314c	add ```unique``` argument to make_symbols to fix the incompat. issue of the SC-Glow models	2021-04-23 18:04:37 +02:00
Eren Gölge	d08888e603	formating speakers.py	2021-04-23 18:04:37 +02:00
Eren Gölge	df422223a3	initial SpeakerManager implementation	2021-04-23 18:04:37 +02:00
Eren Gölge	7a7aeb35f5	fix the glow-tts in setup_model	2021-04-23 18:04:37 +02:00
Eren Gölge	d42748082a	update argument name external_speaker_embedding_dim -> speaker_embedding_dim add inference_noise_scale argument to glow-tts	2021-04-23 18:04:37 +02:00
Eren Gölge	2da81f5bb6	add load_chekpoint to speaker encoder	2021-04-23 18:04:37 +02:00
Eren Gölge	1229ccbf07	update argument name in server.py	2021-04-23 18:04:37 +02:00
Eren Gölge	af2d36faeb	update synthesize.py for multi-speaker setting	2021-04-23 18:04:37 +02:00
Eren Gölge	99dc07a7dd	add ```unique``` param to keep scglow models compatible (they are duplicate symbols ins the character set)	2021-04-23 18:04:37 +02:00
Eren Gölge	c955a12428	set the default layer size compatible with scglow	2021-04-23 18:04:37 +02:00
Eren Gölge	3ace2440fa	fix a mistake from rebase	2021-04-23 18:04:37 +02:00
Eren Gölge	aadb2106ec	code styling	2021-04-23 18:04:37 +02:00
Eren Gölge	af7baa3387	refactoring to allow defining the speaker file externally	2021-04-23 18:04:37 +02:00

1 2 3 4 5 ...

763 Commits