coqui-tts

Commit Graph

Author	SHA1	Message	Date
Enno Hermann	3c2d5a9e03	Remove duplicate AudioProcessor code and fix ExtractTTSpectrogram.ipynb (#3230 ) * chore: remove unused argument * refactor(audio.processor): remove duplicate stft+griffin_lim * chore(audio.processor): remove unused compute_stft_paddings Same function available in numpy_transforms * refactor(audio.processor): remove duplicate db_to_amp * refactor(audio.processor): remove duplicate amp_to_db * refactor(audio.processor): remove duplicate linear_to_mel * refactor(audio.processor): remove duplicate mel_to_linear * refactor(audio.processor): remove duplicate build_mel_basis * refactor(audio.processor): remove duplicate stft_parameters * refactor(audio.processor): use pre-/deemphasis from numpy_transforms * refactor(audio.processor): use rms_volume_norm from numpy_transforms * chore(audio.processor): remove duplicate assert Already checked in numpy_transforms.compute_f0 * refactor(audio.processor): use find_endpoint from numpy_transforms * refactor(audio.processor): use trim_silence from numpy_transforms * refactor(audio.processor): use volume_norm from numpy_transforms * refactor(audio.processor): use load_wav from numpy_transforms * fix(bin.extract_tts_spectrograms): set quantization bits * fix(ExtractTTSpectrogram.ipynb): adapt to current TTS code Fixes #2447, #2574 * refactor(audio.processor): remove duplicate quantization methods	2023-11-16 10:57:06 +01:00
Eren Gölge	914280a556	Bump up to v0.11.0 (#2329 ) * Make style * Bump up to v0.11.0	2023-02-08 13:58:49 +01:00
Stanislav Kachnov	2c9f00a808	Fix tune wavegrad (#1844 ) * fix imports in tune_wavegrad * load_config returns Coqpit object instead None * set action (store true) for flag "--use_cuda"; start to tune if module is running as the main program * fix var order in the result of batch collating * make style * make style with black and isort	2022-08-22 09:55:32 +02:00
Edresson Casanova	60034674f9	Remove audio padding before mel spec extraction	2022-05-07 13:12:09 +02:00
Eren Gölge	7058fcc3ff	Take file extension as an argument	2022-02-25 11:26:59 +01:00
Eren Gölge	127118c637	Update TTS.tts formatters (#1228 ) * Return Dict from tts formatters * Make style	2022-02-11 23:03:43 +01:00
Eren Gölge	e15bc157d8	Fix #873	2021-10-14 14:39:45 +00:00
Eren Gölge	21cc0517a3	Fix WaveRNN test	2021-10-01 10:21:37 +00:00
Eren Gölge	9f23ad6a0f	Fix imports	2021-09-30 14:47:56 +00:00
Eren Gölge	e20ea57c87	Update comment and add a warning	2021-09-07 12:23:32 +00:00
Eren Gölge	58cc414477	Fix WaveGrad `test_run`	2021-07-16 13:02:25 +02:00
Eren Gölge	d7225eedb0	Update `vocoder` datasets and `setup_dataset`	2021-06-28 17:03:20 +02:00
Eren Gölge	8cb27267a4	formatting	2021-05-03 14:26:35 +02:00
Eren Gölge	a7f6045644	Merge branch 'reformat' into hifigan-reformat	2021-04-12 12:00:17 +02:00
Eren Gölge	f519012dea	reformatting and styling	2021-04-12 11:47:39 +02:00
Eren Gölge	e5b9607bc3	isort all imports	2021-04-09 00:45:20 +02:00
Eren Gölge	0e79fa86ad	format with black and pylint 2.7.3	2021-04-09 00:38:08 +02:00
Eren Gölge	cd69da4868	linter fixes #2	2021-04-08 16:57:46 +02:00
Eren Gölge	aee24b0704	set different seed in gan_dataset when it is multi-workers	2021-04-08 11:28:30 +02:00
Eren Gölge	57f6bd1afa	make using different samples for G and D networks optional	2021-04-08 11:26:01 +02:00
Eren Gölge	f890454de3	linter fixes	2021-04-07 12:36:03 +02:00
Eren Gölge	b86e7fb2e8	pad short samples when loading precomputed features in vocoder trainign	2021-04-06 16:24:50 +02:00
Eren Gölge	594d8d8f09	linter fixes	2021-03-08 11:22:59 +01:00
Eren Gölge	00b5090974	linter fix	2021-03-08 11:05:30 +01:00
Eren Gölge	e15734c3fc	linter fix	2021-03-08 05:29:43 +01:00
erogol	e3eda159d1	wavegrad_dataset update	2020-11-25 14:50:50 +01:00
erogol	c76a617072	linter updates	2020-11-09 13:18:35 +01:00
erogol	c80225544e	tune wavegrad to fine the best noise schedule for inferece	2020-11-06 13:04:46 +01:00
erogol	dc2825dfb2	wavegrad dataset update	2020-10-29 12:31:43 +01:00
erogol	7bcdb7ac35	wavegrad updates	2020-10-29 12:31:43 +01:00
erogol	e02cd6a220	initial wavegrad layers model and trainig script	2020-10-29 12:30:37 +01:00
erogol	9d0ae2bfb4	wavernn dataloader handling for short samples and mixed precision training	2020-10-28 12:31:01 +01:00
sanjaesc	2ee47e9568	fix pylint once again	2020-10-27 12:17:38 +01:00
sanjaesc	bef3f2020b	compute audio feat on dataload	2020-10-27 12:17:38 +01:00
sanjaesc	91e5f8b63d	added to device cpu/gpu + formatting	2020-10-27 12:17:38 +01:00
sanjaesc	e8294cb9db	fixing pylint errors	2020-10-27 12:17:38 +01:00
sanjaesc	878b7c373e	added feature preprocessing if not set in config	2020-10-27 12:17:38 +01:00
sanjaesc	e495e03ea1	some minor changes to wavernn	2020-10-27 12:17:38 +01:00
Alex K	9c3c7ce2f8	wavernn stuff...	2020-10-27 12:17:38 +01:00
Alex K	6378fa2b07	add initial wavernn support	2020-10-27 12:17:38 +01:00
erogol	df19428ec6	rename the project to old TTS	2020-09-09 12:27:23 +02:00

41 Commits