coqui-tts

Commit Graph

Author	SHA1	Message	Date
Enno Hermann	aa0fbdf27e	fix(bin.extract_tts_spectrograms): set quantization bits	2023-11-15 16:50:59 +01:00
Enno Hermann	13e640f17e	refactor(audio.processor): use load_wav from numpy_transforms	2023-11-15 16:50:59 +01:00
Enno Hermann	9a43eafd60	refactor(audio.processor): use volume_norm from numpy_transforms	2023-11-15 16:33:13 +01:00
Enno Hermann	0a0e7a3bae	refactor(audio.processor): use trim_silence from numpy_transforms	2023-11-15 16:33:13 +01:00
Enno Hermann	842a632cd5	refactor(audio.processor): use find_endpoint from numpy_transforms	2023-11-15 16:33:13 +01:00
Enno Hermann	5232bf9e36	chore(audio.processor): remove duplicate assert Already checked in numpy_transforms.compute_f0	2023-11-15 16:33:13 +01:00
Enno Hermann	b620092865	refactor(audio.processor): use rms_volume_norm from numpy_transforms	2023-11-15 16:33:13 +01:00
Enno Hermann	11e98d3dac	refactor(audio.processor): use pre-/deemphasis from numpy_transforms	2023-11-15 16:33:13 +01:00
Enno Hermann	f37cc4c028	refactor(audio.processor): remove duplicate stft_parameters	2023-11-15 16:33:13 +01:00
Enno Hermann	da229f3912	refactor(audio.processor): remove duplicate build_mel_basis	2023-11-15 16:33:13 +01:00
Enno Hermann	754877784b	refactor(audio.processor): remove duplicate mel_to_linear	2023-11-15 16:33:08 +01:00
Enno Hermann	fd9d6d4b0f	refactor(audio.processor): remove duplicate linear_to_mel	2023-11-15 16:32:13 +01:00
Enno Hermann	4fd5c46937	refactor(audio.processor): remove duplicate amp_to_db	2023-11-15 16:32:13 +01:00
Enno Hermann	794f41c611	refactor(audio.processor): remove duplicate db_to_amp	2023-11-15 16:32:13 +01:00
Enno Hermann	5a5da76260	chore(audio.processor): remove unused compute_stft_paddings Same function available in numpy_transforms	2023-11-15 16:32:13 +01:00
Enno Hermann	d75879802a	refactor(audio.processor): remove duplicate stft+griffin_lim	2023-11-15 16:32:06 +01:00
Enno Hermann	8fa4de1c8c	chore: remove unused argument	2023-11-13 21:43:06 +01:00
Eren Gölge	6f1cba2f81	Update to v0.20.3	2023-11-09 17:41:37 +01:00
Enno Hermann	3b1e7038bc	fix(formatters): set missing root_path attribute (#3182 ) Fixes #2778	2023-11-09 16:49:52 +01:00
Aarni Koskela	a8e9163fb3	xtts/tokenizer: merge duplicate implementations of preprocess_text (#3170 ) This was found via ruff: > F811 Redefinition of unused `preprocess_text` from line 570	2023-11-09 16:32:12 +01:00
Matthew Boakes	1b9c400bca	PyTorch 2.1 Updates (Weight Norm and TorchAudio I/O) (#3176 ) * Replaced PyTorch weight_norm With parametrizations.weight_norm * TorchAudio: Migrating The I/O Functions To Use The Dispatcher Mechanism * Corrected Code Style --------- Co-authored-by: Eren Gölge <erogol@hotmail.com>	2023-11-09 16:31:03 +01:00
Gorkem	66a1e248d0	torchaudio should use proper backend to load audio (#3179 )	2023-11-09 16:28:39 +01:00
Eren Gölge	46d9c27212	Update to v0.20.2	2023-11-08 16:07:56 +01:00
Julian Weber	03ad90135b	Add lang code in XTTS doc (#3158 ) * Add lang code in XTTS doc * Remove ununsed config and args * update docs * woops	2023-11-08 13:47:33 +01:00
Gorkem	78a596618a	Fix for exception on streaming if last chunk empty (#3160 )	2023-11-08 11:32:02 +01:00
Enno Hermann	99edd6daa3	Fix ModelManager.list_models() (#3128 ) * fix(utils.manage): remove hard-coded model_type variable * refactor(utils.manage): address lint issues, fix typos Addressed the following: TTS/utils/manage.py:307:12: R1705: Unnecessary "else" after "return" (no-else-return) TTS/utils/manage.py:308:21: W1514: Using open without explicitly specifying an encoding (unspecified-encoding) TTS/utils/manage.py:299:4: R1710: Either all return statements in a function should return an expression, or none of them should. (inconsistent-return-statements) TTS/utils/manage.py:299:4: R0201: Method could be a function (no-self-use) TTS/utils/manage.py:314:4: R0201: Method could be a function (no-self-use)	2023-11-08 11:29:01 +01:00
Eren Gölge	77b18126c7	Merge pull request #3126 from akx/freevc-config-module Move FreeVCConfig to TTS.vc.configs (like all other config classes)	2023-11-08 11:24:47 +01:00
Eren Gölge	cc6e9fcaa7	Fix #3153 (#3169 )	2023-11-08 11:13:58 +01:00
Eren Gölge	a24ebcd8a6	Fix coqui api (#3168 )	2023-11-08 10:51:23 +01:00
Julian Weber	ce1a39a9a4	Add char limit warn (#3130 ) * Add char limit warning * Adding v2 langs * cached_property for cutlet * Fix import	2023-11-08 10:24:23 +01:00
Eren Gölge	f846a9f300	Update to v0.20.1	2023-11-07 14:17:36 +01:00
Edresson Casanova	cbdbc44e0f	Fix XTTS v2.0 training recipe (#3154 ) * Fix XTTS v2.0 training recipe * Update XTTS v2 model hash	2023-11-07 14:16:44 +01:00
Edresson Casanova	5f9ab6cfaa	Fix style Co-authored-by: Aarni Koskela <akx@iki.fi>	2023-11-06 19:22:34 -03:00
Edresson Casanova	2470599d18	Drop XTTS v1	2023-11-06 19:12:04 -03:00
Edresson Casanova	13243df526	Update XTTS v1.1 files	2023-11-06 19:10:21 -03:00
Edresson Casanova	09fb317e6d	Remove unused code	2023-11-06 17:36:32 -03:00
Edresson Casanova	b146de4ce8	Bug fix on XTTS v2.0 Trainer	2023-11-06 20:26:01 +01:00
Edresson Casanova	1b6f8d0e46	Update unit tests and recipes	2023-11-06 20:25:06 +01:00
Edresson Casanova	72b2bac0f8	Load reference in 24khz to avoid issued with multiple sr references	2023-11-06 20:25:06 +01:00
Edresson Casanova	00294ffdf6	Update XTTS docs	2023-11-06 20:24:06 +01:00
Edresson Casanova	459ad70dc8	Add support for multiples speaker references on XTTS inference	2023-11-06 20:22:35 +01:00
Eren Gölge	f0cb19ecca	Drop diffusion from XTTS (#3150 ) * Drop diffusion for XTTS * Make style * Drop diffusion deps in code * Restore thrashed	2023-11-06 20:15:49 +01:00
Eren G??lge	5d418bb84a	Update docs	2023-11-06 18:48:41 +01:00
Eren G??lge	9bbf6eb8dd	Drop use_ne_hifigan	2023-11-06 18:43:38 +01:00
Eren G??lge	9d54bd7655	Fixup XTTS	2023-11-06 18:13:58 +01:00
Eren Gölge	c713a839da	Update VERSION	2023-11-06 15:51:56 +01:00
Edresson Casanova	e45227d9ff	XTTS v2.0 (#3137 ) * Implement most similar ref training approach * Use non-enhanced hifigan for test samples * Add Perceiver * Update GPT Trainer for perceiver support * Update XTTS docs * Bug fix masking with XTTS perceiver * Bug fix on gpt forward * Bug Fix on XTTS v2.0 training * Add XTTS v2.0 unit tests * Add XTTS v2.0 inference unit tests * Bug Fix on diffusion inference * Add XTTS v2.0 training recipe * Placeholder model entry * Add cloning params to config * Make prompt embedding configurable * Make cloning configurable * Cheap fix for a cheaper fix * Prevent resampling * Update model entry * Update docs * Update requirements * Code linting * Add xtts v2 to sep tests * Bug fix on XTTS get_gpt_cond_latents * Bug fix on rebase * Make style * Bug fix in Japenese tokenizer * Add num2words to deps * Remove unused kwarg and added num_beams=1 as default --------- Co-authored-by: Eren G??lge <egolge@coqui.ai>	2023-11-06 14:58:18 +01:00
Aarni Koskela	38f6f8f0bb	Run `make style` & re-enable it in CI (#3127 )	2023-11-06 11:36:37 +01:00
Aarni Koskela	5ae369d629	Move FreeVCConfig to TTS.vc.configs (like all other config classes)	2023-10-31 16:56:25 +02:00
Eren Gölge	6fef4f9067	Bump up to v0.19.1	2023-10-30 10:37:28 +01:00

1 2 3 4 5 ...

1918 Commits