coqui-tts

Commit Graph

Author	SHA1	Message	Date
Edresson Casanova	e9a2c0606a	Add gc.collect()	2023-12-01 15:37:09 -03:00
Edresson Casanova	490af290d3	Delete unused variables	2023-12-01 15:21:33 -03:00
Edresson Casanova	eb18b27afc	Delete trainer to freeze memory	2023-12-01 14:07:33 -03:00
Edresson Casanova	5dd217a759	Update XTTS finetuner docs	2023-12-01 09:47:09 -03:00
Edresson Casanova	1a60767d83	Add max_audio_length parameter	2023-11-27 12:10:43 -03:00
Edresson Casanova	ceb8b05abe	Update	2023-11-27 11:16:41 -03:00
Edresson Casanova	e6c51e3666	Add intuitive error messages	2023-11-27 10:53:43 -03:00
Edresson Casanova	c5cb7eb791	Add erros messages	2023-11-27 10:41:09 -03:00
Edresson Casanova	eaa5355c91	Add parameters to be able to set then on colab demo	2023-11-27 10:01:48 -03:00
Edresson Casanova	335b8c37b3	Update gradio demo	2023-11-24 16:31:14 -03:00
Edresson Casanova	70f2cb9c0e	Update gradio demo	2023-11-24 15:53:34 -03:00
Edresson Casanova	c76fb856d1	Update gradio demo	2023-11-24 15:40:35 -03:00
Edresson Casanova	8967fc7ef2	Update gradio demo	2023-11-24 14:26:26 -03:00
Edresson Casanova	af74cd4426	Bug fix on XTTS inference	2023-11-24 12:07:00 -03:00
Edresson Casanova	3fc2880127	Convert stereo to mono	2023-11-24 10:25:24 -03:00
Edresson Casanova	fa9bb26ebb	Update demo	2023-11-24 10:22:12 -03:00
Edresson Casanova	626d9e16fb	Fix demo freezing issue	2023-11-24 08:44:21 -03:00
Edresson Casanova	7cc348ed76	Uses tabs instead of columns	2023-11-23 17:50:41 -03:00
Edresson Casanova	cc4f37e1b0	Add training and inference columns	2023-11-23 16:30:49 -03:00
Edresson Casanova	774c4c1743	Add XTTS FT demo data processing pipeline	2023-11-22 18:11:52 -03:00
Eren Gölge	c011ab7455	Update to v0.20.6	2023-11-17 15:16:32 +01:00
Eren G??lge	52cb1e2f68	Update model hash for v2.0.2	2023-11-17 15:16:32 +01:00
Edresson Casanova	6075fa208c	Ensures that only GPT model is in training mode during XTTS GPT training (#3241 ) * Ensures that only GPT model is in training mode during training * Fix parallel wavegan unit test	2023-11-17 15:15:22 +01:00
Eren G??lge	a3279f9294	Make style	2023-11-17 15:15:22 +01:00
Eren G??lge	f21067a84a	Make k_diffusion optional	2023-11-17 15:15:21 +01:00
Julian Weber	fbc18b8c34	Fix zh bug (#3238 )	2023-11-16 17:51:37 +01:00
Julian Weber	675f983550	Add sentence splitting (#3227 ) * Add sentence spliting * update requirements * update default args v2 * Add spanish * Fix return gpt_latents * Update requirements * Fix requirements	2023-11-16 11:01:11 +01:00
Enno Hermann	3c2d5a9e03	Remove duplicate AudioProcessor code and fix ExtractTTSpectrogram.ipynb (#3230 ) * chore: remove unused argument * refactor(audio.processor): remove duplicate stft+griffin_lim * chore(audio.processor): remove unused compute_stft_paddings Same function available in numpy_transforms * refactor(audio.processor): remove duplicate db_to_amp * refactor(audio.processor): remove duplicate amp_to_db * refactor(audio.processor): remove duplicate linear_to_mel * refactor(audio.processor): remove duplicate mel_to_linear * refactor(audio.processor): remove duplicate build_mel_basis * refactor(audio.processor): remove duplicate stft_parameters * refactor(audio.processor): use pre-/deemphasis from numpy_transforms * refactor(audio.processor): use rms_volume_norm from numpy_transforms * chore(audio.processor): remove duplicate assert Already checked in numpy_transforms.compute_f0 * refactor(audio.processor): use find_endpoint from numpy_transforms * refactor(audio.processor): use trim_silence from numpy_transforms * refactor(audio.processor): use volume_norm from numpy_transforms * refactor(audio.processor): use load_wav from numpy_transforms * fix(bin.extract_tts_spectrograms): set quantization bits * fix(ExtractTTSpectrogram.ipynb): adapt to current TTS code Fixes #2447, #2574 * refactor(audio.processor): remove duplicate quantization methods	2023-11-16 10:57:06 +01:00
Eren Gölge	88630c60e5	Update to v0.20.5	2023-11-15 14:02:51 +01:00
Edresson Casanova	73a5bd08c0	Fix XTTS GPT padding and inference issues (#3216 ) * Fix end artifact for fine tuning models * Bug fix on zh-cn inference * Remove ununsed code	2023-11-15 14:02:05 +01:00
Julian Weber	04901fb2e4	Add speed control for inference (#3214 ) * Add speed control for inference * Fix XTTS tests * Add speed control tests	2023-11-14 16:07:17 +01:00
Eren Gölge	d96f3885d5	Update to v0.20.4	2023-11-13 17:07:25 +01:00
Eren Gölge	ac3df409a6	Merge pull request #3208 from coqui-ai/fix_max_mel_len fix max generation length for XTTS	2023-11-13 14:32:56 +01:00
Eren G??lge	92fa988aec	Fixup	2023-11-13 13:44:06 +01:00
WeberJulian	b85536b23f	fix max generation length	2023-11-13 13:18:45 +01:00
Eren G??lge	b2682d39c5	Make style	2023-11-13 13:01:01 +01:00
Eren G??lge	a16360af85	Implement chunking gpt_cond	2023-11-13 13:00:08 +01:00
Eren Gölge	6f1cba2f81	Update to v0.20.3	2023-11-09 17:41:37 +01:00
Enno Hermann	3b1e7038bc	fix(formatters): set missing root_path attribute (#3182 ) Fixes #2778	2023-11-09 16:49:52 +01:00
Aarni Koskela	a8e9163fb3	xtts/tokenizer: merge duplicate implementations of preprocess_text (#3170 ) This was found via ruff: > F811 Redefinition of unused `preprocess_text` from line 570	2023-11-09 16:32:12 +01:00
Matthew Boakes	1b9c400bca	PyTorch 2.1 Updates (Weight Norm and TorchAudio I/O) (#3176 ) * Replaced PyTorch weight_norm With parametrizations.weight_norm * TorchAudio: Migrating The I/O Functions To Use The Dispatcher Mechanism * Corrected Code Style --------- Co-authored-by: Eren Gölge <erogol@hotmail.com>	2023-11-09 16:31:03 +01:00
Gorkem	66a1e248d0	torchaudio should use proper backend to load audio (#3179 )	2023-11-09 16:28:39 +01:00
Eren Gölge	46d9c27212	Update to v0.20.2	2023-11-08 16:07:56 +01:00
Julian Weber	03ad90135b	Add lang code in XTTS doc (#3158 ) * Add lang code in XTTS doc * Remove ununsed config and args * update docs * woops	2023-11-08 13:47:33 +01:00
Gorkem	78a596618a	Fix for exception on streaming if last chunk empty (#3160 )	2023-11-08 11:32:02 +01:00
Enno Hermann	99edd6daa3	Fix ModelManager.list_models() (#3128 ) * fix(utils.manage): remove hard-coded model_type variable * refactor(utils.manage): address lint issues, fix typos Addressed the following: TTS/utils/manage.py:307:12: R1705: Unnecessary "else" after "return" (no-else-return) TTS/utils/manage.py:308:21: W1514: Using open without explicitly specifying an encoding (unspecified-encoding) TTS/utils/manage.py:299:4: R1710: Either all return statements in a function should return an expression, or none of them should. (inconsistent-return-statements) TTS/utils/manage.py:299:4: R0201: Method could be a function (no-self-use) TTS/utils/manage.py:314:4: R0201: Method could be a function (no-self-use)	2023-11-08 11:29:01 +01:00
Eren Gölge	77b18126c7	Merge pull request #3126 from akx/freevc-config-module Move FreeVCConfig to TTS.vc.configs (like all other config classes)	2023-11-08 11:24:47 +01:00
Eren Gölge	cc6e9fcaa7	Fix #3153 (#3169 )	2023-11-08 11:13:58 +01:00
Eren Gölge	a24ebcd8a6	Fix coqui api (#3168 )	2023-11-08 10:51:23 +01:00
Julian Weber	ce1a39a9a4	Add char limit warn (#3130 ) * Add char limit warning * Adding v2 langs * cached_property for cutlet * Fix import	2023-11-08 10:24:23 +01:00

1 2 3 4 5 ...

1938 Commits