Edresson Casanova
e9a2c0606a
Add gc.collect()
2023-12-01 15:37:09 -03:00
Edresson Casanova
490af290d3
Delete unused variables
2023-12-01 15:21:33 -03:00
Edresson Casanova
eb18b27afc
Delete trainer to freeze memory
2023-12-01 14:07:33 -03:00
Edresson Casanova
5dd217a759
Update XTTS finetuner docs
2023-12-01 09:47:09 -03:00
Edresson Casanova
1a60767d83
Add max_audio_length parameter
2023-11-27 12:10:43 -03:00
Edresson Casanova
ceb8b05abe
Update
2023-11-27 11:16:41 -03:00
Edresson Casanova
e6c51e3666
Add intuitive error messages
2023-11-27 10:53:43 -03:00
Edresson Casanova
c5cb7eb791
Add erros messages
2023-11-27 10:41:09 -03:00
Edresson Casanova
eaa5355c91
Add parameters to be able to set then on colab demo
2023-11-27 10:01:48 -03:00
Edresson Casanova
335b8c37b3
Update gradio demo
2023-11-24 16:31:14 -03:00
Edresson Casanova
70f2cb9c0e
Update gradio demo
2023-11-24 15:53:34 -03:00
Edresson Casanova
c76fb856d1
Update gradio demo
2023-11-24 15:40:35 -03:00
Edresson Casanova
8967fc7ef2
Update gradio demo
2023-11-24 14:26:26 -03:00
Edresson Casanova
af74cd4426
Bug fix on XTTS inference
2023-11-24 12:07:00 -03:00
Edresson Casanova
3fc2880127
Convert stereo to mono
2023-11-24 10:25:24 -03:00
Edresson Casanova
fa9bb26ebb
Update demo
2023-11-24 10:22:12 -03:00
Edresson Casanova
626d9e16fb
Fix demo freezing issue
2023-11-24 08:44:21 -03:00
Edresson Casanova
7cc348ed76
Uses tabs instead of columns
2023-11-23 17:50:41 -03:00
Edresson Casanova
cc4f37e1b0
Add training and inference columns
2023-11-23 16:30:49 -03:00
Edresson Casanova
774c4c1743
Add XTTS FT demo data processing pipeline
2023-11-22 18:11:52 -03:00
Eren Gölge
c011ab7455
Update to v0.20.6
2023-11-17 15:16:32 +01:00
Eren G??lge
52cb1e2f68
Update model hash for v2.0.2
2023-11-17 15:16:32 +01:00
Edresson Casanova
6075fa208c
Ensures that only GPT model is in training mode during XTTS GPT training ( #3241 )
...
* Ensures that only GPT model is in training mode during training
* Fix parallel wavegan unit test
2023-11-17 15:15:22 +01:00
Eren G??lge
a3279f9294
Make style
2023-11-17 15:15:22 +01:00
Eren G??lge
f21067a84a
Make k_diffusion optional
2023-11-17 15:15:21 +01:00
Julian Weber
fbc18b8c34
Fix zh bug ( #3238 )
2023-11-16 17:51:37 +01:00
Julian Weber
675f983550
Add sentence splitting ( #3227 )
...
* Add sentence spliting
* update requirements
* update default args v2
* Add spanish
* Fix return gpt_latents
* Update requirements
* Fix requirements
2023-11-16 11:01:11 +01:00
Enno Hermann
3c2d5a9e03
Remove duplicate AudioProcessor code and fix ExtractTTSpectrogram.ipynb ( #3230 )
...
* chore: remove unused argument
* refactor(audio.processor): remove duplicate stft+griffin_lim
* chore(audio.processor): remove unused compute_stft_paddings
Same function available in numpy_transforms
* refactor(audio.processor): remove duplicate db_to_amp
* refactor(audio.processor): remove duplicate amp_to_db
* refactor(audio.processor): remove duplicate linear_to_mel
* refactor(audio.processor): remove duplicate mel_to_linear
* refactor(audio.processor): remove duplicate build_mel_basis
* refactor(audio.processor): remove duplicate stft_parameters
* refactor(audio.processor): use pre-/deemphasis from numpy_transforms
* refactor(audio.processor): use rms_volume_norm from numpy_transforms
* chore(audio.processor): remove duplicate assert
Already checked in numpy_transforms.compute_f0
* refactor(audio.processor): use find_endpoint from numpy_transforms
* refactor(audio.processor): use trim_silence from numpy_transforms
* refactor(audio.processor): use volume_norm from numpy_transforms
* refactor(audio.processor): use load_wav from numpy_transforms
* fix(bin.extract_tts_spectrograms): set quantization bits
* fix(ExtractTTSpectrogram.ipynb): adapt to current TTS code
Fixes #2447 , #2574
* refactor(audio.processor): remove duplicate quantization methods
2023-11-16 10:57:06 +01:00
Eren Gölge
88630c60e5
Update to v0.20.5
2023-11-15 14:02:51 +01:00
Edresson Casanova
73a5bd08c0
Fix XTTS GPT padding and inference issues ( #3216 )
...
* Fix end artifact for fine tuning models
* Bug fix on zh-cn inference
* Remove ununsed code
2023-11-15 14:02:05 +01:00
Julian Weber
04901fb2e4
Add speed control for inference ( #3214 )
...
* Add speed control for inference
* Fix XTTS tests
* Add speed control tests
2023-11-14 16:07:17 +01:00
Eren Gölge
d96f3885d5
Update to v0.20.4
2023-11-13 17:07:25 +01:00
Eren Gölge
ac3df409a6
Merge pull request #3208 from coqui-ai/fix_max_mel_len
...
fix max generation length for XTTS
2023-11-13 14:32:56 +01:00
Eren G??lge
92fa988aec
Fixup
2023-11-13 13:44:06 +01:00
WeberJulian
b85536b23f
fix max generation length
2023-11-13 13:18:45 +01:00
Eren G??lge
b2682d39c5
Make style
2023-11-13 13:01:01 +01:00
Eren G??lge
a16360af85
Implement chunking gpt_cond
2023-11-13 13:00:08 +01:00
Eren Gölge
6f1cba2f81
Update to v0.20.3
2023-11-09 17:41:37 +01:00
Enno Hermann
3b1e7038bc
fix(formatters): set missing root_path attribute ( #3182 )
...
Fixes #2778
2023-11-09 16:49:52 +01:00
Aarni Koskela
a8e9163fb3
xtts/tokenizer: merge duplicate implementations of preprocess_text ( #3170 )
...
This was found via ruff:
> F811 Redefinition of unused `preprocess_text` from line 570
2023-11-09 16:32:12 +01:00
Matthew Boakes
1b9c400bca
PyTorch 2.1 Updates (Weight Norm and TorchAudio I/O) ( #3176 )
...
* Replaced PyTorch weight_norm With parametrizations.weight_norm
* TorchAudio: Migrating The I/O Functions To Use The Dispatcher Mechanism
* Corrected Code Style
---------
Co-authored-by: Eren Gölge <erogol@hotmail.com>
2023-11-09 16:31:03 +01:00
Gorkem
66a1e248d0
torchaudio should use proper backend to load audio ( #3179 )
2023-11-09 16:28:39 +01:00
Eren Gölge
46d9c27212
Update to v0.20.2
2023-11-08 16:07:56 +01:00
Julian Weber
03ad90135b
Add lang code in XTTS doc ( #3158 )
...
* Add lang code in XTTS doc
* Remove ununsed config and args
* update docs
* woops
2023-11-08 13:47:33 +01:00
Gorkem
78a596618a
Fix for exception on streaming if last chunk empty ( #3160 )
2023-11-08 11:32:02 +01:00
Enno Hermann
99edd6daa3
Fix ModelManager.list_models() ( #3128 )
...
* fix(utils.manage): remove hard-coded model_type variable
* refactor(utils.manage): address lint issues, fix typos
Addressed the following:
TTS/utils/manage.py:307:12: R1705: Unnecessary "else" after "return" (no-else-return)
TTS/utils/manage.py:308:21: W1514: Using open without explicitly specifying an encoding (unspecified-encoding)
TTS/utils/manage.py:299:4: R1710: Either all return statements in a function should return an expression, or none of them should. (inconsistent-return-statements)
TTS/utils/manage.py:299:4: R0201: Method could be a function (no-self-use)
TTS/utils/manage.py:314:4: R0201: Method could be a function (no-self-use)
2023-11-08 11:29:01 +01:00
Eren Gölge
77b18126c7
Merge pull request #3126 from akx/freevc-config-module
...
Move FreeVCConfig to TTS.vc.configs (like all other config classes)
2023-11-08 11:24:47 +01:00
Eren Gölge
cc6e9fcaa7
Fix #3153 ( #3169 )
2023-11-08 11:13:58 +01:00
Eren Gölge
a24ebcd8a6
Fix coqui api ( #3168 )
2023-11-08 10:51:23 +01:00
Julian Weber
ce1a39a9a4
Add char limit warn ( #3130 )
...
* Add char limit warning
* Adding v2 langs
* cached_property for cutlet
* Fix import
2023-11-08 10:24:23 +01:00