Edresson Casanova
c5cb7eb791
Add error messages
2023-11-27 10:41:09 -03:00
Edresson Casanova
eaa5355c91
Add parameters to be able to set them on colab demo
2023-11-27 10:01:48 -03:00
Edresson Casanova
335b8c37b3
Update gradio demo
2023-11-24 16:31:14 -03:00
Edresson Casanova
70f2cb9c0e
Update gradio demo
2023-11-24 15:53:34 -03:00
Edresson Casanova
c76fb856d1
Update gradio demo
2023-11-24 15:40:35 -03:00
Edresson Casanova
8967fc7ef2
Update gradio demo
2023-11-24 14:26:26 -03:00
Edresson Casanova
af74cd4426
Bug fix on XTTS inference
2023-11-24 12:07:00 -03:00
Edresson Casanova
3fc2880127
Convert stereo to mono
2023-11-24 10:25:24 -03:00
Edresson Casanova
fa9bb26ebb
Update demo
2023-11-24 10:22:12 -03:00
Edresson Casanova
626d9e16fb
Fix demo freezing issue
2023-11-24 08:44:21 -03:00
Edresson Casanova
7cc348ed76
Uses tabs instead of columns
2023-11-23 17:50:41 -03:00
Edresson Casanova
cc4f37e1b0
Add training and inference columns
2023-11-23 16:30:49 -03:00
Edresson Casanova
774c4c1743
Add XTTS FT demo data processing pipeline
2023-11-22 18:11:52 -03:00
Eren Gölge
29dede20d3
Merge pull request #3249 from coqui-ai/run_ci_for_v0.20.6
...
Run CI for v0.20.6
2023-11-17 15:45:26 +01:00
Eren Gölge
c011ab7455
Update to v0.20.6
2023-11-17 15:16:32 +01:00
Eren Gölge
52cb1e2f68
Update model hash for v2.0.2
2023-11-17 15:16:32 +01:00
Edresson Casanova
6075fa208c
Ensures that only GPT model is in training mode during XTTS GPT training (#3241)
...
* Ensures that only GPT model is in training mode during training
* Fix parallel wavegan unit test
2023-11-17 15:15:22 +01:00
Eren Gölge
a3279f9294
Make style
2023-11-17 15:15:22 +01:00
Eren Gölge
f21067a84a
Make k_diffusion optional
2023-11-17 15:15:21 +01:00
Eren Gölge
44494daa27
Update CI version
2023-11-17 15:15:21 +01:00
Eren Gölge
c864acf2b7
Update versions
2023-11-17 15:15:21 +01:00
Edresson Casanova
11283fce07
Ensures that only GPT model is in training mode during XTTS GPT training (#3241)
...
* Ensures that only GPT model is in training mode during training
* Fix parallel wavegan unit test
2023-11-17 15:13:46 +01:00
Eren Gölge
14579a4607
Merge pull request #3248 from coqui-ai/slacker_deps
...
Update versions
2023-11-17 15:13:19 +01:00
Eren Gölge
44880f09ed
Make style
2023-11-17 13:43:34 +01:00
Eren Gölge
26efdf6ee7
Make k_diffusion optional
2023-11-17 13:42:33 +01:00
Eren Gölge
08d11e9198
Update CI version
2023-11-17 13:01:32 +01:00
Eren Gölge
63d7145647
Update versions
2023-11-17 12:10:46 +01:00
Eren Gölge
7e4375da2b
Update to v0.20.6
2023-11-16 17:52:13 +01:00
Julian Weber
fbc18b8c34
Fix zh bug (#3238)
2023-11-16 17:51:37 +01:00
Julian Weber
675f983550
Add sentence splitting (#3227)
...
* Add sentence splitting
* update requirements
* update default args v2
* Add spanish
* Fix return gpt_latents
* Update requirements
* Fix requirements
2023-11-16 11:01:11 +01:00
Enno Hermann
3c2d5a9e03
Remove duplicate AudioProcessor code and fix ExtractTTSpectrogram.ipynb (#3230)
...
* chore: remove unused argument
* refactor(audio.processor): remove duplicate stft+griffin_lim
* chore(audio.processor): remove unused compute_stft_paddings
Same function available in numpy_transforms
* refactor(audio.processor): remove duplicate db_to_amp
* refactor(audio.processor): remove duplicate amp_to_db
* refactor(audio.processor): remove duplicate linear_to_mel
* refactor(audio.processor): remove duplicate mel_to_linear
* refactor(audio.processor): remove duplicate build_mel_basis
* refactor(audio.processor): remove duplicate stft_parameters
* refactor(audio.processor): use pre-/deemphasis from numpy_transforms
* refactor(audio.processor): use rms_volume_norm from numpy_transforms
* chore(audio.processor): remove duplicate assert
Already checked in numpy_transforms.compute_f0
* refactor(audio.processor): use find_endpoint from numpy_transforms
* refactor(audio.processor): use trim_silence from numpy_transforms
* refactor(audio.processor): use volume_norm from numpy_transforms
* refactor(audio.processor): use load_wav from numpy_transforms
* fix(bin.extract_tts_spectrograms): set quantization bits
* fix(ExtractTTSpectrogram.ipynb): adapt to current TTS code
Fixes #2447, #2574
* refactor(audio.processor): remove duplicate quantization methods
2023-11-16 10:57:06 +01:00
Eren Gölge
88630c60e5
Update to v0.20.5
2023-11-15 14:02:51 +01:00
Edresson Casanova
73a5bd08c0
Fix XTTS GPT padding and inference issues (#3216)
...
* Fix end artifact for fine tuning models
* Bug fix on zh-cn inference
* Remove unused code
2023-11-15 14:02:05 +01:00
Ikko Eltociear Ashimine
15f0ac57d6
Update README.md (#3215)
...
Dicord -> Discord
2023-11-15 13:59:56 +01:00
Julian Weber
04901fb2e4
Add speed control for inference (#3214)
...
* Add speed control for inference
* Fix XTTS tests
* Add speed control tests
2023-11-14 16:07:17 +01:00
Eren Gölge
d96f3885d5
Update to v0.20.4
2023-11-13 17:07:25 +01:00
Eren Gölge
ac3df409a6
Merge pull request #3208 from coqui-ai/fix_max_mel_len
...
fix max generation length for XTTS
2023-11-13 14:32:56 +01:00
Eren Gölge
f32a465711
Merge pull request #3207 from coqui-ai/update_xtts_cloning
...
Update XTTS cloning
2023-11-13 14:32:43 +01:00
Eren Gölge
92fa988aec
Fixup
2023-11-13 13:44:06 +01:00
WeberJulian
b85536b23f
fix max generation length
2023-11-13 13:18:45 +01:00
Eren Gölge
b2682d39c5
Make style
2023-11-13 13:01:01 +01:00
Eren Gölge
a16360af85
Implement chunking gpt_cond
2023-11-13 13:00:08 +01:00
Eren Gölge
6f1cba2f81
Update to v0.20.3
2023-11-09 17:41:37 +01:00
Enno Hermann
3b1e7038bc
fix(formatters): set missing root_path attribute (#3182)
...
Fixes #2778
2023-11-09 16:49:52 +01:00
Aarni Koskela
a8e9163fb3
xtts/tokenizer: merge duplicate implementations of preprocess_text (#3170)
...
This was found via ruff:
> F811 Redefinition of unused `preprocess_text` from line 570
2023-11-09 16:32:12 +01:00
Matthew Boakes
1b9c400bca
PyTorch 2.1 Updates (Weight Norm and TorchAudio I/O) (#3176)
...
* Replaced PyTorch weight_norm With parametrizations.weight_norm
* TorchAudio: Migrating The I/O Functions To Use The Dispatcher Mechanism
* Corrected Code Style
---------
Co-authored-by: Eren Gölge <erogol@hotmail.com>
2023-11-09 16:31:03 +01:00
Gorkem
66a1e248d0
torchaudio should use proper backend to load audio (#3179)
2023-11-09 16:28:39 +01:00
Eren Gölge
46d9c27212
Update to v0.20.2
2023-11-08 16:07:56 +01:00
Julian Weber
58cb0d8dd0
Remove v1 doc and tests (#3172)
...
* remove v1 in inference.md
* remove v1 in README.md
* Update test_models.py
2023-11-08 14:51:42 +01:00
Julian Weber
03ad90135b
Add lang code in XTTS doc (#3158)
...
* Add lang code in XTTS doc
* Remove unused config and args
* update docs
* woops
2023-11-08 13:47:33 +01:00