Enno Hermann
aa0fbdf27e
fix(bin.extract_tts_spectrograms): set quantization bits
2023-11-15 16:50:59 +01:00
Enno Hermann
13e640f17e
refactor(audio.processor): use load_wav from numpy_transforms
2023-11-15 16:50:59 +01:00
Enno Hermann
9a43eafd60
refactor(audio.processor): use volume_norm from numpy_transforms
2023-11-15 16:33:13 +01:00
Enno Hermann
0a0e7a3bae
refactor(audio.processor): use trim_silence from numpy_transforms
2023-11-15 16:33:13 +01:00
Enno Hermann
842a632cd5
refactor(audio.processor): use find_endpoint from numpy_transforms
2023-11-15 16:33:13 +01:00
Enno Hermann
5232bf9e36
chore(audio.processor): remove duplicate assert
...
Already checked in numpy_transforms.compute_f0
2023-11-15 16:33:13 +01:00
Enno Hermann
b620092865
refactor(audio.processor): use rms_volume_norm from numpy_transforms
2023-11-15 16:33:13 +01:00
Enno Hermann
11e98d3dac
refactor(audio.processor): use pre-/deemphasis from numpy_transforms
2023-11-15 16:33:13 +01:00
Enno Hermann
f37cc4c028
refactor(audio.processor): remove duplicate stft_parameters
2023-11-15 16:33:13 +01:00
Enno Hermann
da229f3912
refactor(audio.processor): remove duplicate build_mel_basis
2023-11-15 16:33:13 +01:00
Enno Hermann
754877784b
refactor(audio.processor): remove duplicate mel_to_linear
2023-11-15 16:33:08 +01:00
Enno Hermann
fd9d6d4b0f
refactor(audio.processor): remove duplicate linear_to_mel
2023-11-15 16:32:13 +01:00
Enno Hermann
4fd5c46937
refactor(audio.processor): remove duplicate amp_to_db
2023-11-15 16:32:13 +01:00
Enno Hermann
794f41c611
refactor(audio.processor): remove duplicate db_to_amp
2023-11-15 16:32:13 +01:00
Enno Hermann
5a5da76260
chore(audio.processor): remove unused compute_stft_paddings
...
Same function available in numpy_transforms
2023-11-15 16:32:13 +01:00
Enno Hermann
d75879802a
refactor(audio.processor): remove duplicate stft+griffin_lim
2023-11-15 16:32:06 +01:00
Enno Hermann
8fa4de1c8c
chore: remove unused argument
2023-11-13 21:43:06 +01:00
Eren Gölge
6f1cba2f81
Update to v0.20.3
2023-11-09 17:41:37 +01:00
Enno Hermann
3b1e7038bc
fix(formatters): set missing root_path attribute ( #3182 )
...
Fixes #2778
2023-11-09 16:49:52 +01:00
Aarni Koskela
a8e9163fb3
xtts/tokenizer: merge duplicate implementations of preprocess_text ( #3170 )
...
This was found via ruff:
> F811 Redefinition of unused `preprocess_text` from line 570
2023-11-09 16:32:12 +01:00
Matthew Boakes
1b9c400bca
PyTorch 2.1 Updates (Weight Norm and TorchAudio I/O) ( #3176 )
...
* Replaced PyTorch weight_norm With parametrizations.weight_norm
* TorchAudio: Migrating The I/O Functions To Use The Dispatcher Mechanism
* Corrected Code Style
---------
Co-authored-by: Eren Gölge <erogol@hotmail.com>
2023-11-09 16:31:03 +01:00
Gorkem
66a1e248d0
torchaudio should use proper backend to load audio ( #3179 )
2023-11-09 16:28:39 +01:00
Eren Gölge
46d9c27212
Update to v0.20.2
2023-11-08 16:07:56 +01:00
Julian Weber
03ad90135b
Add lang code in XTTS doc ( #3158 )
...
* Add lang code in XTTS doc
* Remove unused config and args
* update docs
* woops
2023-11-08 13:47:33 +01:00
Gorkem
78a596618a
Fix for exception on streaming if last chunk empty ( #3160 )
2023-11-08 11:32:02 +01:00
Enno Hermann
99edd6daa3
Fix ModelManager.list_models() ( #3128 )
...
* fix(utils.manage): remove hard-coded model_type variable
* refactor(utils.manage): address lint issues, fix typos
Addressed the following:
TTS/utils/manage.py:307:12: R1705: Unnecessary "else" after "return" (no-else-return)
TTS/utils/manage.py:308:21: W1514: Using open without explicitly specifying an encoding (unspecified-encoding)
TTS/utils/manage.py:299:4: R1710: Either all return statements in a function should return an expression, or none of them should. (inconsistent-return-statements)
TTS/utils/manage.py:299:4: R0201: Method could be a function (no-self-use)
TTS/utils/manage.py:314:4: R0201: Method could be a function (no-self-use)
2023-11-08 11:29:01 +01:00
Eren Gölge
77b18126c7
Merge pull request #3126 from akx/freevc-config-module
...
Move FreeVCConfig to TTS.vc.configs (like all other config classes)
2023-11-08 11:24:47 +01:00
Eren Gölge
cc6e9fcaa7
Fix #3153 ( #3169 )
2023-11-08 11:13:58 +01:00
Eren Gölge
a24ebcd8a6
Fix coqui api ( #3168 )
2023-11-08 10:51:23 +01:00
Julian Weber
ce1a39a9a4
Add char limit warn ( #3130 )
...
* Add char limit warning
* Adding v2 langs
* cached_property for cutlet
* Fix import
2023-11-08 10:24:23 +01:00
Eren Gölge
f846a9f300
Update to v0.20.1
2023-11-07 14:17:36 +01:00
Edresson Casanova
cbdbc44e0f
Fix XTTS v2.0 training recipe ( #3154 )
...
* Fix XTTS v2.0 training recipe
* Update XTTS v2 model hash
2023-11-07 14:16:44 +01:00
Edresson Casanova
5f9ab6cfaa
Fix style
...
Co-authored-by: Aarni Koskela <akx@iki.fi>
2023-11-06 19:22:34 -03:00
Edresson Casanova
2470599d18
Drop XTTS v1
2023-11-06 19:12:04 -03:00
Edresson Casanova
13243df526
Update XTTS v1.1 files
2023-11-06 19:10:21 -03:00
Edresson Casanova
09fb317e6d
Remove unused code
2023-11-06 17:36:32 -03:00
Edresson Casanova
b146de4ce8
Bug fix on XTTS v2.0 Trainer
2023-11-06 20:26:01 +01:00
Edresson Casanova
1b6f8d0e46
Update unit tests and recipes
2023-11-06 20:25:06 +01:00
Edresson Casanova
72b2bac0f8
Load reference in 24khz to avoid issues with multiple sr references
2023-11-06 20:25:06 +01:00
Edresson Casanova
00294ffdf6
Update XTTS docs
2023-11-06 20:24:06 +01:00
Edresson Casanova
459ad70dc8
Add support for multiple speaker references on XTTS inference
2023-11-06 20:22:35 +01:00
Eren Gölge
f0cb19ecca
Drop diffusion from XTTS ( #3150 )
...
* Drop diffusion for XTTS
* Make style
* Drop diffusion deps in code
* Restore thrashed
2023-11-06 20:15:49 +01:00
Eren Gölge
5d418bb84a
Update docs
2023-11-06 18:48:41 +01:00
Eren Gölge
9bbf6eb8dd
Drop use_ne_hifigan
2023-11-06 18:43:38 +01:00
Eren Gölge
9d54bd7655
Fixup XTTS
2023-11-06 18:13:58 +01:00
Eren Gölge
c713a839da
Update VERSION
2023-11-06 15:51:56 +01:00
Edresson Casanova
e45227d9ff
XTTS v2.0 ( #3137 )
...
* Implement most similar ref training approach
* Use non-enhanced hifigan for test samples
* Add Perceiver
* Update GPT Trainer for perceiver support
* Update XTTS docs
* Bug fix masking with XTTS perceiver
* Bug fix on gpt forward
* Bug Fix on XTTS v2.0 training
* Add XTTS v2.0 unit tests
* Add XTTS v2.0 inference unit tests
* Bug Fix on diffusion inference
* Add XTTS v2.0 training recipe
* Placeholder model entry
* Add cloning params to config
* Make prompt embedding configurable
* Make cloning configurable
* Cheap fix for a cheaper fix
* Prevent resampling
* Update model entry
* Update docs
* Update requirements
* Code linting
* Add xtts v2 to sep tests
* Bug fix on XTTS get_gpt_cond_latents
* Bug fix on rebase
* Make style
* Bug fix in Japanese tokenizer
* Add num2words to deps
* Remove unused kwarg and added num_beams=1 as default
---------
Co-authored-by: Eren Gölge <egolge@coqui.ai>
2023-11-06 14:58:18 +01:00
Aarni Koskela
38f6f8f0bb
Run `make style` & re-enable it in CI ( #3127 )
2023-11-06 11:36:37 +01:00
Aarni Koskela
5ae369d629
Move FreeVCConfig to TTS.vc.configs (like all other config classes)
2023-10-31 16:56:25 +02:00
Eren Gölge
6fef4f9067
Bump up to v0.19.1
2023-10-30 10:37:28 +01:00