Commit Graph

4573 Commits

Author SHA1 Message Date
Enno Hermann 8f1db7510a refactor(audio.processor): remove duplicate quantization methods 2023-11-15 19:26:35 +01:00
Enno Hermann ddbaecdb5b fix(ExtractTTSpectrogram.ipynb): adapt to current TTS code
Fixes #2447, #2574
2023-11-15 16:50:59 +01:00
Enno Hermann aa0fbdf27e fix(bin.extract_tts_spectrograms): set quantization bits 2023-11-15 16:50:59 +01:00
Enno Hermann 13e640f17e refactor(audio.processor): use load_wav from numpy_transforms 2023-11-15 16:50:59 +01:00
Enno Hermann 9a43eafd60 refactor(audio.processor): use volume_norm from numpy_transforms 2023-11-15 16:33:13 +01:00
Enno Hermann 0a0e7a3bae refactor(audio.processor): use trim_silence from numpy_transforms 2023-11-15 16:33:13 +01:00
Enno Hermann 842a632cd5 refactor(audio.processor): use find_endpoint from numpy_transforms 2023-11-15 16:33:13 +01:00
Enno Hermann 5232bf9e36 chore(audio.processor): remove duplicate assert
Already checked in numpy_transforms.compute_f0
2023-11-15 16:33:13 +01:00
Enno Hermann b620092865 refactor(audio.processor): use rms_volume_norm from numpy_transforms 2023-11-15 16:33:13 +01:00
Enno Hermann 11e98d3dac refactor(audio.processor): use pre-/deemphasis from numpy_transforms 2023-11-15 16:33:13 +01:00
Enno Hermann f37cc4c028 refactor(audio.processor): remove duplicate stft_parameters 2023-11-15 16:33:13 +01:00
Enno Hermann da229f3912 refactor(audio.processor): remove duplicate build_mel_basis 2023-11-15 16:33:13 +01:00
Enno Hermann 754877784b refactor(audio.processor): remove duplicate mel_to_linear 2023-11-15 16:33:08 +01:00
Enno Hermann fd9d6d4b0f refactor(audio.processor): remove duplicate linear_to_mel 2023-11-15 16:32:13 +01:00
Enno Hermann 4fd5c46937 refactor(audio.processor): remove duplicate amp_to_db 2023-11-15 16:32:13 +01:00
Enno Hermann 794f41c611 refactor(audio.processor): remove duplicate db_to_amp 2023-11-15 16:32:13 +01:00
Enno Hermann 5a5da76260 chore(audio.processor): remove unused compute_stft_paddings
Same function available in numpy_transforms
2023-11-15 16:32:13 +01:00
Enno Hermann d75879802a refactor(audio.processor): remove duplicate stft+griffin_lim 2023-11-15 16:32:06 +01:00
Enno Hermann 8fa4de1c8c chore: remove unused argument 2023-11-13 21:43:06 +01:00
Eren Gölge 6f1cba2f81
Update to v0.20.3 2023-11-09 17:41:37 +01:00
Enno Hermann 3b1e7038bc
fix(formatters): set missing root_path attribute (#3182)
Fixes #2778
2023-11-09 16:49:52 +01:00
Aarni Koskela a8e9163fb3
xtts/tokenizer: merge duplicate implementations of preprocess_text (#3170)
This was found via ruff:

> F811 Redefinition of unused `preprocess_text` from line 570
2023-11-09 16:32:12 +01:00
Matthew Boakes 1b9c400bca
PyTorch 2.1 Updates (Weight Norm and TorchAudio I/O) (#3176)
* Replaced PyTorch weight_norm With parametrizations.weight_norm

* TorchAudio: Migrating The I/O Functions To Use The Dispatcher Mechanism

* Corrected Code Style

---------

Co-authored-by: Eren Gölge <erogol@hotmail.com>
2023-11-09 16:31:03 +01:00
Gorkem 66a1e248d0
torchaudio should use proper backend to load audio (#3179) 2023-11-09 16:28:39 +01:00
Eren Gölge 46d9c27212
Update to v0.20.2 2023-11-08 16:07:56 +01:00
Julian Weber 58cb0d8dd0
Remove v1 doc and tests (#3172)
* remove v1 in inference.md

* remove v1 in README.md

* Update test_models.py
2023-11-08 14:51:42 +01:00
Julian Weber 03ad90135b
Add lang code in XTTS doc (#3158)
* Add lang code in XTTS doc

* Remove unused config and args

* update docs

* woops
2023-11-08 13:47:33 +01:00
Gorkem 78a596618a
Fix for exception on streaming if last chunk empty (#3160) 2023-11-08 11:32:02 +01:00
Enno Hermann 99edd6daa3
Fix ModelManager.list_models() (#3128)
* fix(utils.manage): remove hard-coded model_type variable

* refactor(utils.manage): address lint issues, fix typos

Addressed the following:
TTS/utils/manage.py:307:12: R1705: Unnecessary "else" after "return" (no-else-return)
TTS/utils/manage.py:308:21: W1514: Using open without explicitly specifying an encoding (unspecified-encoding)
TTS/utils/manage.py:299:4: R1710: Either all return statements in a function should return an expression, or none of them should. (inconsistent-return-statements)
TTS/utils/manage.py:299:4: R0201: Method could be a function (no-self-use)
TTS/utils/manage.py:314:4: R0201: Method could be a function (no-self-use)
2023-11-08 11:29:01 +01:00
Eren Gölge 77b18126c7
Merge pull request #3126 from akx/freevc-config-module
Move FreeVCConfig to TTS.vc.configs (like all other config classes)
2023-11-08 11:24:47 +01:00
Eren Gölge cc6e9fcaa7
Fix #3153 (#3169) 2023-11-08 11:13:58 +01:00
Eren Gölge a24ebcd8a6
Fix coqui api (#3168) 2023-11-08 10:51:23 +01:00
Julian Weber ce1a39a9a4
Add char limit warn (#3130)
* Add char limit warning

* Adding v2 langs

* cached_property for cutlet

* Fix import
2023-11-08 10:24:23 +01:00
Eren Gölge f846a9f300
Update to v0.20.1 2023-11-07 14:17:36 +01:00
Edresson Casanova cbdbc44e0f
Fix XTTS v2.0 training recipe (#3154)
* Fix XTTS v2.0 training recipe

* Update XTTS v2 model hash
2023-11-07 14:16:44 +01:00
Eren Gölge 5e992d8704
Merge pull request #3149 from coqui-ai/fixup_xtts_v2
Bug fixes and add support for multiple speaker references on XTTS inference
2023-11-07 10:36:20 +01:00
Edresson Casanova 5f9ab6cfaa
Fix style
Co-authored-by: Aarni Koskela <akx@iki.fi>
2023-11-06 19:22:34 -03:00
Edresson Casanova 905900afc9 Update XTTS v1.1 recipe 2023-11-06 19:14:50 -03:00
Edresson Casanova 2470599d18 Drop XTTS v1 2023-11-06 19:12:04 -03:00
Edresson Casanova 13243df526 Update XTTS v1.1 files 2023-11-06 19:10:21 -03:00
Edresson Casanova cabff9f323 Update XTTS v2.0 recipe 2023-11-06 17:47:14 -03:00
Edresson Casanova 09fb317e6d Remove unused code 2023-11-06 17:36:32 -03:00
Edresson Casanova b146de4ce8 Bug fix on XTTS v2.0 Trainer 2023-11-06 20:26:01 +01:00
Edresson Casanova f444f296f2 Add multiple references on xtts inference tests 2023-11-06 20:25:06 +01:00
Edresson Casanova 1b6f8d0e46 Update unit tests and recipes 2023-11-06 20:25:06 +01:00
Edresson Casanova 72b2bac0f8 Load reference in 24khz to avoid issues with multiple sr references 2023-11-06 20:25:06 +01:00
Edresson Casanova 00294ffdf6 Update XTTS docs 2023-11-06 20:24:06 +01:00
Edresson Casanova 459ad70dc8 Add support for multiple speaker references on XTTS inference 2023-11-06 20:22:35 +01:00
Edresson Casanova 9942000c50 Update XTTS v2 recipe model files 2023-11-06 20:20:28 +01:00
Eren Gölge f0cb19ecca
Drop diffusion from XTTS (#3150)
* Drop diffusion for XTTS

* Make style

* Drop diffusion deps in code

* Restore thrashed
2023-11-06 20:15:49 +01:00