Enno Hermann
c30fb0f56b
chore: remove duplicate init_weights
2024-06-26 11:46:37 +02:00
Enno Hermann
cd7b6daf46
fix: clarify types, fix missing functions
2024-06-26 10:17:04 +02:00
Enno Hermann
03de4b889e
docs: fix readthedocs links
...
[ci skip]
2024-06-13 22:48:34 +02:00
Enno Hermann
018f1e6453
docs(bark): update docstrings and type hints
2024-05-15 22:56:55 +02:00
Enno Hermann
59a6c9fdf2
fix(bark): add missing argument for load_voice()
...
Fixes https://github.com/coqui-ai/TTS/issues/2795
2024-05-15 22:56:28 +02:00
Enno Hermann
b711e19cb6
refactor: remove verbose arguments
...
Can be handled by adjusting logging levels instead.
2024-04-03 15:19:45 +02:00
Enno Hermann
b6ab85a050
fix: use logging instead of print statements
...
Fixes #1691
2024-04-03 15:19:45 +02:00
Enno Hermann
d772724125
fix: update repository links, package names, metadata
2024-04-03 12:02:44 +02:00
Enno Hermann
e5c6da1c98
Merge pull request #20 from eginhard/return-complex
...
fix: torch.stft will soon require return_complex=True
2024-03-13 13:50:21 +01:00
Enno Hermann
e95f8950eb
fix: torch.stft will soon require return_complex=True
...
Refactor that removes the deprecation warning:
torch.view_as_real(torch.stft(*, return_complex=True)) is equal to
torch.stft(*, return_complex=False)
https://pytorch.org/docs/stable/generated/torch.stft.html
2024-03-13 12:06:27 +01:00
Enno Hermann
89a061f1d1
docs(tts.models.vits): clarify use of discriminator/generator
...
[ci skip]
2024-03-12 18:59:05 +01:00
Enno Hermann
efdafd5a7f
style: run black
2024-03-07 11:46:51 +01:00
Enno Hermann
24298da5fc
Merge pull request #1 from eginhard/lint-overhaul
...
Lint overhaul (pylint to ruff)
2024-03-06 16:10:26 +01:00
Edresson Casanova
5dcc16d193
Bug fix in MP3 and FLAC compute length on TTSDataset ( #3092 )
...
* Bug Fix on XTTS load
* Bug fix in MP3 length on TTSDataset
* Update TTS/tts/datasets/dataset.py
Co-authored-by: Aarni Koskela <akx@iki.fi>
* Uses mutagen for all audio formats
* Add dataloader test wit hall supported audio formats
* Use mutagen.File
* Update
* Fix aux unit tests
* Bug fixe on unit tests
---------
Co-authored-by: Aarni Koskela <akx@iki.fi>
2023-12-27 13:23:43 -03:00
Aarni Koskela
d6ea806469
Run `make style`
2023-12-13 14:56:41 +02:00
Aarni Koskela
aa549e9028
Fix trailing whitespace
2023-12-13 14:56:41 +02:00
Aarni Koskela
64bb41f4fa
Ruff autofix C41
2023-12-13 14:56:41 +02:00
Aarni Koskela
449820ec7d
Ruff autofix E71*
2023-12-13 14:56:41 +02:00
Aarni Koskela
90991e89b4
Ruff autofix unused imports and import order
2023-12-13 14:56:41 +02:00
Eren Gölge
8c1a8b522b
Merge pull request #3405 from coqui-ai/studio_speakers
...
Add studio speakers to open source XTTS!
2023-12-12 16:10:09 +01:00
WeberJulian
5cd750ac7e
Fix API and CI
2023-12-11 20:21:53 +01:00
WeberJulian
e3c9dab7a3
Make CLI work
2023-12-11 18:49:18 +01:00
WeberJulian
36143fee26
Add basic speaker manager
2023-12-11 15:25:46 +01:00
Frederico S. Oliveira
163f9a3fdf
Merge branch 'coqui-ai:dev' into dev
2023-12-11 10:04:07 -03:00
Edresson Casanova
5f900f156a
Add XTTS Fine tuning gradio demo ( #3296 )
...
* Add XTTS FT demo data processing pipeline
* Add training and inference columns
* Uses tabs instead of columns
* Fix demo freezing issue
* Update demo
* Convert stereo to mono
* Bug fix on XTTS inference
* Update gradio demo
* Update gradio demo
* Update gradio demo
* Update gradio demo
* Add parameters to be able to set then on colab demo
* Add erros messages
* Add intuitive error messages
* Update
* Add max_audio_length parameter
* Add XTTS fine-tuner docs
* Update XTTS finetuner docs
* Delete trainer to freeze memory
* Delete unused variables
* Add gc.collect()
* Update xtts.md
---------
Co-authored-by: Eren Gölge <erogol@hotmail.com>
2023-12-01 23:52:23 +01:00
Frederico S. Oliveira
bcd500fa7b
Fixing bug
...
Correction in training the Fastspeech/Fastspeech2/FastPitch/SpeedySpeech model using external speaker embedding.
2023-11-30 17:27:05 -03:00
Eren G??lge
44880f09ed
Make style
2023-11-17 13:43:34 +01:00
Julian Weber
fbc18b8c34
Fix zh bug ( #3238 )
2023-11-16 17:51:37 +01:00
Julian Weber
675f983550
Add sentence splitting ( #3227 )
...
* Add sentence spliting
* update requirements
* update default args v2
* Add spanish
* Fix return gpt_latents
* Update requirements
* Fix requirements
2023-11-16 11:01:11 +01:00
Edresson Casanova
73a5bd08c0
Fix XTTS GPT padding and inference issues ( #3216 )
...
* Fix end artifact for fine tuning models
* Bug fix on zh-cn inference
* Remove ununsed code
2023-11-15 14:02:05 +01:00
Julian Weber
04901fb2e4
Add speed control for inference ( #3214 )
...
* Add speed control for inference
* Fix XTTS tests
* Add speed control tests
2023-11-14 16:07:17 +01:00
Eren G??lge
92fa988aec
Fixup
2023-11-13 13:44:06 +01:00
Eren G??lge
a16360af85
Implement chunking gpt_cond
2023-11-13 13:00:08 +01:00
Matthew Boakes
1b9c400bca
PyTorch 2.1 Updates (Weight Norm and TorchAudio I/O) ( #3176 )
...
* Replaced PyTorch weight_norm With parametrizations.weight_norm
* TorchAudio: Migrating The I/O Functions To Use The Dispatcher Mechanism
* Corrected Code Style
---------
Co-authored-by: Eren Gölge <erogol@hotmail.com>
2023-11-09 16:31:03 +01:00
Gorkem
66a1e248d0
torchaudio should use proper backend to load audio ( #3179 )
2023-11-09 16:28:39 +01:00
Julian Weber
03ad90135b
Add lang code in XTTS doc ( #3158 )
...
* Add lang code in XTTS doc
* Remove ununsed config and args
* update docs
* woops
2023-11-08 13:47:33 +01:00
Gorkem
78a596618a
Fix for exception on streaming if last chunk empty ( #3160 )
2023-11-08 11:32:02 +01:00
Edresson Casanova
09fb317e6d
Remove unused code
2023-11-06 17:36:32 -03:00
Edresson Casanova
1b6f8d0e46
Update unit tests and recipes
2023-11-06 20:25:06 +01:00
Edresson Casanova
72b2bac0f8
Load reference in 24khz to avoid issued with multiple sr references
2023-11-06 20:25:06 +01:00
Edresson Casanova
00294ffdf6
Update XTTS docs
2023-11-06 20:24:06 +01:00
Edresson Casanova
459ad70dc8
Add support for multiples speaker references on XTTS inference
2023-11-06 20:22:35 +01:00
Eren Gölge
f0cb19ecca
Drop diffusion from XTTS ( #3150 )
...
* Drop diffusion for XTTS
* Make style
* Drop diffusion deps in code
* Restore thrashed
2023-11-06 20:15:49 +01:00
Eren G??lge
5d418bb84a
Update docs
2023-11-06 18:48:41 +01:00
Eren G??lge
9bbf6eb8dd
Drop use_ne_hifigan
2023-11-06 18:43:38 +01:00
Eren G??lge
9d54bd7655
Fixup XTTS
2023-11-06 18:13:58 +01:00
Edresson Casanova
e45227d9ff
XTTS v2.0 ( #3137 )
...
* Implement most similar ref training approach
* Use non-enhanced hifigan for test samples
* Add Perceiver
* Update GPT Trainer for perceiver support
* Update XTTS docs
* Bug fix masking with XTTS perceiver
* Bug fix on gpt forward
* Bug Fix on XTTS v2.0 training
* Add XTTS v2.0 unit tests
* Add XTTS v2.0 inference unit tests
* Bug Fix on diffusion inference
* Add XTTS v2.0 training recipe
* Placeholder model entry
* Add cloning params to config
* Make prompt embedding configurable
* Make cloning configurable
* Cheap fix for a cheaper fix
* Prevent resampling
* Update model entry
* Update docs
* Update requirements
* Code linting
* Add xtts v2 to sep tests
* Bug fix on XTTS get_gpt_cond_latents
* Bug fix on rebase
* Make style
* Bug fix in Japenese tokenizer
* Add num2words to deps
* Remove unused kwarg and added num_beams=1 as default
---------
Co-authored-by: Eren G??lge <egolge@coqui.ai>
2023-11-06 14:58:18 +01:00
Aarni Koskela
38f6f8f0bb
Run `make style` & re-enable it in CI ( #3127 )
2023-11-06 11:36:37 +01:00
WeberJulian
1c98821359
Remove unused load_audio function
2023-10-27 22:27:18 +02:00
WeberJulian
d4e08c8d6c
Add features to get_conditioning_latents
2023-10-26 14:57:33 +02:00