Enno Hermann
33ac0d6ee1
refactor(xtts): use build_hf_gpt_transformer from tortoise
2024-11-21 15:44:42 +01:00
Enno Hermann
490c973371
refactor(xtts): use position embedding from tortoise
2024-11-21 15:44:42 +01:00
Enno Hermann
66701e1e51
refactor(xtts): reuse functions/classes from tortoise
2024-11-21 12:51:42 +01:00
Enno Hermann
1b6d3ebd33
refactor(xtts): remove duplicate hifigan generator
2024-11-21 11:53:35 +01:00
Enno Hermann
993da778b4
chore: use original instead of scarf urls
...
These allowed Coqui to get download stats, which we don't need anymore
2024-11-10 22:19:25 +01:00
Enno Hermann
6314032fd7
Merge pull request #113 from idiap/pytorch
...
fix: only enable load with weights_only in pytorch>=2.4
2024-11-04 22:14:42 +01:00
Enno Hermann
8e66be2c32
fix: only enable load with weights_only in pytorch>=2.4
...
Allows moving the minimum Pytorch version back to 2.1
2024-11-04 18:39:51 +01:00
Enno Hermann
964b813235
fix(gpt): set attention mask and address other warnings
2024-10-25 18:49:04 +02:00
Johnny Street
018d4ba1db
fix(xtts): support transformers>=4.43.0 in streaming inference
2024-10-20 15:20:26 +02:00
Enno Hermann
e5dd06b3bb
Merge pull request #77 from shavit/71-torch-load
...
Load weights only in torch.load
2024-09-12 23:28:57 +01:00
shavit
17ca24c3d6
fix: load weights only in torch.load
2024-09-12 23:37:19 +02:00
Enno Hermann
1920328822
feat(xtts): support hindi in tokenizer ( #64 )
...
Added proper tokenizer support for Hindi Language which would prevent crash while fine tuning Hindi language.
Co-authored-by: Akshat Bhardwaj <157223825+akshatrocky@users.noreply.github.com>
2024-09-12 21:29:21 +02:00
Daniel Walmsley
20bbb411c2
fix(xtts): update streaming for transformers>=4.42.0 ( #59 )
...
* Fix Stream Generator on MacOS
* Make it work on mps
* Implement custom tensor.isin
* Fix for latest TF
* Comment out hack for now
* Remove unused code
* build: increase minimum transformers version
* style: fix
---------
Co-authored-by: Enno Hermann <Eginhard@users.noreply.github.com>
2024-07-25 16:24:10 +02:00
Enno Hermann
da82d55329
refactor: use load_fsspec from trainer
...
Made automatically with:
rg "from TTS.utils.io import load_fsspec" --files-with-matches | xargs sed -i 's/from TTS.utils.io import load_fsspec/from trainer.io import load_fsspec/g'
2024-06-29 15:07:10 +02:00
Enno Hermann
4bd3df2607
refactor: remove duplicate get_padding
2024-06-26 11:54:36 +02:00
Enno Hermann
4d9e18ea7d
chore(stream_generator): address lint issues
2024-06-17 09:52:35 +02:00
Enno Hermann
2a281237d7
refactor(stream_generator): update code for transformers>=4.41.1
...
In line with
eed9ed6798/src/transformers/generation/utils.py
2024-06-17 09:52:35 +02:00
Enno Hermann
4b6da4e7ba
refactor(stream_generator): update special tokens for transformers>=4.41.1
...
Fixes #31 . The handling of special tokens in `transformers` was changed in
https://github.com/huggingface/transformers/pull/30624 and
https://github.com/huggingface/transformers/pull/30746 . This updates the XTTS
streaming code accordingly.
2024-06-17 09:52:35 +02:00
Enno Hermann
6d563af623
chore: remove obsolete code for torch<2
...
Minimum torch version is 2.1 now.
2024-05-08 18:08:40 +02:00
Enno Hermann
865a48156d
fix: make korean g2p deps optional
2024-05-08 18:08:40 +02:00
Enno Hermann
55ed162f2a
fix: make chinese g2p deps optional
2024-05-08 18:08:40 +02:00
Enno Hermann
2ad790d169
Merge pull request #4 from idiap/hindi
...
feat(xtts): support Hindi for sentence-splitting and fine-tuning
2024-04-11 16:49:44 +02:00
Enno Hermann
d41686502e
feat(xtts): support hindi for sentence-splitting and fine-tuning
...
The XTTS model itself already supports Hindi, it was just in these components.
2024-04-08 15:57:56 +02:00
Enno Hermann
b6ab85a050
fix: use logging instead of print statements
...
Fixes #1691
2024-04-03 15:19:45 +02:00
Enno Hermann
309f39a45f
fix(xtts_manager): name_to_id() should return dict
...
This is how the other embedding managers work
2024-03-08 14:47:00 +01:00
Enno Hermann
efdafd5a7f
style: run black
2024-03-07 11:46:51 +01:00
Aarni Koskela
d6ea806469
Run `make style`
2023-12-13 14:56:41 +02:00
Aarni Koskela
bd172dabbf
xtts/stream_generator: remove duplicate import + code
2023-12-13 14:56:41 +02:00
Aarni Koskela
32abb1a7c4
xtts/perceiver_encoder: Delete duplicate exists()
2023-12-13 14:56:41 +02:00
Aarni Koskela
33b69c6c09
Add some noqa directives (for now)
2023-12-13 14:56:41 +02:00
Aarni Koskela
00f8f4892a
Ruff autofix unnecessary passes
2023-12-13 14:56:41 +02:00
Aarni Koskela
64bb41f4fa
Ruff autofix C41
2023-12-13 14:56:41 +02:00
Aarni Koskela
90991e89b4
Ruff autofix unused imports and import order
2023-12-13 14:56:41 +02:00
Eren Gölge
8c1a8b522b
Merge pull request #3405 from coqui-ai/studio_speakers
...
Add studio speakers to open source XTTS!
2023-12-12 16:10:09 +01:00
WeberJulian
5cd750ac7e
Fix API and CI
2023-12-11 20:21:53 +01:00
WeberJulian
a5c0d9780f
rename manager
2023-12-11 18:48:31 +01:00
WeberJulian
36143fee26
Add basic speaker manager
2023-12-11 15:25:46 +01:00
Aaron-Li
b6e929696a
support multiple GPU training
2023-12-08 16:55:32 +08:00
Eren Gölge
e49c512d99
Merge pull request #3351 from aaron-lii/chinese-puncs
...
fix pause problem of Chinese speech
2023-12-04 15:57:42 +01:00
Edresson Casanova
5f900f156a
Add XTTS Fine tuning gradio demo ( #3296 )
...
* Add XTTS FT demo data processing pipeline
* Add training and inference columns
* Uses tabs instead of columns
* Fix demo freezing issue
* Update demo
* Convert stereo to mono
* Bug fix on XTTS inference
* Update gradio demo
* Update gradio demo
* Update gradio demo
* Update gradio demo
* Add parameters to be able to set then on colab demo
* Add erros messages
* Add intuitive error messages
* Update
* Add max_audio_length parameter
* Add XTTS fine-tuner docs
* Update XTTS finetuner docs
* Delete trainer to freeze memory
* Delete unused variables
* Add gc.collect()
* Update xtts.md
---------
Co-authored-by: Eren Gölge <erogol@hotmail.com>
2023-12-01 23:52:23 +01:00
Aaron-Li
7b8808186a
fix pause problem of Chinese speech
2023-12-01 23:30:03 +08:00
Eren G??lge
3b8894a3dd
Make style
2023-11-27 14:15:50 +01:00
Eren G??lge
32065139e7
Simple text cleaner for "hi"
2023-11-24 15:14:34 +01:00
Edresson Casanova
11283fce07
Ensures that only GPT model is in training mode during XTTS GPT training ( #3241 )
...
* Ensures that only GPT model is in training mode during training
* Fix parallel wavegan unit test
2023-11-17 15:13:46 +01:00
Eren G??lge
44880f09ed
Make style
2023-11-17 13:43:34 +01:00
Julian Weber
fbc18b8c34
Fix zh bug ( #3238 )
2023-11-16 17:51:37 +01:00
Julian Weber
675f983550
Add sentence splitting ( #3227 )
...
* Add sentence spliting
* update requirements
* update default args v2
* Add spanish
* Fix return gpt_latents
* Update requirements
* Fix requirements
2023-11-16 11:01:11 +01:00
Edresson Casanova
73a5bd08c0
Fix XTTS GPT padding and inference issues ( #3216 )
...
* Fix end artifact for fine tuning models
* Bug fix on zh-cn inference
* Remove ununsed code
2023-11-15 14:02:05 +01:00
Eren Gölge
ac3df409a6
Merge pull request #3208 from coqui-ai/fix_max_mel_len
...
fix max generation length for XTTS
2023-11-13 14:32:56 +01:00
WeberJulian
b85536b23f
fix max generation length
2023-11-13 13:18:45 +01:00