Abraham Mathews
6ea3b75b84
Update xtts.py (#53)
...
docs(xtts): fix typo in example
2024-07-02 13:43:52 +02:00
Enno Hermann
da82d55329
refactor: use load_fsspec from trainer
...
Made automatically with:
rg "from TTS.utils.io import load_fsspec" --files-with-matches | xargs sed -i 's/from TTS.utils.io import load_fsspec/from trainer.io import load_fsspec/g'
2024-06-29 15:07:10 +02:00
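For readers without `rg` and `sed` at hand, the same bulk import rewrite can be sketched in Python. This is only an illustration: the helper name `rewrite_imports` is hypothetical, and restricting the search to `*.py` files is an assumption (the original command searched every file). It is run here against a throwaway directory rather than a real checkout.

```python
import tempfile
from pathlib import Path

OLD = "from TTS.utils.io import load_fsspec"
NEW = "from trainer.io import load_fsspec"


def rewrite_imports(root: Path) -> list:
    """Replace OLD with NEW in every .py file under root; return the changed paths.

    Hypothetical helper, not part of the repository.
    """
    changed = []
    for path in sorted(root.rglob("*.py")):
        text = path.read_text(encoding="utf-8")
        if OLD in text:
            path.write_text(text.replace(OLD, NEW), encoding="utf-8")
            changed.append(path)
    return changed


# Demo on a temporary directory with one made-up source file.
with tempfile.TemporaryDirectory() as tmp:
    sample = Path(tmp) / "model.py"
    sample.write_text(OLD + "\n\nstate = load_fsspec('x.pth')\n", encoding="utf-8")
    changed_names = [p.name for p in rewrite_imports(Path(tmp))]
    rewritten = sample.read_text(encoding="utf-8")
```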
Enno Hermann
0fb26f97df
refactor: use get_user_data_dir from trainer
2024-06-29 15:07:10 +02:00
Enno Hermann
4bd3df2607
refactor: remove duplicate get_padding
2024-06-26 11:54:36 +02:00
Enno Hermann
c30fb0f56b
chore: remove duplicate init_weights
2024-06-26 11:46:37 +02:00
Enno Hermann
c5241d71ab
chore: address pytorch deprecations
...
torch.range(a, b) == torch.arange(a, b+1)
meshgrid indexing: https://github.com/pytorch/pytorch/issues/50276
checkpoint use_reentrant:
https://dev-discuss.pytorch.org/t/bc-breaking-update-to-torch-utils-checkpoint-not-passing-in-use-reentrant-flag-will-raise-an-error/1745
optimizer.step() before scheduler.step():
https://pytorch.org/docs/stable/optim.html#how-to-adjust-learning-rate
2024-06-26 11:38:25 +02:00
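The four deprecations listed in the commit above can be illustrated in one short sketch. It assumes a recent PyTorch 2.x install; the tensors, layer and scheduler are made up for the demonstration and are not taken from the repository.

```python
import torch
from torch.utils.checkpoint import checkpoint

a, b = 2, 5

# torch.range(a, b) included the end point; torch.arange excludes it,
# so the end is shifted by one to get the same values.
# (The default dtype may differ: arange yields integers for integer inputs.)
idx = torch.arange(a, b + 1)

# Passing `indexing` explicitly avoids the meshgrid warning;
# "ij" corresponds to the historical default behaviour.
rows, cols = torch.meshgrid(torch.arange(2), torch.arange(3), indexing="ij")

# checkpoint() expects use_reentrant to be passed explicitly.
layer = torch.nn.Linear(4, 4)
x = torch.randn(1, 4, requires_grad=True)
y = checkpoint(layer, x, use_reentrant=False)

# The optimizer steps first, then the learning-rate scheduler.
optimizer = torch.optim.SGD(layer.parameters(), lr=0.1)
scheduler = torch.optim.lr_scheduler.StepLR(optimizer, step_size=1, gamma=0.5)
y.sum().backward()
optimizer.step()
scheduler.step()
```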
Enno Hermann
f8df19a10c
refactor: remove duplicate convert_pad_shape
2024-06-26 10:17:04 +02:00
Enno Hermann
cd7b6daf46
fix: clarify types, fix missing functions
2024-06-26 10:17:04 +02:00
Enno Hermann
4d9e18ea7d
chore(stream_generator): address lint issues
2024-06-17 09:52:35 +02:00
Enno Hermann
2a281237d7
refactor(stream_generator): update code for transformers>=4.41.1
...
In line with
eed9ed6798/src/transformers/generation/utils.py
2024-06-17 09:52:35 +02:00
Enno Hermann
4b6da4e7ba
refactor(stream_generator): update special tokens for transformers>=4.41.1
...
Fixes #31. The handling of special tokens in `transformers` was changed in
https://github.com/huggingface/transformers/pull/30624 and
https://github.com/huggingface/transformers/pull/30746. This updates the XTTS
streaming code accordingly.
2024-06-17 09:52:35 +02:00
Enno Hermann
9cfcc0a0f5
chore(cleaners): add type hints
2024-06-14 15:20:04 +02:00
Enno Hermann
e5c208d254
feat(cleaners): add multilingual phoneme cleaner
...
This doesn't convert numbers into English words.
2024-06-14 15:06:03 +02:00
Enno Hermann
03de4b889e
docs: fix readthedocs links
...
[ci skip]
2024-06-13 22:48:34 +02:00
Enno Hermann
07cbcf825c
fix(espeak_wrapper): read phonemize() input from file
...
Avoids utf8 encoding issues on Windows when passing the text directly.
Fixes https://github.com/coqui-ai/TTS/discussions/3761
2024-05-29 10:10:05 +02:00
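The idea behind the commit above, handing text to a child process through a UTF-8 file instead of the command line, might look roughly like this. It is a sketch under stated assumptions: a child Python process stands in for the espeak executable, and `run_with_text_file` is a hypothetical name, not the wrapper's actual function.

```python
import subprocess
import sys
import tempfile
from pathlib import Path


def run_with_text_file(text: str) -> str:
    """Write text to a UTF-8 temp file and pass only its path to the child."""
    with tempfile.TemporaryDirectory() as tmp:
        path = Path(tmp) / "input.txt"
        path.write_text(text, encoding="utf-8")
        # Stand-in for the espeak call: a child Python process that reads the
        # file as UTF-8 and reports how many characters it saw.
        child = "import sys; print(len(open(sys.argv[1], encoding='utf-8').read()))"
        result = subprocess.run(
            [sys.executable, "-c", child, str(path)],
            capture_output=True,
            text=True,
            check=True,
        )
    return result.stdout.strip()


sample_text = "Grüße, 你好"
char_count = run_with_text_file(sample_text)
```

Because only an ASCII-safe path crosses the process boundary, the console code page no longer affects how the text itself is decoded.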
Enno Hermann
49fcbd908b
fix(espeak_wrapper): avoid stuck process on windows
...
Fixes #24
2024-05-29 07:39:03 +02:00
Enno Hermann
203f60f1e1
refactor(espeak_wrapper): remove sync argument
...
_espeak_exe is always called with sync=True, so remove code for sync==False
2024-05-28 21:30:55 +02:00
Enno Hermann
df088e99df
Merge pull request #19 from idiap/toml
...
Move from setup.py to pyproject.toml, simplify requirements
2024-05-27 08:59:09 +01:00
Enno Hermann
018f1e6453
docs(bark): update docstrings and type hints
2024-05-15 22:56:55 +02:00
Enno Hermann
59a6c9fdf2
fix(bark): add missing argument for load_voice()
...
Fixes https://github.com/coqui-ai/TTS/issues/2795
2024-05-15 22:56:28 +02:00
Enno Hermann
6d563af623
chore: remove obsolete code for torch<2
...
Minimum torch version is 2.1 now.
2024-05-08 18:08:40 +02:00
Enno Hermann
865a48156d
fix: make korean g2p deps optional
2024-05-08 18:08:40 +02:00
Enno Hermann
55ed162f2a
fix: make chinese g2p deps optional
2024-05-08 18:08:40 +02:00
Enno Hermann
ea893c3795
fix: make bangla g2p deps optional
2024-05-08 18:08:40 +02:00
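A common way to make language-specific dependencies optional, as in the three g2p commits above, is to import them lazily and fail with an actionable message. The following is a generic sketch of that pattern; `load_optional`, the module name and the extra name are hypothetical and are not taken from the repository.

```python
import importlib


def load_optional(module_name: str, extra: str):
    """Import module_name, or raise an ImportError naming the extra to install.

    Hypothetical helper illustrating the lazy-import pattern.
    """
    try:
        return importlib.import_module(module_name)
    except ImportError as e:
        raise ImportError(
            f"{module_name} is needed for this language; "
            f"install the optional '{extra}' dependencies."
        ) from e


# A module that exists imports normally.
json_module = load_optional("json", "demo")

# A missing module produces a message that names the extra.
try:
    load_optional("hypothetical_g2p_backend_xyz", "ko")
    error_message = ""
except ImportError as e:
    error_message = str(e)
```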
Enno Hermann
ec50006855
style: run pre-commit
...
Automatic changes from: pre-commit run --all-files
2024-05-08 12:17:47 +02:00
Enno Hermann
fb92e13ebb
build: remove unused/obsolete code
2024-05-08 12:13:41 +02:00
Enno Hermann
962f9bbbcf
refactor(espeak_wrapper): fix ruff lint suggestions
2024-05-01 13:31:39 +02:00
Enno Hermann
7b2289a454
fix(espeak_wrapper): capture stderr separately
...
Fixes https://github.com/coqui-ai/TTS/issues/2728
Previously, error messages from espeak were treated as normal output and also
converted to phonemes. This captures and logs them separately.
2024-05-01 12:31:49 +02:00
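Keeping diagnostics out of the phonemizer output comes down to giving stdout and stderr separate pipes. A minimal sketch, assuming a child Python process as a stand-in for espeak; the function name and the two message strings are made up for illustration.

```python
import logging
import subprocess
import sys

logger = logging.getLogger(__name__)


def run_and_split(cmd):
    """Run cmd, return (stdout lines, stderr lines), and log stderr separately."""
    result = subprocess.run(
        cmd,
        stdout=subprocess.PIPE,
        stderr=subprocess.PIPE,
        text=True,
    )
    err_lines = result.stderr.splitlines()
    for line in err_lines:
        # Diagnostics are logged instead of being mixed into the normal output.
        logger.warning("%s", line)
    return result.stdout.splitlines(), err_lines


# Stand-in child process: one line on stdout, one on stderr.
child = "import sys; print('phoneme output'); print('diagnostic message', file=sys.stderr)"
out_lines, err_lines = run_and_split([sys.executable, "-c", child])
```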
Enno Hermann
52a52b5e21
fix(LanguageManager): allow initialisation from config with language ids file
...
Previously, `LanguageManager.init_from_config(config)` never used the
`language_ids_file`, even when that field was present, because the manager was
overwritten in the next line with a new one that manually parses languages
from the datasets in the config. Now that parsing is only used as a fallback.
2024-04-19 11:57:27 +02:00
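The control flow described in the commit above (prefer the ids file, fall back to the datasets) can be sketched as follows. This does not reproduce the real `LanguageManager` API: the function below, the plain-dict config, the JSON file format and the `language` key on dataset entries are all assumptions made for illustration.

```python
import json
import tempfile
from pathlib import Path


def init_from_config(config: dict) -> dict:
    """Return a language -> id mapping, preferring the ids file when configured."""
    ids_file = config.get("language_ids_file")
    if ids_file:
        # Preferred path: an explicitly configured file wins.
        with open(ids_file, encoding="utf-8") as f:
            return json.load(f)
    # Fallback: collect the languages named in the dataset entries.
    languages = sorted({d["language"] for d in config.get("datasets", [])})
    return {name: idx for idx, name in enumerate(languages)}


with tempfile.TemporaryDirectory() as tmp:
    ids_path = Path(tmp) / "language_ids.json"
    ids_path.write_text(json.dumps({"en": 0, "hi": 1}), encoding="utf-8")
    # Both sources present: the file takes priority over the datasets.
    from_file = init_from_config(
        {"language_ids_file": str(ids_path), "datasets": [{"language": "de"}]}
    )

# No file configured: languages are parsed from the datasets.
from_datasets = init_from_config({"datasets": [{"language": "fr"}, {"language": "de"}]})
```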
Enno Hermann
b3c9685aee
fix(tokenizer): add debug logging
2024-04-11 16:58:12 +02:00
Enno Hermann
2ad790d169
Merge pull request #4 from idiap/hindi
...
feat(xtts): support Hindi for sentence-splitting and fine-tuning
2024-04-11 16:49:44 +02:00
Enno Hermann
d41686502e
feat(xtts): support hindi for sentence-splitting and fine-tuning
...
The XTTS model itself already supports Hindi; it was just missing in these components.
2024-04-08 15:57:56 +02:00
Enno Hermann
b711e19cb6
refactor: remove verbose arguments
...
Can be handled by adjusting logging levels instead.
2024-04-03 15:19:45 +02:00
Enno Hermann
b6ab85a050
fix: use logging instead of print statements
...
Fixes #1691
2024-04-03 15:19:45 +02:00
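The two commits above replace `verbose` flags and print statements with the standard `logging` module, so callers control output through log levels. A small sketch of the effect; the logger name `tts_demo` and the function `load_model` are hypothetical.

```python
import io
import logging

logger = logging.getLogger("tts_demo")  # hypothetical logger name


def load_model(name: str) -> str:
    # Formerly something like: if verbose: print(...)
    logger.info("Loading %s", name)
    return name


# Capture log output in memory so the effect of the level is visible.
stream = io.StringIO()
logger.addHandler(logging.StreamHandler(stream))
logger.propagate = False

logger.setLevel(logging.WARNING)  # "quiet": info messages are dropped
load_model("model_a")
quiet_output = stream.getvalue()

logger.setLevel(logging.INFO)  # "verbose": info messages are emitted
load_model("model_b")
verbose_output = stream.getvalue()
```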
Enno Hermann
d772724125
fix: update repository links, package names, metadata
2024-04-03 12:02:44 +02:00
Enno Hermann
7630abb43f
refactor(bin.find_unique_chars): use existing function
2024-03-30 22:22:40 +01:00
Enno Hermann
adbcba06da
refactor(dataset): get audio length with torchaudio
...
Removes a (GPL) dependency
2024-03-14 20:48:29 +01:00
Enno Hermann
e5c6da1c98
Merge pull request #20 from eginhard/return-complex
...
fix: torch.stft will soon require return_complex=True
2024-03-13 13:50:21 +01:00
Enno Hermann
e95f8950eb
fix: torch.stft will soon require return_complex=True
...
Refactor that removes the deprecation warning:
torch.view_as_real(torch.stft(*, return_complex=True)) is equal to
torch.stft(*, return_complex=False)
https://pytorch.org/docs/stable/generated/torch.stft.html
2024-03-13 12:06:27 +01:00
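The equivalence stated in the commit above means existing code that expects a trailing real/imaginary dimension can keep working by wrapping the complex result. A minimal sketch, assuming PyTorch 2.x; the signal length and STFT parameters are arbitrary demonstration values.

```python
import torch

signal = torch.randn(1, 1024)
window = torch.hann_window(256)

# New-style call: the result is a complex tensor.
spec = torch.stft(
    signal, n_fft=256, hop_length=64, window=window, return_complex=True
)

# view_as_real appends a last dimension of size 2 (real, imaginary),
# which is the layout the old return_complex=False call produced.
spec_real = torch.view_as_real(spec)
```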
Enno Hermann
89a061f1d1
docs(tts.models.vits): clarify use of discriminator/generator
...
[ci skip]
2024-03-12 18:59:05 +01:00
Enno Hermann
2e8f47a33d
Merge pull request #10 from eginhard/fix-pinyin
...
fix chinese pinyin phonemes
2024-03-09 16:23:28 +01:00
Enno Hermann
309f39a45f
fix(xtts_manager): name_to_id() should return dict
...
This is how the other embedding managers work
2024-03-08 14:47:00 +01:00
Enno Hermann
efdafd5a7f
style: run black
2024-03-07 11:46:51 +01:00
Enno Hermann
017c84d005
style: make style && make lint
2024-03-06 22:45:35 +01:00
Enno Hermann
e05243c4c8
refactor: read/write csv files with standard library
2024-03-06 16:18:09 +01:00
Enno Hermann
24298da5fc
Merge pull request #1 from eginhard/lint-overhaul
...
Lint overhaul (pylint to ruff)
2024-03-06 16:10:26 +01:00
wangjie
b184e9f0fe
fix chinese pinyin phonemes
2024-01-12 09:11:56 +08:00
Edresson Casanova
5dcc16d193
Bug fix in MP3 and FLAC compute length on TTSDataset (#3092)
...
* Bug Fix on XTTS load
* Bug fix in MP3 length on TTSDataset
* Update TTS/tts/datasets/dataset.py
Co-authored-by: Aarni Koskela <akx@iki.fi>
* Uses mutagen for all audio formats
* Add dataloader test with all supported audio formats
* Use mutagen.File
* Update
* Fix aux unit tests
* Bug fix on unit tests
---------
Co-authored-by: Aarni Koskela <akx@iki.fi>
2023-12-27 13:23:43 -03:00
Aarni Koskela
d6ea806469
Run `make style`
2023-12-13 14:56:41 +02:00
Aarni Koskela
bd172dabbf
xtts/stream_generator: remove duplicate import + code
2023-12-13 14:56:41 +02:00