Enno Hermann
1f27f994a1
refactor(utils): remove duplicate set_partial_state_dict
2024-11-21 11:53:35 +01:00
Enno Hermann
1b6d3ebd33
refactor(xtts): remove duplicate hifigan generator
2024-11-21 11:53:35 +01:00
Enno Hermann
fbbae5ac6a
refactor(audio): remove duplicate rms_volume_norm function
2024-11-18 10:01:52 +01:00
Enno Hermann
8ba3233ec6
refactor(audio): remove duplicate save_wav code
2024-11-18 10:01:50 +01:00
Enno Hermann
5784f6705a
refactor(audio): improve type hints, address lint issues
2024-11-18 10:00:55 +01:00
Enno Hermann
48f5be2ccb
feat(audio): automatically convert audio to mono
2024-11-18 01:11:43 +01:00
Enno Hermann
627bbe4150
fix(xtts): more helpful error message when vocab.json not found
2024-11-17 00:30:32 +01:00
Enno Hermann
e81f8d079c
fix: more helpful error message when formatter is not found
2024-11-17 00:29:49 +01:00
Enno Hermann
993da778b4
chore: use original instead of scarf urls
...
These allowed Coqui to get download stats, which we don't need anymore
2024-11-10 22:19:25 +01:00
Enno Hermann
2df9bfa78e
refactor: handle deprecation of torch.cuda.amp.autocast ( #144 )
...
torch.cuda.amp.autocast(args...) and torch.cpu.amp.autocast(args...) will be
deprecated. Please use torch.autocast("cuda", args...) or torch.autocast("cpu",
args...) instead.
https://pytorch.org/docs/stable/amp.html
2024-11-09 18:37:08 +01:00
Shavit
540e8d6cf2
fix(bin.synthesize): return speakers names only ( #147 )
2024-11-09 18:35:54 +01:00
Enno Hermann
0971bc236e
refactor: use external package for monotonic alignment
2024-11-08 12:35:04 +01:00
Enno Hermann
6314032fd7
Merge pull request #113 from idiap/pytorch
...
fix: only enable load with weights_only in pytorch>=2.4
2024-11-04 22:14:42 +01:00
Enno Hermann
8e66be2c32
fix: only enable load with weights_only in pytorch>=2.4
...
Allows moving the minimum Pytorch version back to 2.1
2024-11-04 18:39:51 +01:00
Enno Hermann
47ad0bf190
fix(text.characters): add nasal diacritic ( #127 )
2024-11-04 17:40:23 +01:00
Enno Hermann
964b813235
fix(gpt): set attention mask and address other warnings
2024-10-25 18:49:04 +02:00
Johnny Street
018d4ba1db
fix(xtts): support transformers>=4.43.0 in streaming inference
2024-10-20 15:20:26 +02:00
Shavit
36611a7192
feat: normalize unicode characters in text cleaners ( #85 )
...
* Add normalizer type C to text cleaners
* Linter recommendations
* Add unicode normalize to every cleaner
* Format test_text_cleaners.py
2024-10-02 17:01:19 +02:00
Enno Hermann
e5dd06b3bb
Merge pull request #77 from shavit/71-torch-load
...
Load weights only in torch.load
2024-09-12 23:28:57 +01:00
Enno Hermann
659b4852ba
chore(bark): remove manual download of hubert model
...
Bark was previously adapted to download Hubert from HuggingFace, so the manual
download is superfluous.
2024-09-12 23:37:19 +02:00
Enno Hermann
86b58fb6d9
fix: define torch safe globals for torch.load
...
Required for loading some models using torch.load(..., weights_only=True). This
is only available from Pytorch 2.4
2024-09-12 23:37:19 +02:00
shavit
17ca24c3d6
fix: load weights only in torch.load
2024-09-12 23:37:19 +02:00
Enno Hermann
1920328822
feat(xtts): support hindi in tokenizer ( #64 )
...
Added proper tokenizer support for Hindi Language which would prevent crash while fine tuning Hindi language.
Co-authored-by: Akshat Bhardwaj <157223825+akshatrocky@users.noreply.github.com>
2024-09-12 21:29:21 +02:00
Enno Hermann
9c604c1de0
chore(dataset): address lint issues
2024-07-31 15:47:27 +02:00
Enno Hermann
8c460d0cd0
fix(dataset): skip files where audio length can't be computed
...
Avoids hard failures when the audio can't be decoded.
2024-07-31 15:20:56 +02:00
Daniel Walmsley
20bbb411c2
fix(xtts): update streaming for transformers>=4.42.0 ( #59 )
...
* Fix Stream Generator on MacOS
* Make it work on mps
* Implement custom tensor.isin
* Fix for latest TF
* Comment out hack for now
* Remove unused code
* build: increase minimum transformers version
* style: fix
---------
Co-authored-by: Enno Hermann <Eginhard@users.noreply.github.com>
2024-07-25 16:24:10 +02:00
Enno Hermann
20583a496e
Merge pull request #57 from idiap/xtts-vocab
...
fix(xtts): load tokenizer file based on config as last resort
2024-07-25 13:26:28 +01:00
Enno Hermann
de35920317
Merge pull request #50 from idiap/umap
...
build: move umap-learn into optional notebook dependencies
2024-07-25 13:26:09 +01:00
Enno Hermann
9192ef1aa6
fix(xtts): load tokenizer file based on config as last resort
2024-07-05 13:52:01 +02:00
Abraham Mathews
6ea3b75b84
Update xtts.py ( #53 )
...
docs(xtts): fix typo in example
2024-07-02 13:43:52 +02:00
Enno Hermann
2d06aeb79b
chore: remove unused TTS.utils.io module
...
All uses of these methods were replaced with the equivalents from coqui-tts-trainer
2024-06-29 15:07:10 +02:00
Enno Hermann
e869b9b658
refactor: use load_checkpoint from trainer
2024-06-29 15:07:10 +02:00
Enno Hermann
da82d55329
refactor: use load_fsspec from trainer
...
Made automatically with:
rg "from TTS.utils.io import load_fsspec" --files-with-matches | xargs sed -i 's/from TTS.utils.io import load_fsspec/from trainer.io import load_fsspec/g'
2024-06-29 15:07:10 +02:00
Enno Hermann
0fb26f97df
refactor: use get_user_data_dir from trainer
2024-06-29 15:07:10 +02:00
Enno Hermann
28296c6458
refactor: use get_git_branch from trainer
2024-06-29 15:07:10 +02:00
Enno Hermann
59ef28d708
build: move umap-learn into optional notebook dependencies
...
Except for notebooks, it's only used to show embedding plots during speaker
encoder training, in which case a warning is now shown to install it.
2024-06-26 23:53:17 +02:00
Enno Hermann
4bd3df2607
refactor: remove duplicate get_padding
2024-06-26 11:54:36 +02:00
Enno Hermann
c30fb0f56b
chore: remove duplicate init_weights
2024-06-26 11:46:37 +02:00
Enno Hermann
c5241d71ab
chore: address pytorch deprecations
...
torch.range(a, b) == torch.arange(a, b+1)
meshgrid indexing: https://github.com/pytorch/pytorch/issues/50276
checkpoint use_reentrant:
https://dev-discuss.pytorch.org/t/bc-breaking-update-to-torch-utils-checkpoint-not-passing-in-use-reentrant-flag-will-raise-an-error/1745
optimizer.step() before scheduler.step():
https://pytorch.org/docs/stable/optim.html#how-to-adjust-learning-rate
2024-06-26 11:38:25 +02:00
Enno Hermann
a755328e49
refactor(freevc): remove duplicate sequence_mask
2024-06-26 10:17:04 +02:00
Enno Hermann
f8df19a10c
refactor: remove duplicate convert_pad_shape
2024-06-26 10:17:04 +02:00
Enno Hermann
cd7b6daf46
fix: clarify types, fix missing functions
2024-06-26 10:17:04 +02:00
Enno Hermann
d65bcf65bb
chore(freevc): remove duplicate DDSConv and ElementwiseAffine
...
Already exist as:
TTS.tts.layers.vits.stochastic_duration_predictor.DilatedDepthSeparableConv
TTS.tts.layers.vits.stochastic_duration_predictor.ElementwiseAffine
2024-06-26 10:17:04 +02:00
Enno Hermann
9f80e043e4
refactor(freevc): use existing layernorm
2024-06-26 10:17:04 +02:00
Enno Hermann
4d9e18ea7d
chore(stream_generator): address lint issues
2024-06-17 09:52:35 +02:00
Enno Hermann
2a281237d7
refactor(stream_generator): update code for transformers>=4.41.1
...
In line with
eed9ed6798/src/transformers/generation/utils.py
2024-06-17 09:52:35 +02:00
Enno Hermann
4b6da4e7ba
refactor(stream_generator): update special tokens for transformers>=4.41.1
...
Fixes #31 . The handling of special tokens in `transformers` was changed in
https://github.com/huggingface/transformers/pull/30624 and
https://github.com/huggingface/transformers/pull/30746 . This updates the XTTS
streaming code accordingly.
2024-06-17 09:52:35 +02:00
Enno Hermann
81ac7abd58
Merge pull request #47 from idiap/numpy2
...
build: add numpy2 support
2024-06-17 08:48:18 +01:00
Enno Hermann
bd9b21d946
Merge pull request #44 from idiap/phoneme-cleaners
...
Add multilingual phoneme cleaner
2024-06-17 08:47:15 +01:00
Enno Hermann
4bc0e75a08
build: add numpy2 support
...
Identified necessary code changes with the NPY201 ruff rule. Gruut is the only
dependency that doesn't support numpy2 yet.
NB: At build time numpy>=2.0.0 should be required to be able to build wheels
compatible with both numpy1+2:
https://numpy.org/devdocs/dev/depending_on_numpy.html#numpy-2-abi-handling
2024-06-16 22:10:33 +02:00
ChristianRomberg
3a20f4725f
fix(freevc): use the specified device for pretrained speaker encoder ( #45 )
...
Fixes coqui-ai#3787
2024-06-16 21:24:03 +02:00
Enno Hermann
9cfcc0a0f5
chore(cleaners): add type hints
2024-06-14 15:20:04 +02:00
Enno Hermann
e5c208d254
feat(cleaners): add multilingual phoneme cleaner
...
This doesn't convert numbers into English words.
2024-06-14 15:06:03 +02:00
Enno Hermann
03de4b889e
docs: fix readthedocs links
...
[ci skip]
2024-06-13 22:48:34 +02:00
Enno Hermann
29e91f2e77
fix(utils.generic_utils): correctly call now()
2024-05-31 08:39:32 +02:00
Enno Hermann
77722cb0dd
fix(bin.synthesize): correctly handle boolean arguments
...
Previously, e.g. `--use_cuda false` would actually set use_cuda=True:
https://github.com/coqui-ai/TTS/discussions/3762
2024-05-31 08:39:32 +02:00
Enno Hermann
a682fa8d56
Merge pull request #33 from idiap/versions
...
Fix XTTS streaming
2024-05-29 14:16:36 +01:00
Enno Hermann
07cbcf825c
fix(espeak_wrapper): read phonemize() input from file
...
Avoids utf8 encoding issues on Windows when passing the text directly.
Fixes https://github.com/coqui-ai/TTS/discussions/3761
2024-05-29 10:10:05 +02:00
Enno Hermann
49fcbd908b
fix(espeak_wrapper): avoid stuck process on windows
...
Fixes #24
2024-05-29 07:39:03 +02:00
Enno Hermann
203f60f1e1
refactor(espeak_wrapper): remove sync argument
...
_espeak_exe is always called with sync=True, so remove code for sync==False
2024-05-28 21:30:55 +02:00
Enno Hermann
7df4c2fa47
fix: restore TTS.__version__ attribute
...
This is used by the TTS/bin/collect_env_info.py script with which users print
version information for bug reports. We restore the TTS.__version__ attribute so
that old versions of the script still work.
2024-05-28 09:35:55 +02:00
Enno Hermann
df088e99df
Merge pull request #19 from idiap/toml
...
Move from setup.py to pyproject.toml, simplify requirements
2024-05-27 08:59:09 +01:00
Enno Hermann
642cbd472f
Merge pull request #26 from idiap/server-output
...
fix(server): ensure logging output gets actually shown
2024-05-26 09:08:27 +01:00
Enno Hermann
ab7d84bf05
refactor(server): address linter issues
2024-05-23 08:42:21 +02:00
Enno Hermann
8503500d9d
chore(server): remove duplicate code
2024-05-20 12:45:47 +02:00
Enno Hermann
70bd84894d
fix(server): ensure logging output gets actually shown
2024-05-20 12:45:41 +02:00
Enno Hermann
018f1e6453
docs(bark): update docstrings and type hints
2024-05-15 22:56:55 +02:00
Enno Hermann
59a6c9fdf2
fix(bark): add missing argument for load_voice()
...
Fixes https://github.com/coqui-ai/TTS/issues/2795
2024-05-15 22:56:28 +02:00
Enno Hermann
6d563af623
chore: remove obsolete code for torch<2
...
Minimum torch version is 2.1 now.
2024-05-08 18:08:40 +02:00
Enno Hermann
865a48156d
fix: make korean g2p deps optional
2024-05-08 18:08:40 +02:00
Enno Hermann
55ed162f2a
fix: make chinese g2p deps optional
2024-05-08 18:08:40 +02:00
Enno Hermann
ea893c3795
fix: make bangla g2p deps optional
2024-05-08 18:08:40 +02:00
Enno Hermann
ec50006855
style: run pre-commit
...
Automatic changes from: pre-commit run --all-files
2024-05-08 12:17:47 +02:00
Enno Hermann
fb92e13ebb
build: remove unused/obsolete code
2024-05-08 12:13:41 +02:00
Enno Hermann
259d8fc40b
build: store version in pyproject.toml
2024-05-07 18:27:55 +02:00
Enno Hermann
962f9bbbcf
refactor(espeak_wrapper): fix ruff lint suggestions
2024-05-01 13:31:39 +02:00
Enno Hermann
7b2289a454
fix(espeak_wrapper): capture stderr separately
...
Fixes https://github.com/coqui-ai/TTS/issues/2728
Previously, error messages from espeak were treated as normal output and also
converted to phonemes. This captures and logs them separately.
2024-05-01 12:31:49 +02:00
Enno Hermann
06304504d2
Merge pull request #11 from idiap/py312
...
build: add python 3.12 support
2024-04-23 13:52:00 +02:00
Enno Hermann
2675e743b0
chore: update version to 0.23.1
...
[ci skip]
2024-04-23 09:57:43 +02:00
Enno Hermann
52a52b5e21
fix(LanguageManager): allow initialisation from config with language ids file
...
Previously, running `LanguageManager.init_from_config(config)` would never use
the `language_ids_file` if that field is present because it was overwritten in
the next line with a new manager that manually parses languages from the
datasets in the config. Now that is only used as a fallback.
2024-04-19 11:57:27 +02:00
Enno Hermann
f7d69cc1d7
chore: update version to 0.23.0
2024-04-11 17:01:09 +02:00
Enno Hermann
b3c9685aee
fix(tokenizer): add debug logging
2024-04-11 16:58:12 +02:00
Enno Hermann
2ad790d169
Merge pull request #4 from idiap/hindi
...
feat(xtts): support Hindi for sentence-splitting and fine-tuning
2024-04-11 16:49:44 +02:00
Enno Hermann
dfbe0168e9
Merge pull request #3 from idiap/logging
...
Use Python logging instead of print()
2024-04-11 08:34:44 +02:00
Enno Hermann
d41686502e
feat(xtts): support hindi for sentence-splitting and fine-tuning
...
The XTTS model itself already supports Hindi, it was just in these components.
2024-04-08 15:57:56 +02:00
Enno Hermann
aa40fd277b
docs: update links
2024-04-04 18:21:57 +02:00
Enno Hermann
e689fd1d4a
fix(utils.manage): remove bare except, improve messages
2024-04-03 15:19:45 +02:00
Enno Hermann
7dc5d1eb3d
fix: logging in executables
2024-04-03 15:19:45 +02:00
Enno Hermann
ab64844aba
feat(utils.generic_utils): add custom formatter for logging to console
2024-04-03 15:19:45 +02:00
Enno Hermann
9b2d48f8a6
feat(utils.generic_utils): improve setup_logger() arguments and output
2024-04-03 15:19:45 +02:00
Enno Hermann
b711e19cb6
refactor: remove verbose arguments
...
Can be handled by adjusting logging levels instead.
2024-04-03 15:19:45 +02:00
Enno Hermann
b6ab85a050
fix: use logging instead of print statements
...
Fixes #1691
2024-04-03 15:19:45 +02:00
Enno Hermann
dd3768d4b1
chore: update version to v0.22.1
2024-04-03 12:31:39 +02:00
Enno Hermann
d772724125
fix: update repository links, package names, metadata
2024-04-03 12:02:44 +02:00
Enno Hermann
7630abb43f
refactor(bin.find_unique_chars): use existing function
2024-03-30 22:22:40 +01:00
Enno Hermann
adbcba06da
refactor(dataset): get audio length with torchaudio
...
Removes a (GPL) dependency
2024-03-14 20:48:29 +01:00
Enno Hermann
e5c6da1c98
Merge pull request #20 from eginhard/return-complex
...
fix: torch.stft will soon require return_complex=True
2024-03-13 13:50:21 +01:00
Enno Hermann
e95f8950eb
fix: torch.stft will soon require return_complex=True
...
Refactor that removes the deprecation warning:
torch.view_as_real(torch.stft(*, return_complex=True)) is equal to
torch.stft(*, return_complex=False)
https://pytorch.org/docs/stable/generated/torch.stft.html
2024-03-13 12:06:27 +01:00
Enno Hermann
89a061f1d1
docs(tts.models.vits): clarify use of discriminator/generator
...
[ci skip]
2024-03-12 18:59:05 +01:00
Enno Hermann
a7753708fb
refactor: remove duplicate methods available in Trainer
2024-03-12 15:06:42 +01:00