Enno Hermann
59ef28d708
build: move umap-learn into optional notebook dependencies
...
Outside of notebooks, it is only used to show embedding plots during speaker
encoder training; if it is missing there, a warning now asks the user to install it.
2024-06-26 23:53:17 +02:00
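
A minimal sketch of the optional-dependency pattern described in the commit above. It is not taken from the repository: the helper names `optional_module_available` and `plot_embeddings` and the extra name "notebooks" are assumptions made for illustration.

```python
import importlib.util
import warnings


def optional_module_available(module_name: str, extra: str) -> bool:
    """Return True if module_name can be imported; otherwise warn and return False."""
    if importlib.util.find_spec(module_name) is not None:
        return True
    warnings.warn(
        f"{module_name} is not installed, skipping the embedding plot. "
        f"Install the optional '{extra}' dependencies to enable it.",
        stacklevel=2,
    )
    return False


def plot_embeddings(embeddings: list, module_name: str = "umap") -> bool:
    """Hypothetical plotting hook; returns False when the plot is skipped."""
    # "notebooks" is an assumed name for the optional dependency group.
    if not optional_module_available(module_name, extra="notebooks"):
        return False
    # The actual projection and plotting would happen here.
    return True
```

With this shape, training continues without the plot when the package is absent, and the user sees a single warning instead of an ImportError.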
Enno Hermann
ff2cd5c97d
Merge pull request #49 from idiap/vc-refactors
...
VC-related refactors and fixes
2024-06-26 14:01:21 +01:00
Enno Hermann
4bd3df2607
refactor: remove duplicate get_padding
2024-06-26 11:54:36 +02:00
Enno Hermann
c30fb0f56b
chore: remove duplicate init_weights
2024-06-26 11:46:37 +02:00
Enno Hermann
c5241d71ab
chore: address pytorch deprecations
...
torch.range(a, b) == torch.arange(a, b+1)
meshgrid indexing: https://github.com/pytorch/pytorch/issues/50276
checkpoint use_reentrant:
https://dev-discuss.pytorch.org/t/bc-breaking-update-to-torch-utils-checkpoint-not-passing-in-use-reentrant-flag-will-raise-an-error/1745
optimizer.step() before scheduler.step():
https://pytorch.org/docs/stable/optim.html#how-to-adjust-learning-rate
2024-06-26 11:38:25 +02:00
Enno Hermann
a755328e49
refactor(freevc): remove duplicate sequence_mask
2024-06-26 10:17:04 +02:00
Enno Hermann
f8df19a10c
refactor: remove duplicate convert_pad_shape
2024-06-26 10:17:04 +02:00
Enno Hermann
cd7b6daf46
fix: clarify types, fix missing functions
2024-06-26 10:17:04 +02:00
Enno Hermann
d65bcf65bb
chore(freevc): remove duplicate DDSConv and ElementwiseAffine
...
Already exist as:
TTS.tts.layers.vits.stochastic_duration_predictor.DilatedDepthSeparableConv
TTS.tts.layers.vits.stochastic_duration_predictor.ElementwiseAffine
2024-06-26 10:17:04 +02:00
Enno Hermann
9f80e043e4
refactor(freevc): use existing layernorm
2024-06-26 10:17:04 +02:00
Enno Hermann
857cd55ce5
test(helpers): fix test_rand_segment, test_generate_path
2024-06-26 10:16:46 +02:00
Enno Hermann
c9f7197862
test(helpers): add test_ prefix so tests actually run
2024-06-25 23:03:40 +02:00
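
The commit above relies on the convention that test runners only collect functions and methods whose names start with `test`. The sketch below shows this with the standard-library `unittest` loader; the class and method names are hypothetical, and the repository may use a different runner with the same convention.

```python
import unittest


class HelperTests(unittest.TestCase):
    def rand_segment(self):
        # Missing "test_" prefix: the loader never collects this method,
        # so its failure would go unnoticed.
        self.fail("never executed by the test runner")

    def test_generate_path(self):
        # Has the prefix, so it is collected and run.
        self.assertEqual(1 + 1, 2)


# Names the default loader would actually run for this class.
collected = unittest.TestLoader().getTestCaseNames(HelperTests)
```

Here `collected` contains only `test_generate_path`, which is why a missing prefix can hide a broken test.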
Enno Hermann
98c0f86cb3
Merge pull request #46 from idiap/fix-xtts-streaming
...
Fix XTTS streaming for transformers update
2024-06-18 14:54:15 +01:00
Enno Hermann
4d9e18ea7d
chore(stream_generator): address lint issues
2024-06-17 09:52:35 +02:00
Enno Hermann
2a281237d7
refactor(stream_generator): update code for transformers>=4.41.1
...
In line with
eed9ed6798/src/transformers/generation/utils.py
2024-06-17 09:52:35 +02:00
Enno Hermann
4b6da4e7ba
refactor(stream_generator): update special tokens for transformers>=4.41.1
...
Fixes #31. The handling of special tokens in `transformers` was changed in
https://github.com/huggingface/transformers/pull/30624 and
https://github.com/huggingface/transformers/pull/30746 . This updates the XTTS
streaming code accordingly.
2024-06-17 09:52:35 +02:00
Enno Hermann
81ac7abd58
Merge pull request #47 from idiap/numpy2
...
build: add numpy2 support
2024-06-17 08:48:18 +01:00
Enno Hermann
bd9b21d946
Merge pull request #44 from idiap/phoneme-cleaners
...
Add multilingual phoneme cleaner
2024-06-17 08:47:15 +01:00
Enno Hermann
4bc0e75a08
build: add numpy2 support
...
Identified necessary code changes with the NPY201 ruff rule. Gruut is the only
dependency that doesn't support numpy2 yet.
NB: numpy>=2.0.0 should be required at build time so that the built wheels are
compatible with both numpy 1 and numpy 2:
https://numpy.org/devdocs/dev/depending_on_numpy.html#numpy-2-abi-handling
2024-06-16 22:10:33 +02:00
ChristianRomberg
3a20f4725f
fix(freevc): use the specified device for pretrained speaker encoder (#45)
...
Fixes coqui-ai#3787
2024-06-16 21:24:03 +02:00
Enno Hermann
9cfcc0a0f5
chore(cleaners): add type hints
2024-06-14 15:20:04 +02:00
Enno Hermann
a1495d4bc1
fix(recipes): use multilingual phoneme cleaner in non-english recipes
2024-06-14 15:09:01 +02:00
Enno Hermann
e5c208d254
feat(cleaners): add multilingual phoneme cleaner
...
This doesn't convert numbers into English words.
2024-06-14 15:06:03 +02:00
Enno Hermann
063e9e9de9
Merge pull request #38 from idiap/cli
...
Improve CLI handling of boolean arguments
2024-06-14 13:20:35 +01:00
Enno Hermann
03de4b889e
docs: fix readthedocs links
...
[ci skip]
2024-06-13 22:48:34 +02:00
Enno Hermann
bdd44cf28a
docs: update readme
2024-05-31 08:39:32 +02:00
Enno Hermann
29e91f2e77
fix(utils.generic_utils): correctly call now()
2024-05-31 08:39:32 +02:00
Enno Hermann
77722cb0dd
fix(bin.synthesize): correctly handle boolean arguments
...
Previously, e.g. `--use_cuda false` would actually set use_cuda=True:
https://github.com/coqui-ai/TTS/discussions/3762
2024-05-31 08:39:32 +02:00
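
One common way a flag like `--use_cuda false` ends up as True is `type=bool` in argparse, because `bool("false")` is True for any non-empty string. The sketch below reproduces that pitfall and shows one possible fix with an explicit converter. Whether the commit above used this exact approach is not stated in the log; `str2bool` is a hypothetical helper.

```python
import argparse


def str2bool(value: str) -> bool:
    """Convert common textual booleans; reject anything else."""
    lowered = value.lower()
    if lowered in ("true", "1", "yes", "y"):
        return True
    if lowered in ("false", "0", "no", "n"):
        return False
    raise argparse.ArgumentTypeError(f"expected a boolean, got {value!r}")


# Pitfall: bool() of any non-empty string is True.
buggy = argparse.ArgumentParser()
buggy.add_argument("--use_cuda", type=bool, default=False)

# Possible fix: parse the string explicitly.
fixed = argparse.ArgumentParser()
fixed.add_argument("--use_cuda", type=str2bool, default=False)

buggy_args = buggy.parse_args(["--use_cuda", "false"])
fixed_args = fixed.parse_args(["--use_cuda", "false"])
```

`buggy_args.use_cuda` is True while `fixed_args.use_cuda` is False. `argparse.BooleanOptionalAction` (Python 3.9+) is an alternative that gives `--use_cuda` / `--no-use_cuda` flags instead.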
Enno Hermann
a682fa8d56
Merge pull request #33 from idiap/versions
...
Fix XTTS streaming
2024-05-29 14:16:36 +01:00
Enno Hermann
c5f3d63c67
Merge pull request #34 from idiap/espeak
...
Fix Espeak issues on Windows
2024-05-29 14:16:10 +01:00
Enno Hermann
07cbcf825c
fix(espeak_wrapper): read phonemize() input from file
...
Avoids utf8 encoding issues on Windows when passing the text directly.
Fixes https://github.com/coqui-ai/TTS/discussions/3761
2024-05-29 10:10:05 +02:00
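
A rough sketch of the idea in the commit above: write the text to a UTF-8 file and pass the file path to espeak, so the text never goes through the command-line encoding. This is not the repository's code; `build_espeak_command` is a hypothetical helper and the flag selection is an assumption (`-f` reads input text from a file in espeak and espeak-ng).

```python
import tempfile
from pathlib import Path


def build_espeak_command(text: str, voice: str = "en-us") -> tuple[list[str], Path]:
    """Write text to a UTF-8 temp file and return (command, file path)."""
    # delete=False so the file can be reopened by another process on Windows;
    # the caller is responsible for removing it afterwards.
    with tempfile.NamedTemporaryFile(
        "w", suffix=".txt", encoding="utf-8", delete=False
    ) as handle:
        handle.write(text)
        text_path = Path(handle.name)
    # Only the ASCII file path is passed on the command line, not the text itself.
    command = ["espeak-ng", "-q", "--ipa", "-v", voice, "-f", str(text_path)]
    return command, text_path
```

The returned command can then be handed to `subprocess.run`, followed by `text_path.unlink()` once the process has finished.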
Enno Hermann
03430de88e
chore: bump version to 0.24.1
2024-05-29 09:57:55 +02:00
Enno Hermann
49fcbd908b
fix(espeak_wrapper): avoid stuck process on windows
...
Fixes #24
2024-05-29 07:39:03 +02:00
Enno Hermann
203f60f1e1
refactor(espeak_wrapper): remove sync argument
...
_espeak_exe is always called with sync=True, so remove code for sync==False
2024-05-28 21:30:55 +02:00
Enno Hermann
df4a1f5ea6
docs: update readme
...
[ci skip]
2024-05-28 16:19:52 +02:00
Enno Hermann
dc629f825d
build: set upper version limit for transformers
...
transformers>=4.41 breaks XTTS streaming, see #31
2024-05-28 12:29:42 +02:00
Enno Hermann
7df4c2fa47
fix: restore TTS.__version__ attribute
...
This is used by the TTS/bin/collect_env_info.py script with which users print
version information for bug reports. We restore the TTS.__version__ attribute so
that old versions of the script still work.
2024-05-28 09:35:55 +02:00
Enno Hermann
df088e99df
Merge pull request #19 from idiap/toml
...
Move from setup.py to pyproject.toml, simplify requirements
2024-05-27 08:59:09 +01:00
Enno Hermann
642cbd472f
Merge pull request #26 from idiap/server-output
...
fix(server): ensure logging output gets actually shown
2024-05-26 09:08:27 +01:00
Enno Hermann
7bf9033e53
chore: update repo info
...
[ci skip]
2024-05-25 17:33:44 +02:00
Enno Hermann
ab7d84bf05
refactor(server): address linter issues
2024-05-23 08:42:21 +02:00
Enno Hermann
8503500d9d
chore(server): remove duplicate code
2024-05-20 12:45:47 +02:00
Enno Hermann
70bd84894d
fix(server): ensure logging output gets actually shown
2024-05-20 12:45:41 +02:00
Enno Hermann
602325021b
chore: update version to 0.24.0
...
[ci skip]
2024-05-16 18:05:51 +02:00
Enno Hermann
924f42e3af
ci: update release workflow
...
[ci skip]
2024-05-16 18:03:09 +02:00
Enno Hermann
d73c9ccba3
Merge pull request #22 from idiap/bark
...
fix(bark): add missing argument for load_voice()
2024-05-16 15:21:33 +01:00
Enno Hermann
018f1e6453
docs(bark): update docstrings and type hints
2024-05-15 22:56:55 +02:00
Enno Hermann
59a6c9fdf2
fix(bark): add missing argument for load_voice()
...
Fixes https://github.com/coqui-ai/TTS/issues/2795
2024-05-15 22:56:28 +02:00
Enno Hermann
6d563af623
chore: remove obsolete code for torch<2
...
Minimum torch version is 2.1 now.
2024-05-08 18:08:40 +02:00
Enno Hermann
865a48156d
fix: make korean g2p deps optional
2024-05-08 18:08:40 +02:00