Enno Hermann
6a52c8a855
fix(bin): log to stdout in cli tools, unless pipe_out is set
...
This way the outputs are available for further downstream processing, e.g. with
grep. For TTS/bin/synthesize.py, if --pipe_out is set, log to stderr because
then only the output audio stream should be on stdout, e.g. to pipe it to aplay.
2024-12-17 11:38:39 +01:00
Enno Hermann
9d5fc60a5d
feat(manager): print download location when listing models ( #213 )
2024-12-16 10:28:25 +01:00
Enno Hermann
0df04cc259
docs: add notes about xtts fine-tuning
2024-12-14 16:19:38 +01:00
Enno Hermann
a425ba599d
feat: allow both Path and strings where possible and add type hints
2024-12-14 16:19:38 +01:00
Enno Hermann
e38dcbea7a
docs: streamline readme and reuse content in other docs pages
...
[ci skip]
2024-12-12 18:29:23 +01:00
Enno Hermann
849e75e967
docs: improve documentation
2024-12-12 18:23:17 +01:00
Enno Hermann
c0d9ed3d18
fix: handle difference in xtts/tortoise attention ( #199 )
2024-12-09 16:13:13 +01:00
Enno Hermann
b545ab8b80
Merge pull request #197 from idiap/api
...
Expand Python API capabilities
2024-12-06 18:02:54 +01:00
Enno Hermann
e0f621180f
refactor(bin.synthesize): use Python API for CLI
2024-12-06 17:07:54 +01:00
Enno Hermann
806af96e4c
refactor(api): use save_wav() from Synthesizer instance
2024-12-06 15:26:06 +01:00
Enno Hermann
89abd98620
feat(api): support passing speaker/language id file paths
2024-12-06 15:26:06 +01:00
Enno Hermann
a05177ce71
chore(api): add type hints
2024-12-06 15:26:06 +01:00
Enno Hermann
85dbb3b8b3
feat(api): allow mixing TTS and vocoder model name and path
2024-12-06 15:26:06 +01:00
Enno Hermann
e8d99aaf2b
Merge pull request #184 from idiap/xtts-error
...
fix(xtts): clearer error message when file given to checkpoint_dir
2024-12-06 06:46:48 +01:00
Enno Hermann
1a4e58d0ce
feat(api): support passing a custom speaker encoder by path
2024-12-05 21:19:07 +01:00
Enno Hermann
5daed879e0
chore(bin.synthesize): remove unused argument
2024-12-05 21:19:07 +01:00
Enno Hermann
42ad9b00c6
feat(api): support specifying vocoders by name
2024-12-05 21:19:07 +01:00
Enno Hermann
5cfb4ecccd
refactor(api): require keyword arguments except for model_name
2024-12-05 21:19:07 +01:00
Enno Hermann
8c381e3e48
docs: use .to("cuda") instead of deprecated gpu=True
2024-12-05 21:19:07 +01:00
Enno Hermann
fe14ca6b68
refactor(xtts): remove duplicate xtts audio config
2024-12-05 15:46:28 +01:00
Enno Hermann
3539e65d8e
refactor(synthesizer): set sample rate in loading methods
2024-12-02 23:26:28 +01:00
Enno Hermann
7d0416f99b
refactor(vc): rename TTS.vc.modules to TTS.vc.layers for consistency
...
Same as in TTS.tts and TTS.vocoder
2024-12-02 23:26:28 +01:00
Enno Hermann
546f43cb25
refactor: only use keyword args in Synthesizer
2024-12-02 23:26:27 +01:00
Enno Hermann
6927e0bb89
fix(api): clearer error message when model doesn't support VC
2024-12-02 23:26:27 +01:00
Enno Hermann
fce3137e0d
feat: add openvoice vc model
2024-12-02 23:26:27 +01:00
Enno Hermann
ca02d0352b
feat(openvoice): add to .models.json
2024-12-02 22:34:56 +01:00
Enno Hermann
95998374bf
feat(openvoice): add config classes
2024-12-02 22:34:56 +01:00
Enno Hermann
b97d5378a5
refactor(openvoice): remove duplicate and unused code
2024-12-02 22:34:56 +01:00
Enno Hermann
4124b9d663
feat(vits): add tau parameter to posterior encoder
2024-12-02 22:34:56 +01:00
akulkarni
6de98ff480
feat(openvoice): initial integration
2024-12-02 22:34:56 +01:00
Enno Hermann
ce202532cf
fix(xtts): clearer error message when file given to checkpoint_dir
2024-12-02 16:54:11 +01:00
Enno Hermann
63625e79af
refactor: import get_last_checkpoint from trainer.io
2024-11-29 13:59:43 +01:00
Enno Hermann
170d3dae92
refactor: remove duplicate to_camel
2024-11-24 19:57:14 +01:00
Enno Hermann
7330ad8854
refactor: move duplicate alignment functions into helpers
2024-11-24 19:57:14 +01:00
Enno Hermann
76df6421de
refactor: move more audio processing into torch_transforms
2024-11-24 19:57:14 +01:00
Enno Hermann
b1ac884e07
refactor: move shared function into dataset.py
2024-11-24 19:57:14 +01:00
Enno Hermann
54f4228a46
refactor(xtts): use existing cleaners
2024-11-24 19:57:14 +01:00
Enno Hermann
b45a7a4220
refactor: move exists() and default() into generic_utils
2024-11-24 19:57:14 +01:00
Enno Hermann
fa844e0fb7
refactor(tacotron): remove duplicate function
2024-11-24 19:57:14 +01:00
Enno Hermann
0f69d31f70
refactor(vocoder): remove duplicate function
2024-11-24 19:57:14 +01:00
Enno Hermann
6ecf47312c
refactor(xtts): use tortoise conditioning encoder
2024-11-24 19:57:14 +01:00
Enno Hermann
69a599d403
refactor(freevc): remove duplicate code
2024-11-24 19:57:14 +01:00
Enno Hermann
2e5f68df6a
refactor(wavernn): remove duplicate Stretch2d
...
I checked that the implementations are the same
2024-11-23 01:04:17 +01:00
Enno Hermann
e63962c226
refactor(losses): move shared losses into losses.py
2024-11-23 01:04:17 +01:00
Enno Hermann
6f25c2b904
refactor(delightful_tts): remove unused classes
2024-11-23 01:04:17 +01:00
Enno Hermann
7cdfde226b
refactor: move amp_to_db/db_to_amp into torch_transforms
2024-11-23 01:04:17 +01:00
Enno Hermann
33ac0d6ee1
refactor(xtts): use build_hf_gpt_transformer from tortoise
2024-11-21 15:44:42 +01:00
Enno Hermann
490c973371
refactor(xtts): use position embedding from tortoise
2024-11-21 15:44:42 +01:00
Enno Hermann
5ffc0543b7
refactor(bark): remove custom layer norm
...
Pytorch LayerNorm supports bias=False since version 2.1
2024-11-21 15:44:42 +01:00
Enno Hermann
705551c60c
refactor(tortoise): remove unused do_checkpoint arguments
...
These are assigned but not used for anything.
2024-11-21 15:44:42 +01:00