Enno Hermann
|
0df04cc259
|
docs: add notes about xtts fine-tuning
|
2024-12-14 16:19:38 +01:00 |
Enno Hermann
|
a425ba599d
|
feat: allow both Path and strings where possible and add type hints
|
2024-12-14 16:19:38 +01:00 |
Enno Hermann
|
e38dcbea7a
|
docs: streamline readme and reuse content in other docs pages
[ci skip]
|
2024-12-12 18:29:23 +01:00 |
Enno Hermann
|
849e75e967
|
docs: improve documentation
|
2024-12-12 18:23:17 +01:00 |
Enno Hermann
|
c0d9ed3d18
|
fix: handle difference in xtts/tortoise attention (#199)
|
2024-12-09 16:13:13 +01:00 |
Enno Hermann
|
b545ab8b80
|
Merge pull request #197 from idiap/api
Expand Python API capabilities
|
2024-12-06 18:02:54 +01:00 |
Enno Hermann
|
e0f621180f
|
refactor(bin.synthesize): use Python API for CLI
|
2024-12-06 17:07:54 +01:00 |
Enno Hermann
|
806af96e4c
|
refactor(api): use save_wav() from Synthesizer instance
|
2024-12-06 15:26:06 +01:00 |
Enno Hermann
|
89abd98620
|
feat(api): support passing speaker/language id file paths
|
2024-12-06 15:26:06 +01:00 |
Enno Hermann
|
a05177ce71
|
chore(api): add type hints
|
2024-12-06 15:26:06 +01:00 |
Enno Hermann
|
85dbb3b8b3
|
feat(api): allow mixing TTS and vocoder model name and path
|
2024-12-06 15:26:06 +01:00 |
Enno Hermann
|
e8d99aaf2b
|
Merge pull request #184 from idiap/xtts-error
fix(xtts): clearer error message when file given to checkpoint_dir
|
2024-12-06 06:46:48 +01:00 |
Enno Hermann
|
1a4e58d0ce
|
feat(api): support passing a custom speaker encoder by path
|
2024-12-05 21:19:07 +01:00 |
Enno Hermann
|
5daed879e0
|
chore(bin.synthesize): remove unused argument
|
2024-12-05 21:19:07 +01:00 |
Enno Hermann
|
42ad9b00c6
|
feat(api): support specifying vocoders by name
|
2024-12-05 21:19:07 +01:00 |
Enno Hermann
|
5cfb4ecccd
|
refactor(api): require keyword arguments except for model_name
|
2024-12-05 21:19:07 +01:00 |
Enno Hermann
|
8c381e3e48
|
docs: use .to("cuda") instead of deprecated gpu=True
|
2024-12-05 21:19:07 +01:00 |
Enno Hermann
|
fe14ca6b68
|
refactor(xtts): remove duplicate xtts audio config
|
2024-12-05 15:46:28 +01:00 |
Enno Hermann
|
3539e65d8e
|
refactor(synthesizer): set sample rate in loading methods
|
2024-12-02 23:26:28 +01:00 |
Enno Hermann
|
7d0416f99b
|
refactor(vc): rename TTS.vc.modules to TTS.vc.layers for consistency
Same as in TTS.tts and TTS.vocoder
|
2024-12-02 23:26:28 +01:00 |
Enno Hermann
|
546f43cb25
|
refactor: only use keyword args in Synthesizer
|
2024-12-02 23:26:27 +01:00 |
Enno Hermann
|
6927e0bb89
|
fix(api): clearer error message when model doesn't support VC
|
2024-12-02 23:26:27 +01:00 |
Enno Hermann
|
fce3137e0d
|
feat: add openvoice vc model
|
2024-12-02 23:26:27 +01:00 |
Enno Hermann
|
ca02d0352b
|
feat(openvoice): add to .models.json
|
2024-12-02 22:34:56 +01:00 |
Enno Hermann
|
95998374bf
|
feat(openvoice): add config classes
|
2024-12-02 22:34:56 +01:00 |
Enno Hermann
|
b97d5378a5
|
refactor(openvoice): remove duplicate and unused code
|
2024-12-02 22:34:56 +01:00 |
Enno Hermann
|
4124b9d663
|
feat(vits): add tau parameter to posterior encoder
|
2024-12-02 22:34:56 +01:00 |
akulkarni
|
6de98ff480
|
feat(openvoice): initial integration
|
2024-12-02 22:34:56 +01:00 |
Enno Hermann
|
ce202532cf
|
fix(xtts): clearer error message when file given to checkpoint_dir
|
2024-12-02 16:54:11 +01:00 |
Enno Hermann
|
63625e79af
|
refactor: import get_last_checkpoint from trainer.io
|
2024-11-29 13:59:43 +01:00 |
Enno Hermann
|
170d3dae92
|
refactor: remove duplicate to_camel
|
2024-11-24 19:57:14 +01:00 |
Enno Hermann
|
7330ad8854
|
refactor: move duplicate alignment functions into helpers
|
2024-11-24 19:57:14 +01:00 |
Enno Hermann
|
76df6421de
|
refactor: move more audio processing into torch_transforms
|
2024-11-24 19:57:14 +01:00 |
Enno Hermann
|
b1ac884e07
|
refactor: move shared function into dataset.py
|
2024-11-24 19:57:14 +01:00 |
Enno Hermann
|
54f4228a46
|
refactor(xtts): use existing cleaners
|
2024-11-24 19:57:14 +01:00 |
Enno Hermann
|
b45a7a4220
|
refactor: move exists() and default() into generic_utils
|
2024-11-24 19:57:14 +01:00 |
Enno Hermann
|
fa844e0fb7
|
refactor(tacotron): remove duplicate function
|
2024-11-24 19:57:14 +01:00 |
Enno Hermann
|
0f69d31f70
|
refactor(vocoder): remove duplicate function
|
2024-11-24 19:57:14 +01:00 |
Enno Hermann
|
6ecf47312c
|
refactor(xtts): use tortoise conditioning encoder
|
2024-11-24 19:57:14 +01:00 |
Enno Hermann
|
69a599d403
|
refactor(freevc): remove duplicate code
|
2024-11-24 19:57:14 +01:00 |
Enno Hermann
|
2e5f68df6a
|
refactor(wavernn): remove duplicate Stretch2d
I checked that the implementations are the same
|
2024-11-23 01:04:17 +01:00 |
Enno Hermann
|
e63962c226
|
refactor(losses): move shared losses into losses.py
|
2024-11-23 01:04:17 +01:00 |
Enno Hermann
|
6f25c2b904
|
refactor(delightful_tts): remove unused classes
|
2024-11-23 01:04:17 +01:00 |
Enno Hermann
|
7cdfde226b
|
refactor: move amp_to_db/db_to_amp into torch_transforms
|
2024-11-23 01:04:17 +01:00 |
Enno Hermann
|
33ac0d6ee1
|
refactor(xtts): use build_hf_gpt_transformer from tortoise
|
2024-11-21 15:44:42 +01:00 |
Enno Hermann
|
490c973371
|
refactor(xtts): use position embedding from tortoise
|
2024-11-21 15:44:42 +01:00 |
Enno Hermann
|
5ffc0543b7
|
refactor(bark): remove custom layer norm
Pytorch LayerNorm supports bias=False since version 2.1
|
2024-11-21 15:44:42 +01:00 |
Enno Hermann
|
705551c60c
|
refactor(tortoise): remove unused do_checkpoint arguments
These are assigned but not used for anything.
|
2024-11-21 15:44:42 +01:00 |
Enno Hermann
|
4ba83f42ab
|
chore(tortoise): remove unused AudioMiniEncoder
There's one in tortoise.classifier that's actually used
|
2024-11-21 15:44:42 +01:00 |
Enno Hermann
|
66701e1e51
|
refactor(xtts): reuse functions/classes from tortoise
|
2024-11-21 12:51:42 +01:00 |