Enno Hermann
89abd98620
feat(api): support passing speaker/language id file paths
2024-12-06 15:26:06 +01:00
Enno Hermann
a05177ce71
chore(api): add type hints
2024-12-06 15:26:06 +01:00
Enno Hermann
85dbb3b8b3
feat(api): allow mixing TTS and vocoder model name and path
2024-12-06 15:26:06 +01:00
Enno Hermann
1a4e58d0ce
feat(api): support passing a custom speaker encoder by path
2024-12-05 21:19:07 +01:00
Enno Hermann
5daed879e0
chore(bin.synthesize): remove unused argument
2024-12-05 21:19:07 +01:00
Enno Hermann
42ad9b00c6
feat(api): support specifying vocoders by name
2024-12-05 21:19:07 +01:00
Enno Hermann
5cfb4ecccd
refactor(api): require keyword arguments except for model_name
2024-12-05 21:19:07 +01:00
Enno Hermann
8c381e3e48
docs: use .to("cuda") instead of deprecated gpu=True
2024-12-05 21:19:07 +01:00
Enno Hermann
3539e65d8e
refactor(synthesizer): set sample rate in loading methods
2024-12-02 23:26:28 +01:00
Enno Hermann
7d0416f99b
refactor(vc): rename TTS.vc.modules to TTS.vc.layers for consistency
...
Same as in TTS.tts and TTS.vocoder
2024-12-02 23:26:28 +01:00
Enno Hermann
546f43cb25
refactor: only use keyword args in Synthesizer
2024-12-02 23:26:27 +01:00
Enno Hermann
6927e0bb89
fix(api): clearer error message when model doesn't support VC
2024-12-02 23:26:27 +01:00
Enno Hermann
fce3137e0d
feat: add openvoice vc model
2024-12-02 23:26:27 +01:00
Enno Hermann
ca02d0352b
feat(openvoice): add to .models.json
2024-12-02 22:34:56 +01:00
Enno Hermann
95998374bf
feat(openvoice): add config classes
2024-12-02 22:34:56 +01:00
Enno Hermann
b97d5378a5
refactor(openvoice): remove duplicate and unused code
2024-12-02 22:34:56 +01:00
Enno Hermann
4124b9d663
feat(vits): add tau parameter to posterior encoder
2024-12-02 22:34:56 +01:00
akulkarni
6de98ff480
feat(openvoice): initial integration
2024-12-02 22:34:56 +01:00
Enno Hermann
63625e79af
refactor: import get_last_checkpoint from trainer.io
2024-11-29 13:59:43 +01:00
Enno Hermann
170d3dae92
refactor: remove duplicate to_camel
2024-11-24 19:57:14 +01:00
Enno Hermann
7330ad8854
refactor: move duplicate alignment functions into helpers
2024-11-24 19:57:14 +01:00
Enno Hermann
76df6421de
refactor: move more audio processing into torch_transforms
2024-11-24 19:57:14 +01:00
Enno Hermann
b1ac884e07
refactor: move shared function into dataset.py
2024-11-24 19:57:14 +01:00
Enno Hermann
54f4228a46
refactor(xtts): use existing cleaners
2024-11-24 19:57:14 +01:00
Enno Hermann
b45a7a4220
refactor: move exists() and default() into generic_utils
2024-11-24 19:57:14 +01:00
Enno Hermann
fa844e0fb7
refactor(tacotron): remove duplicate function
2024-11-24 19:57:14 +01:00
Enno Hermann
0f69d31f70
refactor(vocoder): remove duplicate function
2024-11-24 19:57:14 +01:00
Enno Hermann
6ecf47312c
refactor(xtts): use tortoise conditioning encoder
2024-11-24 19:57:14 +01:00
Enno Hermann
69a599d403
refactor(freevc): remove duplicate code
2024-11-24 19:57:14 +01:00
Enno Hermann
2e5f68df6a
refactor(wavernn): remove duplicate Stretch2d
...
I checked that the implementations are the same
2024-11-23 01:04:17 +01:00
Enno Hermann
e63962c226
refactor(losses): move shared losses into losses.py
2024-11-23 01:04:17 +01:00
Enno Hermann
6f25c2b904
refactor(delightful_tts): remove unused classes
2024-11-23 01:04:17 +01:00
Enno Hermann
7cdfde226b
refactor: move amp_to_db/db_to_amp into torch_transforms
2024-11-23 01:04:17 +01:00
Enno Hermann
33ac0d6ee1
refactor(xtts): use build_hf_gpt_transformer from tortoise
2024-11-21 15:44:42 +01:00
Enno Hermann
490c973371
refactor(xtts): use position embedding from tortoise
2024-11-21 15:44:42 +01:00
Enno Hermann
5ffc0543b7
refactor(bark): remove custom layer norm
...
Pytorch LayerNorm supports bias=False since version 2.1
2024-11-21 15:44:42 +01:00
Enno Hermann
705551c60c
refactor(tortoise): remove unused do_checkpoint arguments
...
These are assigned but not used for anything.
2024-11-21 15:44:42 +01:00
Enno Hermann
4ba83f42ab
chore(tortoise): remove unused AudioMiniEncoder
...
There's one in tortoise.classifier that's actually used
2024-11-21 15:44:42 +01:00
Enno Hermann
66701e1e51
refactor(xtts): reuse functions/classes from tortoise
2024-11-21 12:51:42 +01:00
Enno Hermann
1f27f994a1
refactor(utils): remove duplicate set_partial_state_dict
2024-11-21 11:53:35 +01:00
Enno Hermann
1b6d3ebd33
refactor(xtts): remove duplicate hifigan generator
2024-11-21 11:53:35 +01:00
Enno Hermann
fbbae5ac6a
refactor(audio): remove duplicate rms_volume_norm function
2024-11-18 10:01:52 +01:00
Enno Hermann
8ba3233ec6
refactor(audio): remove duplicate save_wav code
2024-11-18 10:01:50 +01:00
Enno Hermann
5784f6705a
refactor(audio): improve type hints, address lint issues
2024-11-18 10:00:55 +01:00
Enno Hermann
48f5be2ccb
feat(audio): automatically convert audio to mono
2024-11-18 01:11:43 +01:00
Enno Hermann
627bbe4150
fix(xtts): more helpful error message when vocab.json not found
2024-11-17 00:30:32 +01:00
Enno Hermann
e81f8d079c
fix: more helpful error message when formatter is not found
2024-11-17 00:29:49 +01:00
Enno Hermann
993da778b4
chore: use original instead of scarf urls
...
These allowed Coqui to get download stats, which we don't need anymore
2024-11-10 22:19:25 +01:00
Enno Hermann
2df9bfa78e
refactor: handle deprecation of torch.cuda.amp.autocast ( #144 )
...
torch.cuda.amp.autocast(args...) and torch.cpu.amp.autocast(args...) will be
deprecated. Please use torch.autocast("cuda", args...) or torch.autocast("cpu",
args...) instead.
https://pytorch.org/docs/stable/amp.html
2024-11-09 18:37:08 +01:00
Shavit
540e8d6cf2
fix(bin.synthesize): return speakers names only ( #147 )
2024-11-09 18:35:54 +01:00