Commit Graph

2135 Commits

Author SHA1 Message Date
Enno Hermann 3539e65d8e refactor(synthesizer): set sample rate in loading methods 2024-12-02 23:26:28 +01:00
Enno Hermann 7d0416f99b refactor(vc): rename TTS.vc.modules to TTS.vc.layers for consistency
Same as in TTS.tts and TTS.vocoder
2024-12-02 23:26:28 +01:00
Enno Hermann 546f43cb25 refactor: only use keyword args in Synthesizer 2024-12-02 23:26:27 +01:00
Enno Hermann 6927e0bb89 fix(api): clearer error message when model doesn't support VC 2024-12-02 23:26:27 +01:00
Enno Hermann fce3137e0d feat: add openvoice vc model 2024-12-02 23:26:27 +01:00
Enno Hermann ca02d0352b feat(openvoice): add to .models.json 2024-12-02 22:34:56 +01:00
Enno Hermann 95998374bf feat(openvoice): add config classes 2024-12-02 22:34:56 +01:00
Enno Hermann b97d5378a5 refactor(openvoice): remove duplicate and unused code 2024-12-02 22:34:56 +01:00
Enno Hermann 4124b9d663 feat(vits): add tau parameter to posterior encoder 2024-12-02 22:34:56 +01:00
akulkarni 6de98ff480 feat(openvoice): initial integration 2024-12-02 22:34:56 +01:00
Enno Hermann 63625e79af refactor: import get_last_checkpoint from trainer.io 2024-11-29 13:59:43 +01:00
Enno Hermann 170d3dae92 refactor: remove duplicate to_camel 2024-11-24 19:57:14 +01:00
Enno Hermann 7330ad8854 refactor: move duplicate alignment functions into helpers 2024-11-24 19:57:14 +01:00
Enno Hermann 76df6421de refactor: move more audio processing into torch_transforms 2024-11-24 19:57:14 +01:00
Enno Hermann b1ac884e07 refactor: move shared function into dataset.py 2024-11-24 19:57:14 +01:00
Enno Hermann 54f4228a46 refactor(xtts): use existing cleaners 2024-11-24 19:57:14 +01:00
Enno Hermann b45a7a4220 refactor: move exists() and default() into generic_utils 2024-11-24 19:57:14 +01:00
Enno Hermann fa844e0fb7 refactor(tacotron): remove duplicate function 2024-11-24 19:57:14 +01:00
Enno Hermann 0f69d31f70 refactor(vocoder): remove duplicate function 2024-11-24 19:57:14 +01:00
Enno Hermann 6ecf47312c refactor(xtts): use tortoise conditioning encoder 2024-11-24 19:57:14 +01:00
Enno Hermann 69a599d403 refactor(freevc): remove duplicate code 2024-11-24 19:57:14 +01:00
Enno Hermann 2e5f68df6a refactor(wavernn): remove duplicate Stretch2d
I checked that the implementations are the same
2024-11-23 01:04:17 +01:00
Enno Hermann e63962c226 refactor(losses): move shared losses into losses.py 2024-11-23 01:04:17 +01:00
Enno Hermann 6f25c2b904 refactor(delightful_tts): remove unused classes 2024-11-23 01:04:17 +01:00
Enno Hermann 7cdfde226b refactor: move amp_to_db/db_to_amp into torch_transforms 2024-11-23 01:04:17 +01:00
Enno Hermann 33ac0d6ee1 refactor(xtts): use build_hf_gpt_transformer from tortoise 2024-11-21 15:44:42 +01:00
Enno Hermann 490c973371 refactor(xtts): use position embedding from tortoise 2024-11-21 15:44:42 +01:00
Enno Hermann 5ffc0543b7 refactor(bark): remove custom layer norm
Pytorch LayerNorm supports bias=False since version 2.1
2024-11-21 15:44:42 +01:00
Enno Hermann 705551c60c refactor(tortoise): remove unused do_checkpoint arguments
These are assigned but not used for anything.
2024-11-21 15:44:42 +01:00
Enno Hermann 4ba83f42ab chore(tortoise): remove unused AudioMiniEncoder
There's one in tortoise.classifier that's actually used
2024-11-21 15:44:42 +01:00
Enno Hermann 66701e1e51 refactor(xtts): reuse functions/classes from tortoise 2024-11-21 12:51:42 +01:00
Enno Hermann 1f27f994a1 refactor(utils): remove duplicate set_partial_state_dict 2024-11-21 11:53:35 +01:00
Enno Hermann 1b6d3ebd33 refactor(xtts): remove duplicate hifigan generator 2024-11-21 11:53:35 +01:00
Enno Hermann fbbae5ac6a refactor(audio): remove duplicate rms_volume_norm function 2024-11-18 10:01:52 +01:00
Enno Hermann 8ba3233ec6 refactor(audio): remove duplicate save_wav code 2024-11-18 10:01:50 +01:00
Enno Hermann 5784f6705a refactor(audio): improve type hints, address lint issues 2024-11-18 10:00:55 +01:00
Enno Hermann 48f5be2ccb feat(audio): automatically convert audio to mono 2024-11-18 01:11:43 +01:00
Enno Hermann 627bbe4150 fix(xtts): more helpful error message when vocab.json not found 2024-11-17 00:30:32 +01:00
Enno Hermann e81f8d079c fix: more helpful error message when formatter is not found 2024-11-17 00:29:49 +01:00
Enno Hermann 993da778b4 chore: use original instead of scarf urls
These allowed Coqui to get download stats, which we don't need anymore
2024-11-10 22:19:25 +01:00
Enno Hermann 2df9bfa78e
refactor: handle deprecation of torch.cuda.amp.autocast (#144)
torch.cuda.amp.autocast(args...) and torch.cpu.amp.autocast(args...) will be
deprecated. Please use torch.autocast("cuda", args...) or torch.autocast("cpu",
args...) instead.

https://pytorch.org/docs/stable/amp.html
2024-11-09 18:37:08 +01:00
Shavit 540e8d6cf2
fix(bin.synthesize): return speakers names only (#147) 2024-11-09 18:35:54 +01:00
Enno Hermann 0971bc236e refactor: use external package for monotonic alignment 2024-11-08 12:35:04 +01:00
Enno Hermann 6314032fd7
Merge pull request #113 from idiap/pytorch
fix: only enable load with weights_only in pytorch>=2.4
2024-11-04 22:14:42 +01:00
Enno Hermann 8e66be2c32 fix: only enable load with weights_only in pytorch>=2.4
Allows moving the minimum Pytorch version back to 2.1
2024-11-04 18:39:51 +01:00
Enno Hermann 47ad0bf190
fix(text.characters): add nasal diacritic (#127) 2024-11-04 17:40:23 +01:00
Enno Hermann 964b813235 fix(gpt): set attention mask and address other warnings 2024-10-25 18:49:04 +02:00
Johnny Street 018d4ba1db fix(xtts): support transformers>=4.43.0 in streaming inference 2024-10-20 15:20:26 +02:00
Shavit 36611a7192
feat: normalize unicode characters in text cleaners (#85)
* Add normalizer type C to text cleaners

* Linter recommendations

* Add unicode normalize to every cleaner

* Format test_text_cleaners.py
2024-10-02 17:01:19 +02:00
Enno Hermann e5dd06b3bb
Merge pull request #77 from shavit/71-torch-load
Load weights only in torch.load
2024-09-12 23:28:57 +01:00