Edresson Casanova
a45dfd6266
Add syntacc training recipe
2023-10-14 16:41:05 -03:00
Edresson Casanova
2bdc7a5675
Add text encoder based on adaptive Weight
2023-10-12 15:08:15 -03:00
Julian Weber
e5e0cbffc9
Streaming inference for XTTS 🚀 ( #3035 )
2023-10-06 18:34:06 +02:00
Edresson Casanova
4c3c11c958
Tortoise inference fix and fix zoo unit tests ( #3010 )
2023-09-29 13:40:57 +02:00
Aarni Koskela
33a7c722f6
Merge duplicate on_train_step_start functions in delightful_tts
2023-09-27 01:10:44 +03:00
Aarni Koskela
861c68b0b8
Rename misnamed setter
2023-09-27 01:09:59 +03:00
Aarni Koskela
09e14e68db
Remove duplicate get_named_beta_schedules
2023-09-27 01:09:59 +03:00
Aarni Koskela
59f85a7122
Remove duplicate code from xtts.tokenizer
2023-09-27 01:09:59 +03:00
loupzeur
da8b6bbce1
fix: xtts not taking into account device flag ( #2951 )
...
* fix: xtts not taking into account device flag
* Style changes
---------
Co-authored-by: Julian Weber <julian.weber@hotmail.fr>
2023-09-20 09:57:02 +02:00
Eren Gölge
4033db5f4b
🔥 XTTS implementation
2023-09-13 17:51:24 +02:00
Edresson Casanova
4d3f23b5d3
Add CML-TTS dataset YourTTS training recipe ( #2934 )
2023-09-12 11:49:14 +02:00
Aleś Bułojčyk
fead04f779
Add phonemizer for Belarusian language ( #2856 )
2023-08-28 11:20:45 +02:00
Eren Gölge
a7a96d08dd
Fix loading Bark ( #2893 )
...
* Fixup hubert path
* Make style
2023-08-26 11:59:00 +02:00
Jake Tae
409db505d2
Add device support in TTS and Synthesizer ( #2855 )
...
* fix: resolve merge conflicts
* fix: retain backwards compatibility in functions
* feature: utilize device for voice transfer
* feature: use device for vocoder
* chore: cleanup vocoder cpu logic
* fix: add necessary vocoder output device check
* fix: add necessary vocoder output device check
* fix: indentation
* fix: check if waveform is pt tensor before cpu conversion
---------
Co-authored-by: Jake Tae <jaketae@Jakes-MacBook-Pro-2.local>
2023-08-14 21:04:44 +02:00
Eren Gölge
3a104d5c49
Update Studio API for XTTS ( #2861 )
...
* Update Studio API for XTTS
* Update the docs
* Update README.md
* Update README.md
Update README
2023-08-13 12:04:12 +02:00
Eren Gölge
37b558ccb9
Make style
2023-08-11 12:55:23 +02:00
Eren Gölge
9a8352b8da
Fix import error with Bark
2023-08-11 03:33:59 +02:00
Eren Gölge
4186f42b21
Handle missing JA phonemizer ( #2843 )
...
* Handle missing JA phonemizer
* Make style
2023-08-07 13:19:38 +02:00
Javier
4e7f8cd021
Add fairseq onnx support and strict configuration, fixes some onnx errors ( #2831 )
2023-08-04 11:02:59 +02:00
Eren Gölge
69f080eb47
Fix DelightfulTTS ( #2823 )
...
* Fix tests
* Make style
2023-07-31 13:52:45 +02:00
Eren Gölge
483888b9d8
Add kwargs to ignore extra arguments w/o error ( #2822 )
2023-07-31 11:37:35 +02:00
Aleś Bułojčyk
d124f78430
Recipe for Belarusian TTS ( #2756 )
...
* Changes from jhlfrfufyfn <jhlfrfufyfn@gmail.com>
* Recipe for Belarusian TTS
---------
Co-authored-by: jhlfrfufyfn <jhlfrfufyfn@gmail.com>
2023-07-31 10:26:21 +02:00
Javier
c140df5a58
Adds multi-language support for VITS onnx, fixes onnx inference error when speaker_id is None or not passed, fixes onnx exporting for models with init_discriminator=false ( #2816 )
2023-07-31 10:19:49 +02:00
Eren Gölge
8aacb81849
Fix Tortoise load ( #2791 )
...
* Remove key pruning in tortoise
* Make lint
2023-07-24 13:42:47 +02:00
logan hart
6fdb88f8e2
Add Delightful-TTS implementation ( #2095 )
...
* add configs
* Update config file
* Add model configs
* Add model layers
* Add layer files
* Add layer modules
* change config names
* Add emotion manager
* fIX missing ap bug
* Fix missing ap bug
* Add base TTS e2e class
* Fix wrong variable name in load_tts_samples
* Add training script
* Remove range predictor and gaussian upsampling
* Add helper function
* Add vctk recipe
* Add conformer docs
* Fix linting in conformer.py
* Add Docs
* remove duplicate import
* refactor args
* Fix bugs
* Remove emotion embedding
* remove unused arg
* Remove emotion embedding arg
* Remove emotion embedding arg
* fix style issues
* Fix bugs
* Fix bugs
* Add unittests
* make style
* fix formatter bug
* fix test
* Add pyworld compute pitch func
* Update requirements.txt
* Fix dataset Bug
* Change layer norm to instance norm
* Add missing import
* Remove emotions.py
* remove ssim loss
* Add init layers func to aligner
* refactor model layers
* remove audio_config arg
* Rename loss func
* Rename to delightful-tts
* Rename loss func
* Remove unused modules
* refactor imports
* replace audio config with audio processor
* Add change sample rate option
* remove broken resample func
* update recipe
* fix style, add config docs
* fix tests and multispeaker embd dim
* remove pyworld
* Make style and fix inference
* Split tts tests
* Fixup
* Fixup
* Fixup
* Add argument names
* Set "random" speaker in the model Tortoise/Bark
* Use a diff f0_cache path for delightfull tts
* Fix delightful speaker handling
* Fix lint
* Make style
---------
Co-authored-by: loganhart420 <loganartpersonal@gmail.com>
Co-authored-by: Eren Gölge <erogol@hotmail.com>
2023-07-24 13:41:26 +02:00
Eren Gölge
0de12ec5aa
API tests ( #2790 )
...
* Separate API tests and only run when uplifted
* Make style
2023-07-24 12:14:21 +02:00
Paul O'Leary McCann
c0aabb8596
Make Japanese-specific dependencies optional ( #2776 )
...
* Don't install MeCab by default
* Add optional [ja] deps, like [dev] etc
* Add JA requirements file
* Add JA requirements to requirements_all
This should help the tests run.
2023-07-24 11:28:27 +02:00
Eren Gölge
672ec3b35e
Fix #2749 ( #2750 )
2023-07-08 11:40:44 +02:00
Eren Gölge
a2984fb435
Fix #2745 ( #2748 )
2023-07-07 20:23:27 +02:00
Eren Gölge
7b5c8422c8
Export multispeaker onnx ( #2743 )
2023-07-06 13:36:50 +02:00
ZhouGongZaiShi
d5f16d77c2
delete meaningless print() ( #2662 )
2023-07-04 11:38:17 +02:00
Eren Gölge
cb9c320691
Fixup
2023-06-30 14:13:11 +02:00
Eren Gölge
91cc11d636
Remove commented codes
2023-06-28 12:14:37 +02:00
Eren Gölge
6b9ebf5aab
Merge branch 'p3_11' into dev
2023-06-28 12:13:04 +02:00
Eren Gölge
c844b6570a
Inference API for 🐶 Bark ( #2685 )
...
* Add bark requirements
* Draft Bark implementation
* Download HF models
* Update synthesizer
* Add bark model
* Make style
* Update pylintrc
* Update model URLs
* Update Bark Config
* Fix here and there
* Make style
* Make lint
* Update requirements
* Update requirements
2023-06-28 11:55:27 +02:00
Eren Gölge
a13b1352a4
Fixup
2023-06-26 19:30:26 +02:00
Eren Gölge
17ac188958
Drop fairseq for Hubert
2023-06-26 19:27:48 +02:00
Eren Gölge
c03768bb53
Make style
2023-06-26 17:16:26 +02:00
Eren Gölge
a1c431e6a9
Fixups
2023-06-26 12:55:18 +02:00
Eren Gölge
fff8b762bc
Merge branch 'dev' into bark
2023-06-21 15:49:05 +02:00
Eren Gölge
4cf8652392
Fix Tortoise load ( #2697 )
...
* Handle missing gpt weights
* Make style
* Fix lint
2023-06-21 15:42:01 +02:00
Eren Gölge
cf98ae04df
Make lint
2023-06-21 12:05:08 +02:00
Eren Gölge
3b9fca2398
Make style
2023-06-21 12:02:06 +02:00
Eren Gölge
0f8932a6a9
Fix here and there
2023-06-21 11:59:27 +02:00
Eren Gölge
03c347b7f3
Update Bark Config
2023-06-21 11:58:18 +02:00
Eren Gölge
f4c88ed677
Make style
2023-06-19 14:22:32 +02:00
Eren Gölge
37b708dac7
Add bark model
2023-06-19 14:16:06 +02:00
Eren Gölge
f59da4dba5
Draft Bark implementation
2023-06-12 14:32:39 +02:00
Tsai Meng-Ting
d65819422b
Update stochastic_duration_predictor.py ( #2663 )
...
fix a typo
2023-06-12 11:10:54 +02:00
Eren Gölge
e785d101a1
Port Fairseq TTS models ( #2628 )
...
* Load fairseq models
* Add docs and missing files
* Managing fairseq models and docs for API
* Make style
* Use scarf URL
* Add tests
* Fix URL
* Pass cpu
* Make lint
* Fixup
* Make lint
* fixup
* Fixup
* Change tokenization order
* Update README
* Fixup
* Fixup
2023-06-05 11:15:13 +02:00