logan hart
6fdb88f8e2
Add Delightful-TTS implementation ( #2095 )
...
* add configs
* Update config file
* Add model configs
* Add model layers
* Add layer files
* Add layer modules
* change config names
* Add emotion manager
* fIX missing ap bug
* Fix missing ap bug
* Add base TTS e2e class
* Fix wrong variable name in load_tts_samples
* Add training script
* Remove range predictor and gaussian upsampling
* Add helper function
* Add vctk recipe
* Add conformer docs
* Fix linting in conformer.py
* Add Docs
* remove duplicate import
* refactor args
* Fix bugs
* Removew emotion embedding
* remove unused arg
* Remove emotion embedding arg
* Remove emotion embedding arg
* fix style issues
* Fix bugs
* Fix bugs
* Add unittests
* make style
* fix formatter bug
* fix test
* Add pyworld compute pitch func
* Update requirments.txt
* Fix dataset Bug
* Chnge layer norm to instance norm
* Add missing import
* Remove emotions.py
* remove ssim loss
* Add init layers func to aligner
* refactor model layers
* remove audio_config arg
* Rename loss func
* Rename to delightful-tts
* Rename loss func
* Remove unused modules
* refactor imports
* replace audio config with audio processor
* Add change sample rate option
* remove broken resample func
* update recipe
* fix style, add config docs
* fix tests and multispeaker embd dim
* remove pyworld
* Make style and fix inference
* Split tts tests
* Fixup
* Fixup
* Fixup
* Add argument names
* Set "random" speaker in the model Tortoise/Bark
* Use a diff f0_cache path for delightfull tts
* Fix delightful speaker handling
* Fix lint
* Make style
---------
Co-authored-by: loganhart420 <loganartpersonal@gmail.com>
Co-authored-by: Eren Gölge <erogol@hotmail.com>
2023-07-24 13:41:26 +02:00
Eren G??lge
17ac188958
Drop fairseq for Hubert
2023-06-26 19:27:48 +02:00
Eren Gölge
4de797bb11
Draft ONNX export for VITS ( #2563 )
...
* Draft ONNX export for VITS
Could not get it work to output variable length sequence
* Fixup for onnx constant output
* Make style
* Remove commented code
2023-05-16 01:07:56 +02:00
Eren G??lge
7d8b1665c8
Fix rand_segment edge case (input_len == seg_len - 1)
2022-08-01 11:37:45 +02:00
Eren Gölge
424d04e4f6
Make stlye
2022-02-25 11:31:56 +01:00
Eren Gölge
c11944022d
Revert back again rand_segment
2022-02-25 11:30:24 +01:00
Eren Gölge
ab8a4ca2c3
Revert random segment
2022-02-25 11:26:59 +01:00
Eren Gölge
1f0c8179da
Make style
2022-02-25 11:26:59 +01:00
Eren Gölge
c4c471d61d
Allow padding for shorter segments
2022-02-25 11:25:48 +01:00
Eren Gölge
26f76fce22
Remove SpeedySpeech from .models.json
2021-09-10 17:47:27 +00:00
Eren Gölge
ed4b1d8514
Test `TTS.tts.utils.helpers`
2021-09-10 08:25:21 +00:00
Eren Gölge
bfc6ceac29
Move MAS to `TTS.tts.utils.helpers`
2021-09-09 10:57:19 +00:00
Eren Gölge
537c8576ec
Stage `TTS.tts.utils.helpers`
2021-09-08 13:35:18 +00:00