Eren Gölge
ded7fd4fb2
Make style
2023-09-14 15:23:37 +02:00
Eren Gölge
44b61d2b92
Fixup
2023-09-14 15:22:54 +02:00
Eren Gölge
623ea41634
Fix model tests ( #2943 )
2023-09-14 15:21:48 +02:00
Eren Gölge
4033db5f4b
🔥 XTTS implementation
2023-09-13 17:51:24 +02:00
Cohee
b3b1555d82
Fix exception handling in manage.py ( #2912 )
2023-09-04 12:54:30 +02:00
Jake Tae
409db505d2
Add device support in TTS and Synthesizer ( #2855 )
...
* fix: resolve merge conflicts
* fix: retain backwards compatibility in functions
* feature: utilize device for voice transfer
* feature: use device for vocoder
* chore: cleanup vocoder cpu logic
* fix: add necessary vocoder output device check
* fix: add necessary vocoder output device check
* fix: indentation
* fix: check if waveform is pt tensor before cpu conversion
---------
Co-authored-by: Jake Tae <jaketae@Jakes-MacBook-Pro-2.local>
2023-08-14 21:04:44 +02:00
Julian Weber
febcaf710a
Add customizable data home path ( #2871 )
...
* Add customizable data home path
* Add TTS_HOME as an option
2023-08-14 21:02:48 +02:00
Eren Gölge
3a104d5c49
Update Studio API for XTTS ( #2861 )
...
* Update Studio API for XTTS
* Update the docs
* Update README.md
* Update README.md
Update README
2023-08-13 12:04:12 +02:00
Eren Gölge
17ddd65741
Please p3.11
2023-07-31 15:53:19 +02:00
Aleś Bułojčyk
d124f78430
Recipe for Belarusian TTS ( #2756 )
...
* Changes from jhlfrfufyfn <jhlfrfufyfn@gmail.com>
* Recipe for Belarusian TTS
---------
Co-authored-by: jhlfrfufyfn <jhlfrfufyfn@gmail.com>
2023-07-31 10:26:21 +02:00
logan hart
6fdb88f8e2
Add Delightful-TTS implementation ( #2095 )
...
* add configs
* Update config file
* Add model configs
* Add model layers
* Add layer files
* Add layer modules
* change config names
* Add emotion manager
* fIX missing ap bug
* Fix missing ap bug
* Add base TTS e2e class
* Fix wrong variable name in load_tts_samples
* Add training script
* Remove range predictor and gaussian upsampling
* Add helper function
* Add vctk recipe
* Add conformer docs
* Fix linting in conformer.py
* Add Docs
* remove duplicate import
* refactor args
* Fix bugs
* Removew emotion embedding
* remove unused arg
* Remove emotion embedding arg
* Remove emotion embedding arg
* fix style issues
* Fix bugs
* Fix bugs
* Add unittests
* make style
* fix formatter bug
* fix test
* Add pyworld compute pitch func
* Update requirments.txt
* Fix dataset Bug
* Change layer norm to instance norm
* Add missing import
* Remove emotions.py
* remove ssim loss
* Add init layers func to aligner
* refactor model layers
* remove audio_config arg
* Rename loss func
* Rename to delightful-tts
* Rename loss func
* Remove unused modules
* refactor imports
* replace audio config with audio processor
* Add change sample rate option
* remove broken resample func
* update recipe
* fix style, add config docs
* fix tests and multispeaker embd dim
* remove pyworld
* Make style and fix inference
* Split tts tests
* Fixup
* Fixup
* Fixup
* Add argument names
* Set "random" speaker in the model Tortoise/Bark
* Use a diff f0_cache path for delightfull tts
* Fix delightful speaker handling
* Fix lint
* Make style
---------
Co-authored-by: loganhart420 <loganartpersonal@gmail.com>
Co-authored-by: Eren Gölge <erogol@hotmail.com>
2023-07-24 13:41:26 +02:00
JiangCheng
53938e2d32
Squashed commit of the following:
...
commit dd612fd72e
Author: JiangCheng <jiangcheng@kezaihui.com>
Date: Mon Jun 5 16:04:54 2023 +0800
Failed to download the file and need to delete the created file path
2023-07-05 12:08:05 +02:00
Eren Gölge
34b9a18c47
Fixup
2023-06-28 12:26:04 +02:00
Eren Gölge
6b9ebf5aab
Merge branch 'p3_11' into dev
2023-06-28 12:13:04 +02:00
Eren Gölge
c844b6570a
Inference API for 🐶 Bark ( #2685 )
...
* Add bark requirements
* Draft Bark implementation
* Download HF models
* Update synthesizer
* Add bark model
* Make style
* Update pylintrc
* Update model URLs
* Update Bark Config
* Fix here and ther
* Make style
* Make lint
* Update requirements
* Update requirements
2023-06-28 11:55:27 +02:00
Eren Gölge
a1c431e6a9
Fixups
2023-06-26 12:55:18 +02:00
Eren Gölge
a58fb6c01b
Update requirements
2023-06-22 13:53:19 +02:00
Eren Gölge
e888e8a56d
Fix manage
2023-06-22 10:13:20 +02:00
Eren Gölge
fff8b762bc
Merge branch 'dev' into bark
2023-06-21 15:49:05 +02:00
Eren Gölge
0f8932a6a9
Fix here and ther
2023-06-21 11:59:27 +02:00
Eren Gölge
f4c88ed677
Make style
2023-06-19 14:22:32 +02:00
Eren Gölge
2364c38d16
Update synthesizer
2023-06-19 14:15:21 +02:00
Eren Gölge
5a31fad502
Download HF models
2023-06-19 14:14:04 +02:00
Eren Gölge
e785d101a1
Port Fairseq TTS models ( #2628 )
...
* Load fairseq models
* Add docs and missing files
* Managing fairseq models and docs for API
* Make style
* Use scarf URL
* Add tests
* Fix URL
* Pass cpu
* Make lint
* Fixup
* Make lint
* fixup
* Fixup
* Change tokenization order
* Update README
* Fixup
* Fixup
2023-06-05 11:15:13 +02:00
Shukrullo Turgunov
0d5e68a09f
fix typo ( #2647 )
...
* fix typo
* typo fix
2023-06-05 09:58:16 +02:00
manmay nakhashi
a3d5801c44
Tortoise TTS inference ( #2547 )
...
* initial commit
* Tortoise inference
* revert path change
* style fix
* remove accidental remove
* style fixes
* style fixes
* removed unwanted assests and deps
* remove changes
* remove cvvp
* style fix black
* added tortoise config and updated config and args, refactoring the code
* added tortoise to api
* Pull mel_norm from url
* Use TTS cleaners
* Let download model files
* add ability to pass tortoise presets through coqui api
* fix tests
* fix style and tests
* fix tts commandline for tortoise
* Add config.json to tortoise
* Use kwargs
* Use regular model api for loading tortoise
* Add load from dir to synthesizer
* Fix Tortoise floats
* Use model_dir when there are multiple urls
* Use `synthesize` when exists
* lint fixes and resolve preset bug
* resolve a download bug and update model link
* fix json
* do tortoise inference from voice dir
* fix
* fix test
* fix speaker id and remove assests
* update inference_tests.yml
* replace inference_test.yml
* fix extra dir as None
* fix tests
* remove space
* Reformat docstring
* Add docs
* Update docs
* lint fixes
---------
Co-authored-by: Eren Gölge <egolge@coqui.ai>
Co-authored-by: Eren Gölge <erogol@hotmail.com>
2023-05-16 00:58:21 +02:00
Eren Gölge
9b5822d625
Update VAD for silence trimming. ( #2604 )
...
* Update vad for mp3 and fault tolerance
* Make style
* Remove importt
* Remove stupid defaults
2023-05-11 11:09:23 +02:00
prakharpbuf
c1875f68df
typos and minor fixes ( #2508 )
...
* Update tacotron1-2.md
* Update README.md
* Update Tutorial_2_train_your_first_TTS_model.ipynb
* Update synthesizer.py
There is no arg called --speaker_name
* Update formatting_your_dataset.md
* Update AnalyzeDataset.ipynb
* Update AnalyzeDataset.ipynb
* Update AnalyzeDataset.ipynb
* Update finetuning.md
* Update train_yourtts.py
* Update train_yourtts.py
* Update train_yourtts.py
* Update finetuning.md
2023-04-26 15:22:57 +02:00
Eren Gölge
758ef84cc2
Using 🐸 Studio models with `tts` command
2023-04-13 14:14:41 +02:00
Eren Gölge
a49c1931d9
Fixup
2023-04-10 13:33:42 +02:00
Eren Gölge
30109af2a0
Merge pull request #2480 from MattyB95/librosa_v0.10.0
...
Update Librosa Version To V0.10.0
2023-04-07 12:32:33 +02:00
Eren Gölge
ad8b9bf2be
🐸 Coqui Studio API integration ( #2484 )
...
* Warn when lang is not avail
* Make style
* Implement Coqui Studio API
* Test
* Update docs
* Set action
* Make style
* Make lint
* Update README
* Make style
* Fix action
* Run actions
2023-04-05 15:06:50 +02:00
Matthew Boakes
4c829e74a1
Update Librosa Version To V0.10.0
2023-04-05 00:59:20 +01:00
Eren Gölge
d309f50e53
Implement FreeVC ( #2451 )
...
* Update .gitignore
* Draft FreeVC implementation
* Tests and relevant updates
* Update API tests
* Add missings
* Update requirements
* :(
* Lazy handle for vc
* Update docs for voice conversion
* Make style
2023-03-25 18:33:23 +01:00
Roee Shenberg
3c15f0619a
Bug fixes in OverFlow audio generation ( #2380 )
2023-03-15 12:02:11 +01:00
Eren Gölge
914280a556
Bump up to v0.11.0 ( #2329 )
...
* Make style
* Bump up to v0.11.0
2023-02-08 13:58:49 +01:00
Eren Gölge
85b3a04b37
Merge branch 'api_model_path' into dev
2023-02-06 11:18:00 +01:00
marius851000
1f4d8bf0f1
Fix tts-server for multi-lingual models ( #2257 )
2023-02-06 10:54:34 +01:00
Eren Gölge
7fddabc8ac
Implement cloning in API
2023-01-30 13:35:48 +01:00
manmay nakhashi
bc422f2f3c
Fastspeech2 ( #2073 )
...
* added EnergyDataset
* add energy to Dataset
* add comupte_energy
* added energy params
* added energy to forward_tts
* added plot_avg_energy for visualisation
* Update forward_tts.py
* create file
* added fastspeech2 recipe
* add fastspeech2 config
* removed energy from fast pitch
* add energy loss to forward tts
* Update fastspeech2_config.py
* change run_name
* Update numpy_transforms.py
* fix typo
* fix typo
* fix typo
* linting issues
* use_energy default value --> False
* Update numpy_transforms.py
* linting fixes
* fix typo
* liniting_fix
* liniting_fix
* fix
* fixes
* fixes
* lint fix
* lint fixws
* added training test
* wrong import
* wrong import
* trailing whitespace
* style fix
* changed class name because of error
* class name change
* class name change
* change class name
* fixed styles
2023-01-15 22:39:22 +01:00
Khalid Bashir
42afad5e79
Fixed bug related to yourtts speaker embeddings issue ( #2234 )
...
* Fixed bug related to yourtts speaker embeddings issue
* Reverted code for base_tts
* Bug fix on VITS d_vector_file type
* Ignore the test speakers on YourTTS recipe
* Add speaker encoder model and config on YourTTS recipe to easily do zero-shot inference
* Update YourTTS config file
* Update ModelManager._update_path to deal with list attributes
* Fix lint checks
* Remove unused code
* Fix unit tests
* Reset name_to_id to get the right speaker ids on load_embeddings_from_list_of_files
* Set weighted_sampler_multipliers as an empty dict to prevent users' mistakes
Co-authored-by: Edresson Casanova <edresson1@gmail.com>
2023-01-02 14:20:02 +01:00
Eren Gölge
8c32a6998a
Add pth files to manager
2022-12-26 14:29:25 +01:00
Eren Gölge
ecea43ec81
Adding pre-trained Overflow model ( #2211 )
...
* Adding pretrained Overflow model
* Stabilize HMM
* Fixup model manager
* Return `audio_unique_name` by default
* Distribute max split size over datasets
* Fixup eval_split_size
* Make style
2022-12-14 16:55:48 +01:00
Eren Gölge
1ddc484b49
Python API implementation ( #2195 )
...
* Draft implementation
* Fix style
* Add api tests
* Fix lint
* Update docs
* Update tests
* Set env
* Fixup
* Fixup
* Fix lint
* Revert
2022-12-12 12:04:20 +01:00
logan hart
ff9b63d02a
Add neon models ( #2140 )
...
* Add neon ljspeech vits model
* Add neon german model
* Update .models.json
* Add neon spanish model
* Add french model
* Add Dutch model
* Add Hungarian model
* Add Greek model
* Remove unneeded description
* Update .models.json
* Update .models.json
* Handling neon models
* Add all neon models
* Update .models.json
* Split zoo_tests
* Update test names
* Update model testing
Co-authored-by: Eren Gölge <erogol@hotmail.com>
2022-11-16 16:12:39 +01:00
Eren Gölge
8cb1433e6e
Cache fsspec downloads ( #2132 )
...
* Cache fsspec downloaded files
* Use diff paths for test
* Make fsspec caching optional
* Decom GPU docker tests
* Make progress bar optional for better CI log
* Check path local
2022-11-09 22:12:48 +01:00
Victor Shepardson
5307a2229b
Fix Capacitron training ( #2086 )
2022-11-01 12:52:06 +01:00
Eren Gölge
5f5d441ee5
Write non-speech files in a TXT ( #2048 )
...
* Write non-speech files in a txt
* Save 16-bit wav out of vad
2022-10-06 13:25:54 +02:00
Eren Gölge
9e5a469c64
d-vector handling ( #1945 )
...
* Update BaseDatasetConfig
- Add dataset_name
- Change name to formatter_name
* Update compute_embedding
- Allow entering dataset by args
- Use released model by default
- Use the new key format
* Update loading
* Update recipes
* Update other dep code
* Update tests
* Fixup
* Load multiple embedding files
* Fix argument names in dep code
* Update docs
* Fix argument name
* Fix linter
2022-09-13 14:10:33 +02:00
Edresson Casanova
371772c355
Replace pyworld by pyin ( #1946 )
...
* Replace pyworld by pyin
* Fix unit tests
2022-09-09 10:43:14 +02:00