Aya Jafari
ffddf10458
unit test fix
2023-10-13 10:56:47 -03:00
Aya Jafari
6eaecab0ca
fixed bugs in fastpitch tts synthesis
2023-10-10 23:02:31 -03:00
ggoknar
99635193f5
v0.17.8
2023-10-07 01:14:05 +03:00
ggoknar
3bb51b1276
0.17.8
2023-10-07 01:13:02 +03:00
Edresson Casanova
2852404bdf
Fix style
2023-10-06 17:42:46 -03:00
Edresson Casanova
99650044a4
Try-except to present error when the user doesn't have a connection
2023-10-06 17:37:05 -03:00
Edresson Casanova
529ea3f67f
Print a message when it is already downloaded
2023-10-06 17:26:40 -03:00
Edresson Casanova
ee1ef1c51e
Remove unused method
2023-10-06 17:21:22 -03:00
Edresson Casanova
4a6103fec9
Redownload XTTS when the local and remote config do not match
2023-10-06 17:16:30 -03:00
Eren Gölge
0520697b5f
v0.17.7
2023-10-06 18:35:26 +02:00
Julian Weber
e5e0cbffc9
Streaming inference for XTTS 🚀 ( #3035 )
2023-10-06 18:34:06 +02:00
OPERATOR
2150136210
None is not able to be read for "XTTS", fixes crash if its set to None. ( #3009 )
2023-10-02 12:53:36 +02:00
Eren Gölge
155c5fc0bd
v0.17.6
2023-09-29 23:44:09 +02:00
Edresson Casanova
4c3c11c958
Tortoise inference fix and fix zoo unit tests ( #3010 )
2023-09-29 13:40:57 +02:00
Eren Gölge
bb05dcb9b4
Merge pull request #2922 from coqui-ai/be_tts
...
Adding Belarusian TTS model
2023-09-27 09:48:28 +02:00
Eren Gölge
8cba47191f
Merge pull request #2993 from akx/tts-readme
...
Ensure `tts` CLI tool readme and usage is in sync
2023-09-27 09:46:54 +02:00
Eren Gölge
ea51a7ffcc
Merge pull request #3003 from akx/duplicate-code-removal
...
Duplicate code removal
2023-09-27 09:41:35 +02:00
Aarni Koskela
0dbe7cbcc4
Remove duplicate convert_pad_shape
2023-09-27 01:10:48 +03:00
Aarni Koskela
33a7c722f6
Merge duplicate on_train_step_start functions in delightful_tts
2023-09-27 01:10:44 +03:00
Aarni Koskela
861c68b0b8
Rename misnamed setter
2023-09-27 01:09:59 +03:00
Aarni Koskela
09e14e68db
Remove duplicate get_named_beta_schedules
2023-09-27 01:09:59 +03:00
Aarni Koskela
59f85a7122
Remove duplicate code from xtts.tokenizer
2023-09-27 01:09:59 +03:00
Aarni Koskela
0a82f063cc
Late-import main TTS libraries in `tts` CLI
2023-09-26 15:38:56 +03:00
Aarni Koskela
5c047cf304
Ensure `tts` CLI tool readme and usage help is in sync
2023-09-26 15:38:56 +03:00
Eren Gölge
0b95b88f13
Bump up to v0.17.5
2023-09-25 18:16:45 +02:00
VLT Media
dd73910651
Bug: self.model_name needed to be initialized.
...
Bug: self.model_name needed to be initialized to get around a bug that automatically crashes when the user provides the model paths but no model_name when initializing the TTS object.
2023-09-23 01:41:35 -04:00
loupzeur
da8b6bbce1
fix: xtts not taking into account device flag ( #2951 )
...
* fix: xtts not taking into account device flag
* Style changes
---------
Co-authored-by: Julian Weber <julian.weber@hotmail.fr>
2023-09-20 09:57:02 +02:00
Reuben Morais
f829bf50f8
Bump version to v0.17.4 (really)
2023-09-15 16:40:34 +02:00
Eren Gölge
aa8fa4756e
Bump up to v0.17.4
2023-09-14 17:52:44 +02:00
Eren Gölge
9d0b76ce23
Check env var for COQUI_TOS_AGREED
2023-09-14 17:51:40 +02:00
Eren Gölge
13dd7c4c9e
Bump up to v0.17.2
2023-09-14 15:24:05 +02:00
Eren Gölge
ded7fd4fb2
Make style
2023-09-14 15:23:37 +02:00
Eren Gölge
44b61d2b92
Fixup
2023-09-14 15:22:54 +02:00
Eren Gölge
623ea41634
Fix model tests ( #2943 )
2023-09-14 15:21:48 +02:00
Eren Gölge
af62613c86
Bump up to v0.17.1
2023-09-13 18:23:39 +02:00
Eren Gölge
ee7cee0e35
Fixup
2023-09-13 18:21:44 +02:00
Eren Gölge
5dcf9ae311
Bump up v0.17.0
2023-09-13 18:04:26 +02:00
Eren Gölge
4033db5f4b
🔥 XTTS implementation
2023-09-13 17:51:24 +02:00
Edresson Casanova
4d3f23b5d3
Add CML-TTS dataset YourTTS training recipe ( #2934 )
2023-09-12 11:49:14 +02:00
Eren Gölge
9533f8656c
Make style
2023-09-04 13:58:37 +02:00
Eren Gölge
562a9509f2
Add BE model
2023-09-04 13:57:03 +02:00
Eren Gölge
b4c82685a7
Add model entries
2023-09-04 13:04:58 +02:00
Cohee
b3b1555d82
Fix exception handling in manage.py ( #2912 )
2023-09-04 12:54:30 +02:00
Eren Gölge
40b527345f
Bump up to v0.16.6
2023-09-04 12:51:53 +02:00
Aleś Bułojčyk
fead04f779
Add phonemizer for Belarusian language ( #2856 )
2023-08-28 11:20:45 +02:00
Jake Tae
b79b6f0762
feature: add device flag to tts cli ( #2875 )
2023-08-28 11:20:12 +02:00
Eren Gölge
c0b5e61749
Bump up to v0.16.5
2023-08-26 12:00:25 +02:00
Eren Gölge
a7a96d08dd
Fix loading Bark ( #2893 )
...
* Fixup hubert path
* Make style
2023-08-26 11:59:00 +02:00
Eren Gölge
04a36a727b
Bump up to v0.16.4
2023-08-26 10:39:48 +02:00
Eren Gölge
a96562a750
Update .models.json
2023-08-26 10:36:40 +02:00
Jake Tae
409db505d2
Add device support in TTS and Synthesizer ( #2855 )
...
* fix: resolve merge conflicts
* fix: retain backwards compatibility in functions
* feature: utilize device for voice transfer
* feature: use device for vocoder
* chore: cleanup vocoder cpu logic
* fix: add necessary vocoder output device check
* fix: add necessary vocoder output device check
* fix: indentation
* fix: check if waveform is pt tensor before cpu conversion
---------
Co-authored-by: Jake Tae <jaketae@Jakes-MacBook-Pro-2.local>
2023-08-14 21:04:44 +02:00
Julian Weber
febcaf710a
Add customizable data home path ( #2871 )
...
* Add customizable data home path
* Add TTS_HOME as an option
2023-08-14 21:02:48 +02:00
Eren Gölge
c4e5effab9
Bump up to v0.16.3
2023-08-13 12:22:04 +02:00
Eren Gölge
3a104d5c49
Update Studio API for XTTS ( #2861 )
...
* Update Studio API for XTTS
* Update the docs
* Update README.md
* Update README.md
Update README
2023-08-13 12:04:12 +02:00
Eren Gölge
37b558ccb9
Make style
2023-08-11 12:55:23 +02:00
Eren Gölge
9a8352b8da
Fix import error with Bark
2023-08-11 03:33:59 +02:00
Eren Gölge
c87377b713
Bump up to v0.16.2
2023-08-07 13:21:14 +02:00
Eren Gölge
4186f42b21
Handle missing JA phonemizer ( #2843 )
...
* Handle missing JA phonemizer
* Make style
2023-08-07 13:19:38 +02:00
Javier
4e7f8cd021
Add fairseq onnx support and strict configuration, fixes some onnx errors ( #2831 )
2023-08-04 11:02:59 +02:00
ChaseC
52a528cfcf
add post functionality to /api/tts ( #2836 )
2023-08-04 10:54:20 +02:00
Eren Gölge
dc04baa1ee
Bump up to v0.16.1
2023-07-31 15:54:45 +02:00
Eren Gölge
17ddd65741
Please p3.11
2023-07-31 15:53:19 +02:00
Eren Gölge
69f080eb47
Fix DelightfulTTS ( #2823 )
...
* Fix tests
* Make style
2023-07-31 13:52:45 +02:00
Eren Gölge
483888b9d8
Add kwargs to ignore extra arguments w/o error ( #2822 )
2023-07-31 11:37:35 +02:00
Aleś Bułojčyk
d124f78430
Recipe for Belarusian TTS ( #2756 )
...
* Changes from jhlfrfufyfn <jhlfrfufyfn@gmail.com>
* Recipe for Belarusian TTS
---------
Co-authored-by: jhlfrfufyfn <jhlfrfufyfn@gmail.com>
2023-07-31 10:26:21 +02:00
Javier
c140df5a58
Adds multi-language support for VITS onnx, fixes onnx inference error when speaker_id is None or not passed, fixes onnx exporting for models with init_discriminator=false ( #2816 )
2023-07-31 10:19:49 +02:00
Eren Gölge
b739326503
Bump up to v0.16.0
2023-07-24 16:04:10 +02:00
Eren Gölge
8aacb81849
Fix Tortoise load ( #2791 )
...
* Remove key pruning in tortoise
* Make lint
2023-07-24 13:42:47 +02:00
logan hart
6fdb88f8e2
Add Delightful-TTS implementation ( #2095 )
...
* add configs
* Update config file
* Add model configs
* Add model layers
* Add layer files
* Add layer modules
* change config names
* Add emotion manager
* Fix missing ap bug
* Fix missing ap bug
* Add base TTS e2e class
* Fix wrong variable name in load_tts_samples
* Add training script
* Remove range predictor and gaussian upsampling
* Add helper function
* Add vctk recipe
* Add conformer docs
* Fix linting in conformer.py
* Add Docs
* remove duplicate import
* refactor args
* Fix bugs
* Remove emotion embedding
* remove unused arg
* Remove emotion embedding arg
* Remove emotion embedding arg
* fix style issues
* Fix bugs
* Fix bugs
* Add unittests
* make style
* fix formatter bug
* fix test
* Add pyworld compute pitch func
* Update requirements.txt
* Fix dataset Bug
* Change layer norm to instance norm
* Add missing import
* Remove emotions.py
* remove ssim loss
* Add init layers func to aligner
* refactor model layers
* remove audio_config arg
* Rename loss func
* Rename to delightful-tts
* Rename loss func
* Remove unused modules
* refactor imports
* replace audio config with audio processor
* Add change sample rate option
* remove broken resample func
* update recipe
* fix style, add config docs
* fix tests and multispeaker embd dim
* remove pyworld
* Make style and fix inference
* Split tts tests
* Fixup
* Fixup
* Fixup
* Add argument names
* Set "random" speaker in the model Tortoise/Bark
* Use a diff f0_cache path for delightful tts
* Fix delightful speaker handling
* Fix lint
* Make style
---------
Co-authored-by: loganhart420 <loganartpersonal@gmail.com>
Co-authored-by: Eren Gölge <erogol@hotmail.com>
2023-07-24 13:41:26 +02:00
Eren Gölge
0de12ec5aa
API tests ( #2790 )
...
* Separate API tests and only run when uplifted
* Make style
2023-07-24 12:14:21 +02:00
Paul O'Leary McCann
c0aabb8596
Make Japanese-specific dependencies optional ( #2776 )
...
* Don't install MeCab by default
* Add optional [ja] deps, like [dev] etc
* Add JA requirements file
* Add JA requirements to requirements_all
This should help the tests run.
2023-07-24 11:28:27 +02:00
Eren Gölge
672ec3b35e
Fix #2749 ( #2750 )
2023-07-08 11:40:44 +02:00
Eren Gölge
b5cd644132
Bump up to v0.15.6
2023-07-08 10:33:09 +02:00
Eren Gölge
a2984fb435
Fix #2745 ( #2748 )
2023-07-07 20:23:27 +02:00
Eren Gölge
7b5c8422c8
Export multispeaker onnx ( #2743 )
2023-07-06 13:36:50 +02:00
JiangCheng
53938e2d32
Squashed commit of the following:
...
commit dd612fd72e
Author: JiangCheng <jiangcheng@kezaihui.com>
Date: Mon Jun 5 16:04:54 2023 +0800
Failed to download the file and need to delete the created file path
2023-07-05 12:08:05 +02:00
ZhouGongZaiShi
d5f16d77c2
delete meaningless print() ( #2662 )
2023-07-04 11:38:17 +02:00
PiaoYang
630327c4e6
Update compute_embeddings.py ( #2668 )
...
* [Typo] Fix variable name. More readable description.
Update train_yourtts.py
Reformat.
Reformat using black again.
* Add `old_append`. Fix bool argparse.
* Reformat.
2023-07-04 11:37:47 +02:00
ChaseC
8957799e45
fix loading of model and vocoder configs ( #2698 )
2023-07-04 11:32:00 +02:00
Eren Gölge
505ac1aa8f
Bump up to v0.15.5
2023-07-03 11:18:06 +02:00
Eren Gölge
21a3f280de
Bump up to v0.15.4
2023-06-30 15:05:00 +02:00
Eren Gölge
f9cde7bb1b
Bump up to v0.15.3
2023-06-30 14:30:18 +02:00
Eren Gölge
413a345d66
Bump up to v0.15.2
2023-06-30 14:16:47 +02:00
Eren Gölge
cb9c320691
Fixup
2023-06-30 14:13:11 +02:00
Eren Gölge
dfd8d313a2
Bump up to v1.5.1
2023-06-29 17:53:09 +02:00
Eren Gölge
a035b25340
Bump up to v0.15.0
2023-06-28 15:24:20 +02:00
Eren Gölge
34b9a18c47
Fixup
2023-06-28 12:26:04 +02:00
Eren Gölge
91cc11d636
Remove commented codes
2023-06-28 12:14:37 +02:00
Eren Gölge
6b9ebf5aab
Merge branch 'p3_11' into dev
2023-06-28 12:13:04 +02:00
Eren Gölge
c844b6570a
Inference API for 🐶 Bark ( #2685 )
...
* Add bark requirements
* Draft Bark implementation
* Download HF models
* Update synthesizer
* Add bark model
* Make style
* Update pylintrc
* Update model URLs
* Update Bark Config
* Fix here and there
* Make style
* Make lint
* Update requirements
* Update requirements
2023-06-28 11:55:27 +02:00
Eren Gölge
a13b1352a4
Fixup
2023-06-26 19:30:26 +02:00
Eren Gölge
17ac188958
Drop fairseq for Hubert
2023-06-26 19:27:48 +02:00
Eren Gölge
c03768bb53
Make style
2023-06-26 17:16:26 +02:00
Eren Gölge
a1c431e6a9
Fixups
2023-06-26 12:55:18 +02:00
Eren Gölge
a58fb6c01b
Update requirements
2023-06-22 13:53:19 +02:00
Eren Gölge
e888e8a56d
Fix manage
2023-06-22 10:13:20 +02:00
Eren Gölge
fff8b762bc
Merge branch 'dev' into bark
2023-06-21 15:49:05 +02:00
Eren Gölge
4cf8652392
Fix Tortoise load ( #2697 )
...
* Handle missing gpt weights
* Make style
* Fix lint
2023-06-21 15:42:01 +02:00
Eren Gölge
cf98ae04df
Make lint
2023-06-21 12:05:08 +02:00
Eren Gölge
3b9fca2398
Make style
2023-06-21 12:02:06 +02:00
Eren Gölge
0f8932a6a9
Fix here and there
2023-06-21 11:59:27 +02:00
Eren Gölge
03c347b7f3
Update Bark Config
2023-06-21 11:58:18 +02:00
Eren Gölge
695e862aad
Update model URLs
2023-06-21 11:57:46 +02:00
Eren Gölge
f4c88ed677
Make style
2023-06-19 14:22:32 +02:00
Eren Gölge
37b708dac7
Add bark model
2023-06-19 14:16:06 +02:00
Eren Gölge
2364c38d16
Update synthesizer
2023-06-19 14:15:21 +02:00
Eren Gölge
5a31fad502
Download HF models
2023-06-19 14:14:04 +02:00
Eren Gölge
f59da4dba5
Draft Bark implementation
2023-06-12 14:32:39 +02:00
Tsai Meng-Ting
d65819422b
Update stochastic_duration_predictor.py ( #2663 )
...
fix a typo
2023-06-12 11:10:54 +02:00
Eren Gölge
49cf6a5d62
Bump up to v0.14.3
2023-06-06 09:41:59 +02:00
Eren Gölge
8e415732dd
Fixup
2023-06-06 09:41:46 +02:00
Eren Gölge
547a72c97d
Fixup
2023-06-05 22:38:56 +02:00
Eren Gölge
a494f0c92a
Bump up to v0.14.1
2023-06-05 11:29:10 +02:00
Eren Gölge
50b1074779
Make `tts` ready
2023-06-05 11:29:10 +02:00
Eren Gölge
e785d101a1
Port Fairseq TTS models ( #2628 )
...
* Load fairseq models
* Add docs and missing files
* Managing fairseq models and docs for API
* Make style
* Use scarf URL
* Add tests
* Fix URL
* Pass cpu
* Make lint
* Fixup
* Make lint
* fixup
* Fixup
* Change tokenization order
* Update README
* Fixup
* Fixup
2023-06-05 11:15:13 +02:00
Shukrullo Turgunov
0d5e68a09f
fix typo ( #2647 )
...
* fix typo
* typo fix
2023-06-05 09:58:16 +02:00
Reuben Morais
23a7a9a363
Fetch all built-in speakers ( #2626 )
2023-05-22 17:28:08 +02:00
Eren Gölge
aef7f6d980
Bump up to v0.14.1
2023-05-18 11:13:09 +02:00
Eren Gölge
9e99e0f42d
Disable reduction
2023-05-18 11:12:51 +02:00
Eren Gölge
bc0a532c7a
Bump up to v0.14.0
2023-05-16 10:08:41 +02:00
Eren Gölge
4de797bb11
Draft ONNX export for VITS ( #2563 )
...
* Draft ONNX export for VITS
Could not get it to work to output variable length sequence
* Fixup for onnx constant output
* Make style
* Remove commented code
2023-05-16 01:07:56 +02:00
manmay nakhashi
a3d5801c44
Tortoise TTS inference ( #2547 )
...
* initial commit
* Tortoise inference
* revert path change
* style fix
* remove accidental remove
* style fixes
* style fixes
* removed unwanted assets and deps
* remove changes
* remove cvvp
* style fix black
* added tortoise config and updated config and args, refactoring the code
* added tortoise to api
* Pull mel_norm from url
* Use TTS cleaners
* Let download model files
* add ability to pass tortoise presets through coqui api
* fix tests
* fix style and tests
* fix tts commandline for tortoise
* Add config.json to tortoise
* Use kwargs
* Use regular model api for loading tortoise
* Add load from dir to synthesizer
* Fix Tortoise floats
* Use model_dir when there are multiple urls
* Use `synthesize` when exists
* lint fixes and resolve preset bug
* resolve a download bug and update model link
* fix json
* do tortoise inference from voice dir
* fix
* fix test
* fix speaker id and remove assets
* update inference_tests.yml
* replace inference_test.yml
* fix extra dir as None
* fix tests
* remove space
* Reformat docstring
* Add docs
* Update docs
* lint fixes
---------
Co-authored-by: Eren Gölge <egolge@coqui.ai>
Co-authored-by: Eren Gölge <erogol@hotmail.com>
2023-05-16 00:58:21 +02:00
Eren Gölge
9b5822d625
Update VAD for silence trimming. ( #2604 )
...
* Update vad for mp3 and fault tolerance
* Make style
* Remove import
* Remove stupid defaults
2023-05-11 11:09:23 +02:00
Eren Gölge
dfb51e06b2
Add jenny model ( #2603 )
2023-05-08 12:05:40 +02:00
Michael Görner
27e237ed08
use default_factory for audio parameter ( #2576 )
...
Python 3.11 complains about the mutable default and other members
were already adapted to use the factory, so I expect this line just
went unnoticed until now.
2023-05-08 11:17:36 +02:00
prakharpbuf
c1875f68df
typos and minor fixes ( #2508 )
...
* Update tacotron1-2.md
* Update README.md
* Update Tutorial_2_train_your_first_TTS_model.ipynb
* Update synthesizer.py
There is no arg called --speaker_name
* Update formatting_your_dataset.md
* Update AnalyzeDataset.ipynb
* Update AnalyzeDataset.ipynb
* Update AnalyzeDataset.ipynb
* Update finetuning.md
* Update train_yourtts.py
* Update train_yourtts.py
* Update train_yourtts.py
* Update finetuning.md
2023-04-26 15:22:57 +02:00
Eren Gölge
2071088bab
Bump up to v0.13.3
2023-04-17 16:13:35 +02:00
Eren Gölge
1a6a5710fd
Make lint
2023-04-17 15:02:56 +02:00
Eren Gölge
a44a0e1fd2
Update model urls
2023-04-17 14:53:27 +02:00
Eren Gölge
2533a18d62
Add BN tests
2023-04-17 13:37:10 +02:00
Eren Gölge
2d49c05259
Remove import
2023-04-17 13:05:29 +02:00
Eren Gölge
5e5768d784
Fix API
2023-04-17 13:05:19 +02:00
Eren Gölge
cd83991067
Add BN phonemizer
2023-04-17 12:54:00 +02:00
Eren Gölge
36be05290d
Add models
2023-04-17 12:52:32 +02:00
Eren Gölge
e4c5c27854
Bump up to v0.13.2
2023-04-14 10:23:39 +02:00
Eren Gölge
dba5cec497
Merge pull request #2509 from coqui-ai/update_vad
...
Update VAD
2023-04-13 19:35:17 +02:00
Eren Gölge
5a9bda13f3
Make style
2023-04-13 14:19:06 +02:00
Eren Gölge
c9375e4b8b
Make style
2023-04-13 14:17:06 +02:00
Eren Gölge
758ef84cc2
Using 🐸 Studio models with `tts` command
2023-04-13 14:14:41 +02:00
Eren Gölge
537dc0e933
Update VAD
2023-04-13 00:39:46 +02:00
Eren Gölge
e33e7170ed
Bump up to v0.13.1
2023-04-12 16:20:53 +02:00
Eren Gölge
8da3342676
Ping API
2023-04-12 16:20:53 +02:00
Eren Gölge
cbb592b295
Fixup
2023-04-10 14:50:11 +02:00
Eren Gölge
b8b9f09de5
Fixup
2023-04-10 14:06:31 +02:00
Eren Gölge
a49c1931d9
Fixup
2023-04-10 13:33:42 +02:00
Eren Gölge
5bd1fb6b2c
Fix API for voice conversion
2023-04-10 13:32:16 +02:00
Eren Gölge
30109af2a0
Merge pull request #2480 from MattyB95/librosa_v0.10.0
...
Update Librosa Version To V0.10.0
2023-04-07 12:32:33 +02:00
Eren Gölge
1233365cf4
Bump up to v0.13.0
2023-04-05 15:09:31 +02:00
Eren Gölge
ad8b9bf2be
🐸 Coqui Studio API integration ( #2484 )
...
* Warn when lang is not avail
* Make style
* Implement Coqui Studio API
* Test
* Update docs
* Set action
* Make style
* Make lint
* Update README
* Make style
* Fix action
* Run actions
2023-04-05 15:06:50 +02:00
Matthew Boakes
4c829e74a1
Update Librosa Version To V0.10.0
2023-04-05 00:59:20 +01:00