Jindrich Matousek
|
c312343585
|
Language of each item (sample/utterance) is set to dataset language only when not defined at the sample/utterance level
Speaker name is prefixed with the dataset name in case of multispeaker datasets
Refactor "artic" formatter
|
2023-09-06 17:05:47 +02:00 |
Jindrich Matousek
|
a0db2eeee8
|
Fix: add `is_eval` when calling `get_sampler` to respect training/validation
|
2023-09-06 13:59:24 +02:00 |
Jindrich Matousek
|
0938f1cfa1
|
Merge branch 'coqui-ai:main' into main
|
2023-09-04 14:27:19 +02:00 |
Eren Gölge
|
9533f8656c
|
Make style
|
2023-09-04 13:58:37 +02:00 |
Eren Gölge
|
562a9509f2
|
Add BE model
|
2023-09-04 13:57:03 +02:00 |
Eren Gölge
|
b4c82685a7
|
Add model entries
|
2023-09-04 13:04:58 +02:00 |
T145
|
cdc971ff74
|
Fixed spectrogram checking on librosa 0.10.x (#2899)
|
2023-09-04 12:58:27 +02:00 |
Cohee
|
b3b1555d82
|
Fix exception handling in manage.py (#2912)
|
2023-09-04 12:54:30 +02:00 |
Eren Gölge
|
33b5e87b56
|
Merge branch 'dev' into main
|
2023-09-04 12:52:38 +02:00 |
Eren Gölge
|
40b527345f
|
Bump up to v0.16.6
|
2023-09-04 12:51:53 +02:00 |
Eren Gölge
|
d1d95707bd
|
Update docs (#2919)
|
2023-09-04 12:28:36 +02:00 |
Unik
|
32b8ebb633
|
Updated scipy version (#2914)
|
2023-09-04 11:39:19 +02:00 |
Aleś Bułojčyk
|
fead04f779
|
Add phonemizer for Belarusian language (#2856)
|
2023-08-28 11:20:45 +02:00 |
Jake Tae
|
b79b6f0762
|
feature: add device flag to tts cli (#2875)
|
2023-08-28 11:20:12 +02:00 |
Jake Tae
|
fa0cbd78fe
|
Update README with new device API (#2876)
* docs: update readme w/ .to(device) api
* docs: add .to(device) in python quickstart
* docs: move section header out of comment
* chore: use device instead of hard-coded string
* docs: update inference.md
|
2023-08-28 11:19:00 +02:00 |
Jindrich Matousek
|
5504e13570
|
Merge branch 'coqui-ai:main' into main
|
2023-08-26 16:34:47 +02:00 |
Eren Gölge
|
530a8939fe
|
Merge pull request #2894 from coqui-ai/dev
v0.16.5
|
2023-08-26 12:00:48 +02:00 |
Eren Gölge
|
c0b5e61749
|
Bump up to v0.16.5
|
2023-08-26 12:00:25 +02:00 |
Eren Gölge
|
a7a96d08dd
|
Fix loading Bark (#2893)
* Fixup hubert path
* Make style
|
2023-08-26 11:59:00 +02:00 |
Eren Gölge
|
04a36a727b
|
Bump up to v0.16.4
|
2023-08-26 10:39:48 +02:00 |
Eren Gölge
|
a96562a750
|
Update .models.json
|
2023-08-26 10:36:40 +02:00 |
Jindrich Matousek
|
37807fef8b
|
Add vctk_wav formatter: it is the same as vctk but uses wav extension instead of flac
|
2023-08-23 11:52:14 +01:00 |
Jindrich Matousek
|
4085a229fe
|
Merge remote-tracking branch 'upstream/main'
|
2023-08-21 17:22:20 +01:00 |
Jake Tae
|
409db505d2
|
Add device support in TTS and Synthesizer (#2855)
* fix: resolve merge conflicts
* fix: retain backwards compatibility in functions
* feature: utilize device for voice transfer
* feature: use device for vocoder
* chore: cleanup vocoder cpu logic
* fix: add necessary vocoder output device check
* fix: add necessary vocoder output device check
* fix: indentation
* fix: check if waveform is pt tensor before cpu conversion
---------
Co-authored-by: Jake Tae <jaketae@Jakes-MacBook-Pro-2.local>
|
2023-08-14 21:04:44 +02:00 |
Julian Weber
|
febcaf710a
|
Add customizable data home path (#2871)
* Add customizable data home path
* Add TTS_HOME as an option
|
2023-08-14 21:02:48 +02:00 |
Eren Gölge
|
c4e5effab9
|
Bump up to v0.16.3
|
2023-08-13 12:22:04 +02:00 |
Michael New
|
1f9d600b83
|
Denote human voices in README.md (#2851)
|
2023-08-13 12:15:17 +02:00 |
Eren Gölge
|
3a104d5c49
|
Update Studio API for XTTS (#2861)
* Update Studio API for XTTS
* Update the docs
* Update README.md
* Update README.md
Update README
|
2023-08-13 12:04:12 +02:00 |
Eren Gölge
|
37b558ccb9
|
Make style
|
2023-08-11 12:55:23 +02:00 |
Eren Gölge
|
9a8352b8da
|
Fix import error with Bark
|
2023-08-11 03:33:59 +02:00 |
Eren Gölge
|
c87377b713
|
Bump up to v0.16.2
|
2023-08-07 13:21:14 +02:00 |
Eren Gölge
|
4186f42b21
|
Handle missing JA phonemizer (#2843)
* Handle missing JA phonemizer
* Make style
|
2023-08-07 13:19:38 +02:00 |
Eren Gölge
|
48f8133eae
|
Fix imports (#2845)
|
2023-08-07 13:19:26 +02:00 |
Jindrich Matousek
|
874143bf04
|
Add support for phone (char) based length scale
Remove length_scale from default aux_input
|
2023-08-06 13:17:53 +02:00 |
Jindrich Matousek
|
d3661d7d26
|
Fix artic_multispeaker formatter
|
2023-08-05 10:30:53 +02:00 |
Javier
|
4e7f8cd021
|
Add fairseq onnx support and strict configuration, fixes some onnx errors (#2831)
|
2023-08-04 11:02:59 +02:00 |
ChaseC
|
52a528cfcf
|
add post functionality to /api/tts (#2836)
|
2023-08-04 10:54:20 +02:00 |
Eren Gölge
|
dc04baa1ee
|
Bump up to v0.16.1
|
2023-07-31 15:54:45 +02:00 |
Eren Gölge
|
17ddd65741
|
Please p3.11
|
2023-07-31 15:53:19 +02:00 |
Eren Gölge
|
69f080eb47
|
Fix DelightfulTTS (#2823)
* Fix tests
* Make style
|
2023-07-31 13:52:45 +02:00 |
Eren Gölge
|
483888b9d8
|
Add kwargs to ignore extra arguments w/o error (#2822)
|
2023-07-31 11:37:35 +02:00 |
AWAS666
|
9e74b51aa6
|
Delightful TTS VCTK recipe fixes (#2808)
* fix: wrong import class
* fix: formatter name missing
* feat: get rid of clearml
|
2023-07-31 10:27:42 +02:00 |
Aleś Bułojčyk
|
d124f78430
|
Recipe for Belarusian TTS (#2756)
* Changes from jhlfrfufyfn <jhlfrfufyfn@gmail.com>
* Recipe for Belarusian TTS
---------
Co-authored-by: jhlfrfufyfn <jhlfrfufyfn@gmail.com>
|
2023-07-31 10:26:21 +02:00 |
Javier
|
c140df5a58
|
Adds multi-language support for VITS onnx, fixes onnx inference error when speaker_id is None or not passed, fixes onnx exporting for models with init_discriminator=false (#2816)
|
2023-07-31 10:19:49 +02:00 |
Eren Gölge
|
b739326503
|
Bump up to v0.16.0
|
2023-07-24 16:04:10 +02:00 |
Eren Gölge
|
8aacb81849
|
Fix Tortoise load (#2791)
* Remove key pruning in tortoise
* Make lint
|
2023-07-24 13:42:47 +02:00 |
Eren Gölge
|
b3472a739e
|
Update README.md
|
2023-07-24 13:42:20 +02:00 |
logan hart
|
6fdb88f8e2
|
Add Delightful-TTS implementation (#2095)
* add configs
* Update config file
* Add model configs
* Add model layers
* Add layer files
* Add layer modules
* change config names
* Add emotion manager
* Fix missing ap bug
* Fix missing ap bug
* Add base TTS e2e class
* Fix wrong variable name in load_tts_samples
* Add training script
* Remove range predictor and gaussian upsampling
* Add helper function
* Add vctk recipe
* Add conformer docs
* Fix linting in conformer.py
* Add Docs
* remove duplicate import
* refactor args
* Fix bugs
* Remove emotion embedding
* remove unused arg
* Remove emotion embedding arg
* Remove emotion embedding arg
* fix style issues
* Fix bugs
* Fix bugs
* Add unittests
* make style
* fix formatter bug
* fix test
* Add pyworld compute pitch func
* Update requirements.txt
* Fix dataset Bug
* Change layer norm to instance norm
* Add missing import
* Remove emotions.py
* remove ssim loss
* Add init layers func to aligner
* refactor model layers
* remove audio_config arg
* Rename loss func
* Rename to delightful-tts
* Rename loss func
* Remove unused modules
* refactor imports
* replace audio config with audio processor
* Add change sample rate option
* remove broken resample func
* update recipe
* fix style, add config docs
* fix tests and multispeaker embd dim
* remove pyworld
* Make style and fix inference
* Split tts tests
* Fixup
* Fixup
* Fixup
* Add argument names
* Set "random" speaker in the model Tortoise/Bark
* Use a different f0_cache path for delightful tts
* Fix delightful speaker handling
* Fix lint
* Make style
---------
Co-authored-by: loganhart420 <loganartpersonal@gmail.com>
Co-authored-by: Eren Gölge <erogol@hotmail.com>
|
2023-07-24 13:41:26 +02:00 |
Eren Gölge
|
f24c5e0276
|
Update README
|
2023-07-24 13:30:19 +02:00 |
Eren Gölge
|
1652598a33
|
Test synthesize api separately
|
2023-07-24 12:38:20 +02:00 |