Eren Gölge
fba257104d
Compute F0 using librosa
2021-09-06 15:16:58 +00:00
Katsuya Iida
165e5814af
Update Japanese phonemizer ( #758 )
...
* Update default ja vocoder
* update
* Japanese phonemizer test
* Run make style
Co-authored-by: Eren Gölge <egolge@coqui.ai>
2021-09-01 09:33:15 +02:00
Eren Gölge
f186856e5d
Add option to sort input sequnce by audio len
2021-08-30 08:10:35 +00:00
Eren Gölge
c312acac7d
Implement VITS model 🚀
...
VITS model implementation built on Glow TTS and HiFiGAN
layers.
2021-08-09 18:02:36 +00:00
Eren Gölge
003e5579e8
Enable `custom_symbols` in text processing
...
Models can define their own custom symbols lists with custom
`make_symbols()`
2021-08-09 18:02:36 +00:00
Eren Gölge
e4648ffef1
Fix multi-speaker init of Tacotron models & tests
2021-08-09 18:02:36 +00:00
Agrin Hilmkil
ced4cfdbbf
Allow saving / loading checkpoints from cloud paths ( #683 )
...
* Allow saving / loading checkpoints from cloud paths
Allows saving and loading checkpoints directly from cloud paths like
Amazon S3 (s3://) and Google Cloud Storage (gs://) by using fsspec.
Note: The user will have to install the relevant dependency for each
protocol. Otherwise fsspec will fail and specify which dependency is
missing.
* Append suffix _fsspec to save/load function names
* Add a lower bound to the fsspec dependency
Skips the 0 major version.
* Add missing changes from refactor
* Use fsspec for remaining artifacts
* Add test case with path requiring fsspec
* Avoid writing logs to file unless output_path is local
* Document the possibility of using paths supported by fsspec
* Fix style and lint
* Add missing lint fixes
* Add type annotations to new functions
* Use Coqpit method for converting config to dict
* Fix type annotation in semi-new function
* Add return type for load_fsspec
* Fix bug where fs not always created
* Restore the experiment removal functionality
2021-08-09 18:02:36 +00:00
Eren Gölge
75b201c6c1
Merge pull request #673 from coqui-ai/fix_stopnet
...
Fix stopnet training for Tacotron models
2021-07-24 12:25:38 +02:00
Eren Gölge
fc0c4600bd
Fix stopnet training
2021-07-24 11:39:54 +02:00
Eren Gölge
30eed347b6
Merge pull request #581 from Edresson/dev
...
Compute speaker embeddings in batch for the LSTM Speaker Encoder and Compute embeddings/ finding chars using config file.
2021-07-23 17:22:51 +02:00
WeberJulian
c79a82ed07
refix linter
2021-07-13 23:12:18 +02:00
Edresson
2e5baffa9c
Merge fix and eval split as argparse
2021-07-13 01:47:32 -03:00
Edresson
4eac1c4651
bug fix on train_encoder and unit tests
2021-07-11 12:00:39 -03:00
Eren Gölge
1e9538aaef
Add more model tests to `test_synthesize`
2021-07-04 11:45:49 +02:00
Eren Gölge
47b3b10d6d
Bump up to v0.1.0 🚀
2021-06-29 13:07:59 +02:00
Eren Gölge
9790eddada
Fix wrong argument name 🛠️
2021-06-28 17:03:47 +02:00
Eren Gölge
626c9d41e6
Update tests for the new trainer API
2021-06-28 17:03:19 +02:00
Eren Gölge
fcfd95669a
Update model test configs
2021-06-28 17:03:19 +02:00
Eren Gölge
00c82c516d
rename to
2021-06-28 17:03:19 +02:00
Eren Gölge
fdfb18d230
downsize melgan test model size
2021-06-28 17:03:19 +02:00
Eren Gölge
25238e0658
fix glow-tts `inference()`
2021-06-28 17:03:19 +02:00
Eren Gölge
82582993cc
use one testing sentence in tts tests
2021-06-28 17:03:19 +02:00
Eren Gölge
419735f440
refactor and fix multi-speaker training in Trainer and Tacotron models
2021-06-28 17:03:19 +02:00
Eren Gölge
269e5a734e
add max_decoder_steps argument to tacotron models
2021-06-28 17:03:19 +02:00
Eren Gölge
304d60197b
reduce multiband melgan test model size
2021-06-28 17:03:19 +02:00
Eren Gölge
db6a97d1a2
rename external speaker embedding arguments as `d_vectors`
2021-06-28 17:03:19 +02:00
Eren Gölge
877bf66b61
reduce size of the metadata.csv used at testing
2021-06-28 17:03:19 +02:00
Eren Gölge
87c61d210a
update test to be less demanding
2021-06-28 17:03:19 +02:00
Eren Gölge
6d6896fd99
reduce fullband-melgan test model size
2021-06-28 17:03:19 +02:00
Eren Gölge
1443d03af1
update test for the new input output API of the tts models
2021-06-28 17:03:19 +02:00
Eren Gölge
ef4ea9e527
update imports for `formatters`
2021-06-28 17:03:19 +02:00
Eren Gölge
6c495c6a6e
fix glow-tts inference and forward functions for handling `cond_input`
...
and refactor its test
2021-06-28 17:03:19 +02:00
Eren Gölge
b500338faa
make style
2021-06-28 17:03:19 +02:00
Eren Gölge
d25f017b42
update `setup_model.py` imports
2021-06-28 17:03:19 +02:00
Eren Gölge
7dff6be871
update tts training tests to use the trainer
2021-06-28 17:03:19 +02:00
Eren Gölge
9134c7dfb6
update `sequence_mask` import globally
2021-06-28 17:03:19 +02:00
Eren Gölge
8def3c87af
trainer-API updates
2021-06-28 17:03:19 +02:00
Eren Gölge
42554cc711
rename MyDataset -> TTSDataset
2021-06-28 17:03:19 +02:00
Edresson Casanova
eb84bb2bc8
Merge branch 'dev' into dev
2021-06-26 15:32:19 -03:00
Eren Gölge
6c7bbcaef0
Use `en-us` for testing phoneme models
2021-06-25 16:52:17 +02:00
Michael Hansen
a41f53fe72
Fix silly error in tests
2021-06-25 14:41:35 +02:00
Michael Hansen
3f172b84d8
Fix linting issues
2021-06-25 14:41:31 +02:00
Michael Hansen
4d8426fa0a
Use eSpeak IPA lexicons by default for phoneme models
2021-06-25 14:41:05 +02:00
Michael Hansen
47191f3ecc
Add tests for gruut phonemization
2021-06-25 14:41:05 +02:00
Edresson
28bec238ca
fix Lint checks
2021-06-18 14:33:50 -03:00
Edresson
83644056e3
fix Lint checks
2021-06-18 14:32:28 -03:00
Eren Gölge
db48c69f0f
reduce fullband melgan model size for testing
2021-06-02 11:44:53 +02:00
Eren Gölge
49c5e5d820
maket style japanese PR
2021-06-02 11:44:46 +02:00
Eren Gölge
0c14278c30
reorg test files
2021-06-02 11:40:26 +02:00
Eren Gölge
73b4083c6c
Merge pull request #502 from kaiidams/kaiidams/kokoro
...
Japanese Tacotron 2 model
2021-06-02 10:20:08 +02:00
Katsuya Iida
1cc18d1972
Move unittest of Japanese phonemizer.
2021-06-01 18:51:34 +09:00
Eren Gölge
bec85ac58d
make style
2021-05-31 16:37:15 +02:00
Eren Gölge
301c516abd
Merge branch 'dev' of https://github.com/coqui-ai/TTS into dev
2021-05-31 15:46:25 +02:00
Edresson
cc192b6843
add resnet speaker encoder train unit test
2021-05-29 22:43:41 -03:00
Eren Gölge
925c08cf95
replace unidecode with anyascii
2021-05-27 14:02:44 +02:00
Edresson
bc5307caa0
add unit tests for SoftmaxAngleProtoLoss and ResnetSpeakerEncoder and bugfix
2021-05-26 20:35:58 -03:00
Edresson
7a9a27282a
fix unit tests
2021-05-26 18:14:06 -03:00
Eren Gölge
f5a9950a2e
phoneme files for testing
2021-05-19 03:16:57 +02:00
Eren Gölge
9b706c5583
enable phonemes in test to match the attention masks
2021-05-19 03:15:54 +02:00
Eren Gölge
8a7c40736c
set use_phonemes false
2021-05-19 01:27:26 +02:00
Eren Gölge
faedea4b60
set use_phonemes False in configs
2021-05-19 00:08:07 +02:00
Eren Gölge
ccfaa6b1d5
add `needs_phonemizer` field to models.json. If set true these models
...
are only compatible with v0.0.13 or below.
2021-05-18 17:57:28 +02:00
Eren Gölge
a14fcf2a13
remove text_processing test
2021-05-18 17:57:28 +02:00
Eren Gölge
12722501bb
styling
2021-05-15 23:48:31 +02:00
Eren Gölge
7e02cff924
reduce pwgan test batch size
2021-05-12 16:10:08 +02:00
Eren Gölge
da49089a72
update melgan training test batch size
2021-05-12 10:12:11 +02:00
Eren Gölge
0213e1cbf4
update configs for tts models to match the field typed with the expected
...
values
2021-05-12 00:57:38 +02:00
Eren Gölge
715b0a65a0
update main.yml for python x64
...
fix test
2021-05-12 00:57:29 +02:00
Eren Gölge
843d1b3d98
linter fixes
2021-05-11 11:30:00 +02:00
Eren Gölge
19fb1d743d
style update
2021-05-11 11:30:00 +02:00
Eren Gölge
5aee30443f
refactoring tests after Coqpit
2021-05-11 11:30:00 +02:00
Eren Gölge
87384c6008
get_device_id() for tests
2021-05-11 11:29:18 +02:00
Eren Gölge
21dd4d7960
fix load_config imports for Coqpit
2021-05-11 11:29:18 +02:00
Eren Gölge
9f2d2d2081
add speaker encoder train test
2021-05-11 11:29:18 +02:00
Eren Gölge
9ee70af9bb
code styling
2021-05-11 11:29:18 +02:00
Eren Gölge
70fc7a7e71
remove bash running tests
2021-05-11 11:29:18 +02:00
Eren Gölge
1be45eae38
add new tests for vocoder trainings using coqpit
2021-05-11 11:29:18 +02:00
Eren Gölge
045f1c3e76
add hifigan train test
2021-05-11 11:29:18 +02:00
Eren Gölge
e6f45b9eb7
update train_vocoder_gan.py for coqpit
2021-05-11 11:29:18 +02:00
Eren Gölge
bcebd69d09
remove bash tts training tests
2021-05-11 11:29:17 +02:00
Eren Gölge
51a7e06945
glow_tts_config.py and train test on python
2021-05-11 11:29:17 +02:00
Eren Gölge
c6df8de80a
remove output train folder at the end of the test
2021-05-11 11:29:17 +02:00
Eren Gölge
35341d5482
move bash script based tests to python with coqpit
2021-05-11 11:29:17 +02:00
Eren Gölge
a21c0b5585
config update 2 WIP
2021-05-11 11:28:35 +02:00
Eren Gölge
f7582107da
Merge pull request #453 from Edresson/dev
...
Script for spectrogram extraction using teacher forcing and Glow-TTS inference with MAS.
2021-05-06 17:53:28 +02:00
Edresson
070227d2ab
change python3 to python in the extract tts espectrograms script to avoid incompatibility
2021-05-06 11:32:42 -03:00
Edresson
65860a954a
remove unused vars on test glow tts
2021-05-05 07:15:36 -03:00
Edresson
d78f27ea41
bugfix on tacotron unit test
2021-05-05 06:38:01 -03:00
Edresson
e3f56b613b
update unit test for extract tts spectrograms script
2021-05-04 20:45:07 -03:00
Eren Gölge
8cb27267a4
formatting
2021-05-03 14:26:35 +02:00
Edresson
3ecd556bbe
add unit test for extract tts spectrograms script
2021-05-01 13:41:56 -03:00
Edresson
bb82f4ae8b
add unit test for GlowTTS inference with MAS
2021-04-29 19:39:09 -03:00
Eren Gölge
f02f0338c2
fix .models.json and add testing to check released models availability
2021-04-29 09:32:36 +02:00
Eren Gölge
6353e87166
fix test
2021-04-27 15:04:20 +02:00
Eren Gölge
628abfe644
remove test
2021-04-27 14:35:39 +02:00
Eren Gölge
1235e54738
test for synthesize.py
2021-04-27 14:17:38 +02:00
Eren Gölge
19d9f58009
create dummy model on the fly
2021-04-27 13:27:24 +02:00
Eren Gölge
2f0716073e
enable multi-speaker CoquiTTS models for synthesize.py
2021-04-26 19:36:53 +02:00
Eren Gölge
f37b488876
Merge branch 'speaker-manager' of https://github.com/coqui-ai/TTS into speaker-manager
2021-04-26 15:25:25 +02:00
Eren Gölge
b82daa5e86
style and linter fixes
2021-04-26 15:22:24 +02:00
Eren Gölge
4cf211348d
styling and linting
2021-04-23 18:04:37 +02:00
Eren Gölge
a878d8fb42
update tests
2021-04-23 18:04:37 +02:00
Eren Gölge
c80d21f311
load speaker_encoder_ap and compute x_vector directly from the input file in speaker manager
2021-04-23 18:04:37 +02:00
Eren Gölge
32e6afc009
add `SpeakerManager` tests
2021-04-23 18:04:37 +02:00
Eren Gölge
757dfb9289
add `SpeakerManager` tests
2021-04-21 13:28:43 +02:00
Eren Gölge
a7f6045644
Merge branch 'reformat' into hifigan-reformat
2021-04-12 12:00:17 +02:00
Eren Gölge
f519012dea
reformatting and styling
2021-04-12 11:47:39 +02:00
Eren Gölge
3c0d1d066d
update permissions of all the shell scripts
2021-04-10 20:04:27 +02:00
Eren Gölge
5fd2333c91
move bash based tests to a separate folder and rename the calling script file. Fix align_tts training test.
2021-04-09 03:07:58 +02:00
Eren Gölge
87ee6ceb57
style update #3
2021-04-09 01:17:15 +02:00
Eren Gölge
e5b9607bc3
isort all imports
2021-04-09 00:45:20 +02:00
Eren Gölge
0e79fa86ad
format with black and pylint 2.7.3
2021-04-09 00:38:08 +02:00
Eren Gölge
5de7eb708b
fix the config
2021-04-09 00:36:49 +02:00
Eren Gölge
cd69da4868
linter fixes #2
2021-04-08 16:57:46 +02:00
Eren Gölge
4d3e1e9d9a
linter fix
2021-04-08 14:57:46 +02:00
Eren Gölge
53f54898bc
small fixes
2021-04-08 14:22:47 +02:00
Eren Gölge
a7fb49843b
update test config
2021-04-08 13:18:03 +02:00
Eren Gölge
ba80e82520
udpate gan_datasets tests
2021-04-08 11:52:35 +02:00
Eren Gölge
a669a492c6
test melgan feature loss
2021-04-07 12:36:56 +02:00
Eren Gölge
c8953f4da9
l1_spec_loss parameters in melgan test config
2021-04-07 12:36:36 +02:00
Eren Gölge
7a382a5c2b
stowed aligntts commit and small refactoring with feed_forward layers
2021-03-30 14:39:16 +02:00
Eren Gölge
2c364c0df8
test fixes
2021-03-30 14:39:16 +02:00
Eren Gölge
2b3e12ea49
correct imports after refactoring, add AlignTTS (old SSMAS) and some formatting
2021-03-30 14:39:16 +02:00
Eren Gölge
ecb6b0d6ad
rename GlowTtts as GlowTTS
2021-03-30 14:39:16 +02:00
Eren Gölge
d9c405f0c3
create feedforward folder for SS layers
2021-03-30 14:39:16 +02:00
Eren Gölge
fd5c24c3b8
add missing config file
2021-03-18 13:33:23 +01:00
Eren Gölge
0514330869
fix mozilla/TTS#685
2021-03-18 13:33:23 +01:00
Eren Gölge
43672e206c
test config for tacotron model
2021-03-18 13:33:00 +01:00
Eren Gölge
65533f33e9
fix #374
2021-03-18 13:33:00 +01:00
Eren Gölge
d790d2fccb
linter fix
2021-03-18 13:33:00 +01:00
WeberJulian
f6cd8e0ecc
test case
2021-03-18 13:33:00 +01:00
Eren Gölge
4cc162a8b5
enable model manager test
2021-03-10 18:20:06 +01:00
Eren Gölge
f24d48c245
linter fix
2021-03-08 16:08:23 +01:00
Eren Gölge
94805236fb
Merge branch 'dev' of https://github.com/coqui-ai/TTS into dev
2021-03-08 15:21:06 +01:00
Eren Gölge
1e169a5ba5
bunch of short fixes
2021-03-08 14:51:27 +01:00
Eren Gölge
947e3d6a93
rename test
2021-03-08 14:50:54 +01:00
Eren Gölge
00bfa54e21
update server tests
2021-03-08 14:07:22 +01:00
Eren Gölge
dd4f46b60f
disable model manager test until moving models to github releases
2021-03-08 14:06:58 +01:00
Eren Gölge
9a48ba3821
a ton of linter updates
2021-03-08 05:06:54 +01:00
gerazov
2451a813a2
refactored keep_all_best
2021-03-08 02:57:11 +01:00
gerazov
62147994d4
added keep_best and keep_after to test configs.
2021-03-08 02:56:36 +01:00
Eren Gölge
ee71eb4eb7
linter fixes
2021-03-08 02:54:47 +01:00
Eren Gölge
55fc50b26d
update test_text_processing for espeak-ng
2021-03-08 02:54:47 +01:00
Eren Gölge
8ec28b1ac2
linterup date
2021-03-08 02:54:47 +01:00
Eren Gölge
e06c93fe81
model_manager tests
2021-03-08 02:54:47 +01:00
Eren Gölge
6cd642c2e1
add missing phonemes to test_config.json
2021-03-08 02:54:47 +01:00
Eren Gölge
a2e7403ae8
test_compute_statistics.sh
2021-03-08 02:54:47 +01:00
Eren Gölge
3c961370e7
linter fixes
2021-03-08 02:54:21 +01:00
Eren Gölge
83e50757ae
use single process dataloder in tests
2021-03-08 02:54:21 +01:00
Eren Gölge
d57560dcd3
reduce glowtts size for testing
2021-03-08 02:54:21 +01:00