Edresson Casanova
8d228ab22a
Trick to Upsampling to High sampling rates using VITS model ( #1456 )
...
* Add upsample VITS support
* Fix the bug in inference
* Fix lint checks
* Add RMS based norm in save_wav method
* Style fix
* Add the period for VITS multi-period discriminator in model_args
* Bug fix in speaker encoder load in inference time
* Add unit tests
* Remove useless detach_z_vocoder parameter
* Add docs for VITS upsampling
* Fix the docs
* Rename TTS_part_sample_rate to encoder_sample_rate
* Add upsampling_init and upsampling_z methods
* Add asserts for encoder_sample_rate part
* Move upsampling tests to test_vits.py
2022-04-26 11:47:46 +02:00
Edresson Casanova
060e0f9368
Add EmbeddingManager and BaseIDManager ( #1374 )
2022-03-31 13:41:16 +02:00
WeberJulian
1b22f03e98
Fix G2P backend of the released models ( #1461 )
...
* Fix enforce phonemizer
* Add new models
* Fix .model.json
2022-03-30 12:47:11 +02:00
WeberJulian
c66a6241fd
Enforce phonemizer definition for synthesis ( #1441 )
...
* Enforce phonemizer definition for synthesis
* Fix train_tts, tokenizer init can now edit config
* Add small change to trigger CI pipeline
* fix wrong output path for one tts_test
* Fix style
* Test config overides by args and tokenizer
* Fix style
2022-03-25 23:15:33 +01:00
Eren Gölge
0870a4faa2
Make style ( #1405 )
2022-03-16 12:13:55 +01:00
Edresson Casanova
dbe9da7f15
Add Voice conversion inference support ( #1337 )
...
* Add support for voice conversion inference
* Cache d_vectors_by_speaker for fast inference using a bigger speakers.json
* Rebase bug fix
* Use the average d-vector for inference
2022-03-10 14:57:12 +01:00
Eren Gölge
942df0fb05
Update vits dataset
2022-03-02 09:14:32 +01:00
Eren Gölge
1f0c8179da
Make style
2022-02-25 11:26:59 +01:00
Eren Gölge
1445a46e9e
Update synthesizer to use iinit_from_config
2022-02-25 11:26:59 +01:00
Eren Gölge
2fe16de8e3
Make lint
2022-02-25 11:25:00 +01:00
Eren Gölge
c9972e6f14
Make lint
2022-02-25 11:07:34 +01:00
Eren Gölge
9bb347a52b
Update for tokenizer API
2022-02-25 11:05:06 +01:00
Eren Gölge
84091096a6
Refactor Synthesizer class for TTSTokenizer
2022-02-25 11:05:06 +01:00
Eren Gölge
1df1d6c4a9
Update for tokenizer API
2022-02-25 10:48:03 +01:00
Eren Gölge
3476be30d7
Refactor Synthesizer class for TTSTokenizer
2022-02-25 10:48:03 +01:00
Eren Gölge
acc6eef625
Update for tokenizer API
2022-02-25 10:48:02 +01:00
Eren Gölge
3d86edfc81
Refactor Synthesizer class for TTSTokenizer
2022-02-25 09:32:54 +01:00
Eren Gölge
fc09e319d4
Prioritize the given encoder path over config
2022-01-03 14:24:19 +00:00
Eren Gölge
7fad969a1f
Fix if else statement
2022-01-03 14:16:11 +00:00
Eren Gölge
8fd1ee1926
Print urls when BadZipError
2022-01-01 15:26:35 +00:00
Eren Gölge
61874bc0a0
Fix your_tts inference from the listed models
2021-12-31 13:45:05 +00:00
Eren Gölge
5c5ddd2ba7
Init speaker manager for speaker encoder
2021-12-22 15:51:53 +00:00
Eren Gölge
56378b12f7
Fix speaker encoder init
2021-12-21 12:26:25 +00:00
Eren Gölge
c9c1fa0548
Fix multi-speaker init in Synthesizer
2021-12-21 09:44:07 +00:00
Eren Gölge
473414d4af
Implement init_speaker_encoder and change arg names
2021-12-20 11:54:10 +00:00
Eren Gölge
35a781fb90
Fix synthesizer reading `use_language_embedding`
2021-12-20 11:54:10 +00:00
Eren Gölge
704dddcffa
Make style
2021-12-20 11:54:10 +00:00
WeberJulian
54b7fb4e4a
Fix zoo tests
2021-12-20 11:54:10 +00:00
WeberJulian
a564eb9f54
Add support for multi-lingual models in CLI
2021-12-20 11:54:10 +00:00
Eren Gölge
d227aaebcc
Print when using Griffin-Lim in Synthesizer
2021-11-01 16:52:26 +01:00
George
37eaefc085
Optional silence trimming during inference and find_endpoint() fix ( #898 )
...
* Set find_endpoint db threshold in config.json
* Optional silence trimming during inference
* Make trim_db value negative
2021-10-29 18:28:55 +02:00
Eren Gölge
1987aaaaed
Update d-vector reshape in synthesizer
2021-10-21 13:53:25 +00:00
Eren Gölge
0a3d1cc7ee
Pass speaker manager to the model in synthesizer
2021-10-20 18:11:36 +00:00
Eren Gölge
700b056117
Update Synthesizer multi-speaker handling
2021-10-15 10:21:12 +00:00
Eren Gölge
aacbb3ed77
Fix SpeakerManager usage in `synthesize.py`
2021-09-06 15:16:58 +00:00
fijipants
8f57f8adfd
Update synthesizer.py
2021-08-18 19:56:52 -04:00
Eren Gölge
dd55960732
Update `synthesizer.py`
...
Fixes and changes for multi-speaker model init and custom symbols made
by mode.make_symbols()
2021-08-09 18:02:36 +00:00
Eren Gölge
168f97cbe9
Let `Synthesizer` use the speaker manager out of the model
2021-07-02 10:47:55 +02:00
Eren Gölge
e30f245e06
Update `synthesizer` for speaker and model init
2021-06-28 17:03:47 +02:00
Eren Gölge
db6a97d1a2
rename external speaker embedding arguments as `d_vectors`
2021-06-28 17:03:19 +02:00
Eren Gölge
b500338faa
make style
2021-06-28 17:03:19 +02:00
Eren Gölge
c680a07a20
fix `Synthesized` for the new `synthesis()`
2021-06-28 17:03:19 +02:00
Eren Gölge
d25f017b42
update `setup_model.py` imports
2021-06-28 17:03:19 +02:00
Eren Gölge
34f8a74e4d
remove `truncated` from synthesizer
2021-06-28 17:03:19 +02:00
Eren Gölge
19fb1d743d
style update
2021-05-11 11:30:00 +02:00
Eren Gölge
6e980b49c4
fix synthesizer.py for Coqpit
2021-05-11 11:29:18 +02:00
Eren Gölge
21dd4d7960
fix load_config imports for Coqpit
2021-05-11 11:29:18 +02:00
Eren Gölge
734e6a515c
bug fix
2021-04-27 10:27:45 +02:00
Eren Gölge
2f0716073e
enable multi-speaker CoquiTTS models for synthesize.py
2021-04-26 19:36:53 +02:00
Eren Gölge
f37b488876
Merge branch 'speaker-manager' of https://github.com/coqui-ai/TTS into speaker-manager
2021-04-26 15:25:25 +02:00