Edresson Casanova
dbe9da7f15
Add Voice conversion inference support ( #1337 )
...
* Add support for voice conversion inference
* Cache d_vectors_by_speaker for fast inference using a bigger speakers.json
* Rebase bug fix
* Use the average d-vector for inference
2022-03-10 14:57:12 +01:00
Eren Gölge
764c7fa4a4
Rename phoneme_cleaners
2022-03-06 12:09:54 +01:00
Eren Gölge
dd4287de1f
Update models
2022-03-03 20:23:00 +01:00
Eren Gölge
d0c27a9661
Update synthesis.py
2022-02-25 11:29:41 +01:00
Eren Gölge
131bc0cfc0
Fix synthesis.py 🔧
2022-02-25 11:18:00 +01:00
Eren Gölge
c9972e6f14
Make lint
2022-02-25 11:07:34 +01:00
Eren Gölge
b6c2bfdf08
Refactor synthesis.py for TTSTokenizer
2022-02-25 11:05:06 +01:00
Eren Gölge
d0eb642d88
Refactor synthesis.py for TTSTokenizer
2022-02-25 10:48:03 +01:00
Eren Gölge
d8ec7086b6
Update `synthesis` for the new API
2022-02-25 10:48:03 +01:00
Eren Gölge
fbad17e084
Update imports for symbols -> characters
2022-02-25 10:48:02 +01:00
Eren Gölge
5a9653978a
Refactor synthesis.py for TTSTokenizer
2022-02-25 10:45:24 +01:00
Edresson Casanova
0860d73cf8
Remove Tensorflow requeriment ( #1225 )
...
* Remove TF modules
* Remove TF unit tests
* Remove TF vocoder modules
* Remove TF convert scripts
* Remove TF requirement
* Remove the Docs TF instructions
* Remove TF inference support
2022-02-10 16:14:54 +01:00
WeberJulian
ffc269eaf4
Update docstring
2021-12-20 11:54:10 +00:00
WeberJulian
120332d53f
Fix phonemes
2021-12-20 11:54:10 +00:00
WeberJulian
4d721bcabd
fix test sentence synthesis
2021-12-20 11:54:10 +00:00
Edresson
ac9416fb86
Add multilingual inference support
2021-12-20 11:54:09 +00:00
Eren Gölge
babdd84f91
Fix GST inference
...
commit d3e477875a7e46a101fcf95a1794442823750fe2
Author: George Rousssos <25833833+george-roussos@users.noreply.github.com>
Date: Wed Nov 3 10:16:12 2021 +0000
Read .wav for GST conditioning from CL
commit 074e6d0874d3b34fb6a4991fc17d66dccd413fbb
Author: George Rousssos <25833833+george-roussos@users.noreply.github.com>
Date: Fri Oct 29 14:43:47 2021 +0100
Fix GST during inference in Tacotron2
commit fdece14585ab5a36eed1061a9a838d8e48aa6882
Author: George Rousssos <25833833+george-roussos@users.noreply.github.com>
Date: Wed Nov 3 10:16:12 2021 +0000
Read .wav for GST conditioning from CL
commit cd29e21b8d0a541ee298d2bf5f67223ad60be38f
Author: George Rousssos <25833833+george-roussos@users.noreply.github.com>
Date: Fri Oct 29 14:43:47 2021 +0100
Fix GST during inference in Tacotron2
commit 908ce39370eadcc9fa8510cdb26c9ead87305427
Author: George Rousssos <25833833+george-roussos@users.noreply.github.com>
Date: Fri Oct 29 12:49:37 2021 +0100
Make trim_db value negative
commit 1008a2e0f72fa7ca7f0307424f570386f2f16d42
Author: George Rousssos <25833833+george-roussos@users.noreply.github.com>
Date: Fri Oct 29 12:22:24 2021 +0100
Set find_endpoint db threshold in config.json
2021-12-07 13:28:49 +00:00
Eren Gölge
aea90e2501
Comment synthesis.py
2021-10-21 13:53:45 +00:00
Eren Gölge
003e5579e8
Enable `custom_symbols` in text processing
...
Models can define their own custom symbols lists with custom
`make_symbols()`
2021-08-09 18:02:36 +00:00
Eren Gölge
00c82c516d
rename to
2021-06-28 17:03:19 +02:00
Eren Gölge
166f0aeb9a
merge if branches with the same implementation
2021-06-28 17:03:19 +02:00
Eren Gölge
03494ad642
adjust `distribute.py` for the `train_tts.py`
2021-06-28 17:03:19 +02:00
Eren Gölge
25238e0658
fix glow-tts `inference()`
2021-06-28 17:03:19 +02:00
Eren Gölge
db6a97d1a2
rename external speaker embedding arguments as `d_vectors`
2021-06-28 17:03:19 +02:00
Eren Gölge
421194880d
linter fixes
2021-06-28 17:03:19 +02:00
Eren Gölge
b500338faa
make style
2021-06-28 17:03:19 +02:00
Eren Gölge
c680a07a20
fix `Synthesized` for the new `synthesis()`
2021-06-28 17:03:19 +02:00
Eren Gölge
b8a4af4010
update `synthesis.py` for being more generic
2021-06-28 17:03:19 +02:00
Eren Gölge
f4f83b6379
update `synthesis.py` for the trainer
2021-06-28 17:03:19 +02:00
Michael Hansen
3f172b84d8
Fix linting issues
2021-06-25 14:41:31 +02:00
Michael Hansen
4d8426fa0a
Use eSpeak IPA lexicons by default for phoneme models
2021-06-25 14:41:05 +02:00
Eren Gölge
9ee70af9bb
code styling
2021-05-11 11:29:18 +02:00
Eren Gölge
647163397d
coqpit refactoring
2021-05-11 11:29:17 +02:00
Eren Gölge
eaa130e813
fix tacotron for coqpit
2021-05-11 11:29:17 +02:00
Eren Gölge
aadb2106ec
code styling
2021-04-23 18:04:37 +02:00
kirianguiller
7dccbfdcd5
handle multi speaker and gst in Synthetizer class
2021-04-23 18:04:37 +02:00
Eren Gölge
e5b9607bc3
isort all imports
2021-04-09 00:45:20 +02:00
Eren Gölge
0e79fa86ad
format with black and pylint 2.7.3
2021-04-09 00:38:08 +02:00
Eren Gölge
2b3e12ea49
correct imports after refactoring, add AlignTTS (old SSMAS) and some formatting
2021-03-30 14:39:16 +02:00
Eren Gölge
9a48ba3821
a ton of linter updates
2021-03-08 05:06:54 +01:00
kirianguiller
9ab07f94e2
modify according to PR reviews
2021-03-08 02:59:48 +01:00
kirianguiller
42ba30eb8f
<add> Chinese mandarin implementation (tacotron2)
2021-03-08 02:59:24 +01:00
kirianguiller
0d4525322c
modify according to PR reviews
2021-03-08 02:57:11 +01:00
kirianguiller
e6fd118cf8
<add> Chinese mandarin implementation (tacotron2)
2021-03-08 02:57:11 +01:00
Eren Gölge
90d4f08d6c
reorder imports
2021-03-08 02:48:31 +01:00
erogol
13c6665c92
inference for SS
2021-01-06 13:19:40 +01:00
erogol
639fa29261
update speaker id casting for glow-tts
2020-12-14 16:58:47 +01:00
erogol
788cd6f902
fix multi-speaker glow-tts inference
2020-12-10 02:05:48 +01:00
erogol
92cc9630d7
fix glow-tts synthesis for DPP
2020-12-10 00:30:34 +01:00
erogol
946a0c0fb9
bug fixes for single speaker glow-tts, enable torch based amp. Make amp optional for wavegrad. Bug fixes for synthesis setup for glow-tts
2020-10-29 15:45:50 +01:00