Julian Weber
a07397733b
Multilingual tokenizer ( #2229 )
...
* Implement multilingual tokenizer
* Add multi_phonemizer recipe
* Fix lint
* Add TestMultiPhonemizer
* Fix lint
* make style
2023-01-02 10:03:19 +01:00
Eren Gölge
fdeefcc612
Handle espeak 1.48.15 ( #2203 )
2022-12-12 11:23:45 +01:00
Eren Gölge
b95cf3363c
Prevent installing mecab-ko ( #1967 )
2022-09-14 10:28:07 +02:00
happylittlecat
4546b4cbd8
Add espeak support for Chinese ( #1905 )
...
* fix description
* add espeak support for chinese
* add espeak support for chinese
2022-09-08 12:32:41 +02:00
harmlessman
5abbe56642
Korean Phonemizer ( #1822 )
...
* Update requirements.txt
install jamo for korean
* Update formatters.py
add KSS formatter
KSS is a Korean single-speaker speech dataset (12 hours)
* Add files via upload
add phonemizer for korean
* Add files via upload
add korean phonemizer
* Update requirements.txt
* change code style with `black` and `pylint`
* reflecting pylint's Evaluation
* reflecting pylint's Evaluation
* reflecting pylint's Evaluation-2
* isort
* edit about separator
write test case and add 'nltk' for requirements.txt
* add korean g2p (g2pkk)
* isort
* TTS/tts/utils/text/phonemizers/ko_kr_phonemizer.py:43:24: W0621: Redefining name 'text' from outer scope (line 58) (redefined-outer-name)
TTS/tts/utils/text/korean/korean.py:28:8: R1705: Unnecessary "else" after "return" (no-else-return)
* black
2022-09-08 12:06:07 +02:00
rbaraglia
75ac9e3f0c
Fix language flags generated by espeak-ng phonemizer ( #1801 )
...
* fix language flags generated by espeak-ng phonemizer
* Style
* Updated language flag regex to consider all language codes alike
2022-08-07 13:57:40 +02:00
WeberJulian
5cef6facb0
Fix tokenizer for punc only ( #1717 )
2022-07-06 22:59:41 +02:00
Edresson Casanova
a97eed696a
Fix the bug in eSpeak wrapper for eSpeak version 1.48.15 ( #1560 )
2022-05-12 15:15:18 +02:00
WeberJulian
c66a6241fd
Enforce phonemizer definition for synthesis ( #1441 )
...
* Enforce phonemizer definition for synthesis
* Fix train_tts, tokenizer init can now edit config
* Add small change to trigger CI pipeline
* fix wrong output path for one tts_test
* Fix style
* Test config overrides by args and tokenizer
* Fix style
2022-03-25 23:15:33 +01:00
Eren Gölge
1c3623af33
Fix model manager ( #1436 )
...
* Fix manager
* Make style
2022-03-23 12:57:14 +01:00
WeberJulian
690c96ed28
Fix default phonemizer for ja and zh ( #1399 )
2022-03-16 12:13:22 +01:00
Eren Gölge
e9d9028b4d
Revert cleaner name
2022-03-06 12:57:06 +01:00
Eren Gölge
764c7fa4a4
Rename phoneme_cleaners
2022-03-06 12:09:54 +01:00
Eren Gölge
dd4287de1f
Update models
2022-03-03 20:23:00 +01:00
Eren Gölge
690de1ab06
Update Characters and add more tests
2022-02-25 11:32:44 +01:00
Eren Gölge
1e414b3a09
Make style
2022-02-25 11:31:56 +01:00
Eren Gölge
bed4afd4ee
Implement BaseVocabulary
2022-02-25 11:31:56 +01:00
Eren Gölge
424d04e4f6
Make style
2022-02-25 11:31:56 +01:00
Eren Gölge
35fc7270ff
Implement BaseTTS
2022-02-25 11:28:47 +01:00
Eren Gölge
2bad098625
Implement BaseVocabulary
2022-02-25 11:28:47 +01:00
Eren Gölge
8622226f3f
Make style
2022-02-25 11:26:59 +01:00
Eren Gölge
590b04fb89
Fix espeak_wrapper
2022-02-25 11:26:59 +01:00
Eren Gölge
d5c0e17548
Load right char class dynamically
2022-02-25 11:26:59 +01:00
Eren Gölge
2fe16de8e3
Make lint
2022-02-25 11:25:00 +01:00
Eren Gölge
07b0a80d57
Fix tokenizer init_from_config
2022-02-25 11:24:13 +01:00
Eren Gölge
bde68d9f25
Use the same phonemizer for `en` to `en-us`
2022-02-25 11:07:34 +01:00
Eren Gölge
8649d4fd36
Allow None pad and blank tokens
2022-02-25 11:07:34 +01:00
Eren Gölge
c9972e6f14
Make lint
2022-02-25 11:07:34 +01:00
Eren Gölge
196ae74273
Update data loader tests
2022-02-25 11:05:06 +01:00
Eren Gölge
98057a00ae
Make style
2022-02-25 10:57:35 +01:00
Eren Gölge
04202da1ac
Make style
2022-02-25 10:48:03 +01:00
Eren Gölge
3b63d713b9
Fix espeak wrapper cmd call
2022-02-25 10:48:03 +01:00
Eren Gölge
4894998e6b
Fix print_logs
2022-02-25 10:48:03 +01:00
Eren Gölge
4e8f9d6f10
Fix IPAPhonemes init_from_config
2022-02-25 10:48:03 +01:00
Eren Gölge
0fe39166fe
Discard OOV chars in tokenizer
...
Discard but store OOV chars, with a warning message
when an OOV char is first encountered
2022-02-25 10:48:03 +01:00
Eren Gölge
c39aaafbfc
Update EspeakWrapper for espeak-ng
2022-02-25 10:48:03 +01:00
Eren Gölge
87bf940676
Print duplicate characters
2022-02-25 10:48:03 +01:00
Eren Gölge
4e83bf3968
Allow choosing phonemizer
2022-02-25 10:48:02 +01:00
Eren Gölge
22f0c58fe1
Print language codes
2022-02-25 10:48:02 +01:00
Eren Gölge
693fb4dd39
Modify init_from_config for IPAPhonemes
2022-02-25 10:48:02 +01:00
Eren Gölge
ba3b60c90f
Test TTSTokenizer
2022-02-25 10:48:02 +01:00
Eren Gölge
79a84410f2
Test punctuations
2022-02-25 10:48:02 +01:00
Eren Gölge
d8bdeb8b8f
Fix Punctuation
2022-02-25 10:48:02 +01:00
Eren Gölge
ff7c385838
Fix BasePhonemizer
2022-02-25 10:48:02 +01:00
Eren Gölge
10d435ce77
Fixup
2022-02-25 10:48:02 +01:00
Eren Gölge
f0655bfffc
Fix ja_jp_phonemizer
2022-02-25 10:48:02 +01:00
Eren Gölge
20e5dd3678
Add doc examples
2022-02-25 10:48:02 +01:00
Eren Gölge
fbad17e084
Update imports for symbols -> characters
2022-02-25 10:48:02 +01:00
Eren Gölge
a1df4f9887
Test character classes
2022-02-25 10:45:24 +01:00
Eren Gölge
2480bbe937
Remove OLD TOKENIZATION ROUTINES
2022-02-25 09:32:54 +01:00