mirror of https://github.com/coqui-ai/TTS.git
Update readme (#1978)
This commit is contained in:
parent
896e46d0e5
commit
dba2c3570a
19
README.md
19
README.md
|
@ -21,6 +21,8 @@
|
||||||

|

|
||||||
[](https://tts.readthedocs.io/en/latest/)
|
[](https://tts.readthedocs.io/en/latest/)
|
||||||
|
|
||||||
|
🚀 [**Sign up to free 🐸Coqui.ai API and start cloning your voice**](https://coqui.ai/)
|
||||||
|
|
||||||
📰 [**Subscribe to 🐸Coqui.ai Newsletter**](https://coqui.ai/?subscription=true)
|
📰 [**Subscribe to 🐸Coqui.ai Newsletter**](https://coqui.ai/?subscription=true)
|
||||||
|
|
||||||
📢 [English Voice Samples](https://erogol.github.io/ddc-samples/) and [SoundCloud playlist](https://soundcloud.com/user-565970875/pocket-article-wavernn-and-tacotron2)
|
📢 [English Voice Samples](https://erogol.github.io/ddc-samples/) and [SoundCloud playlist](https://soundcloud.com/user-565970875/pocket-article-wavernn-and-tacotron2)
|
||||||
|
@ -75,7 +77,7 @@ Underlined "TTS*" and "Judy*" are 🐸TTS models
|
||||||
- Modular (but not too much) code base enabling easy implementation of new ideas.
|
- Modular (but not too much) code base enabling easy implementation of new ideas.
|
||||||
|
|
||||||
## Implemented Models
|
## Implemented Models
|
||||||
### Text-to-Spectrogram
|
### Spectrogram models
|
||||||
- Tacotron: [paper](https://arxiv.org/abs/1703.10135)
|
- Tacotron: [paper](https://arxiv.org/abs/1703.10135)
|
||||||
- Tacotron2: [paper](https://arxiv.org/abs/1712.05884)
|
- Tacotron2: [paper](https://arxiv.org/abs/1712.05884)
|
||||||
- Glow-TTS: [paper](https://arxiv.org/abs/2005.11129)
|
- Glow-TTS: [paper](https://arxiv.org/abs/2005.11129)
|
||||||
|
@ -84,6 +86,7 @@ Underlined "TTS*" and "Judy*" are 🐸TTS models
|
||||||
- FastPitch: [paper](https://arxiv.org/pdf/2006.06873.pdf)
|
- FastPitch: [paper](https://arxiv.org/pdf/2006.06873.pdf)
|
||||||
- FastSpeech: [paper](https://arxiv.org/abs/1905.09263)
|
- FastSpeech: [paper](https://arxiv.org/abs/1905.09263)
|
||||||
- SC-GlowTTS: [paper](https://arxiv.org/abs/2104.05557)
|
- SC-GlowTTS: [paper](https://arxiv.org/abs/2104.05557)
|
||||||
|
- Capacitron: [paper](https://arxiv.org/abs/1906.03402)
|
||||||
|
|
||||||
### End-to-End Models
|
### End-to-End Models
|
||||||
- VITS: [paper](https://arxiv.org/pdf/2106.06103)
|
- VITS: [paper](https://arxiv.org/pdf/2106.06103)
|
||||||
|
@ -149,12 +152,12 @@ If you are on Windows, 👑@GuyPaddock wrote installation instructions [here](ht
|
||||||
```
|
```
|
||||||
- Get model info (for both tts_models and vocoder_models):
|
- Get model info (for both tts_models and vocoder_models):
|
||||||
- Query by type/name:
|
- Query by type/name:
|
||||||
The model_info_by_name uses the name as it from the --list_models.
|
The model_info_by_name uses the name as it from the --list_models.
|
||||||
```
|
```
|
||||||
$ tts --model_info_by_name "<model_type>/<language>/<dataset>/<model_name>"
|
$ tts --model_info_by_name "<model_type>/<language>/<dataset>/<model_name>"
|
||||||
```
|
```
|
||||||
For example:
|
For example:
|
||||||
|
|
||||||
```
|
```
|
||||||
$ tts --model_info_by_name tts_models/tr/common-voice/glow-tts
|
$ tts --model_info_by_name tts_models/tr/common-voice/glow-tts
|
||||||
```
|
```
|
||||||
|
@ -162,16 +165,16 @@ If you are on Windows, 👑@GuyPaddock wrote installation instructions [here](ht
|
||||||
$ tts --model_info_by_name vocoder_models/en/ljspeech/hifigan_v2
|
$ tts --model_info_by_name vocoder_models/en/ljspeech/hifigan_v2
|
||||||
```
|
```
|
||||||
- Query by type/idx:
|
- Query by type/idx:
|
||||||
The model_query_idx uses the corresponding idx from --list_models.
|
The model_query_idx uses the corresponding idx from --list_models.
|
||||||
```
|
```
|
||||||
$ tts --model_info_by_idx "<model_type>/<model_query_idx>"
|
$ tts --model_info_by_idx "<model_type>/<model_query_idx>"
|
||||||
```
|
```
|
||||||
For example:
|
For example:
|
||||||
|
|
||||||
```
|
```
|
||||||
$ tts --model_info_by_idx tts_models/3
|
$ tts --model_info_by_idx tts_models/3
|
||||||
```
|
```
|
||||||
|
|
||||||
- Run TTS with default models:
|
- Run TTS with default models:
|
||||||
|
|
||||||
```
|
```
|
||||||
|
@ -241,8 +244,6 @@ If you are on Windows, 👑@GuyPaddock wrote installation instructions [here](ht
|
||||||
|- TTS
|
|- TTS
|
||||||
|- bin/ (folder for all the executables.)
|
|- bin/ (folder for all the executables.)
|
||||||
|- train*.py (train your target model.)
|
|- train*.py (train your target model.)
|
||||||
|- distribute.py (train your TTS model using Multiple GPUs.)
|
|
||||||
|- compute_statistics.py (compute dataset statistics for normalization.)
|
|
||||||
|- ...
|
|- ...
|
||||||
|- tts/ (text to speech models)
|
|- tts/ (text to speech models)
|
||||||
|- layers/ (model layer definitions)
|
|- layers/ (model layer definitions)
|
||||||
|
|
Loading…
Reference in New Issue