Update README.md

frog is here
This commit is contained in:
Eren Gölge 2021-03-07 04:01:11 +01:00 committed by GitHub
parent ea60b48605
commit 0914c95398
No known key found for this signature in database
GPG Key ID: 4AEE18F83AFDEB23
1 changed files with 10 additions and 10 deletions

View File

@ -1,7 +1,7 @@
# <img src="images/coqui-log-green-TTS.png" height="56"/>
TTS is a library for advanced Text-to-Speech generation. It's built on the latest research, was designed to achieve the best trade-off among ease-of-training, speed and quality.
TTS comes with [pretrained models](https://github.com/coqui-ai/TTS/wiki/Released-Models), tools for measuring dataset quality and already used in **20+ languages** for products and research projects.
:frog: TTS is a library for advanced Text-to-Speech generation. It's built on the latest research, was designed to achieve the best trade-off among ease-of-training, speed and quality.
:frog: TTS comes with [pretrained models](https://github.com/coqui-ai/TTS/wiki/Released-Models), tools for measuring dataset quality and already used in **20+ languages** for products and research projects.
<!-- [![CircleCI](TODO)]() -->
[![License](<https://img.shields.io/badge/License-MPL%202.0-brightgreen.svg>)](https://opensource.org/licenses/MPL-2.0)
@ -88,18 +88,18 @@ Underlined "TTS*" and "Judy*" are :frog:TTS models
- WaveRNN: [origin](https://github.com/fatchord/WaveRNN/)
- WaveGrad: [paper](https://arxiv.org/abs/2009.00713)
You can also help us implement more models. Some TTS related work can be found [here](https://github.com/erogol/TTS-papers).
You can also help us implement more models. Some :frog: TTS related work can be found [here](https://github.com/erogol/TTS-papers).
## Install TTS
TTS is tested on Ubuntu 18.04 with **python >= 3.6, < 3.9**.
:frog: TTS is tested on Ubuntu 18.04 with **python >= 3.6, < 3.9**.
If you are only interested in [synthesizing speech](https://github.com/coqui-ai/TTS/tree/dev#example-synthesizing-speech-on-terminal-using-the-released-models) with the released TTS models, installing from PyPI is the easiest option.
If you are only interested in [synthesizing speech](https://github.com/coqui-ai/TTS/tree/dev#example-synthesizing-speech-on-terminal-using-the-released-models) with the released :frog: TTS models, installing from PyPI is the easiest option.
```bash
pip install TTS
```
If you plan to code or train models, clone TTS and install it locally.
If you plan to code or train models, clone :frog: TTS and install it locally.
```bash
git clone https://github.com/coqui-ai/TTS
@ -137,11 +137,11 @@ Audio examples: [soundcloud](https://soundcloud.com/user-565970875/pocket-articl
<img src="images/example_model_output.png?raw=true" alt="example_output" width="400"/>
## Datasets and Data-Loading
TTS provides a generic dataloader easy to use for your custom dataset.
:frog: TTS provides a generic dataloader easy to use for your custom dataset.
You just need to write a simple function to format the dataset. Check ```datasets/preprocess.py``` to see some examples.
After that, you need to set ```dataset``` fields in ```config.json```.
Some of the public datasets that we successfully applied TTS:
Some of the public datasets that we successfully applied :frog: TTS:
- [LJ Speech](https://keithito.com/LJ-Speech-Dataset/)
- [Nancy](http://www.cstr.ed.ac.uk/projects/blizzard/2011/lessac_blizzard2011/)
@ -152,9 +152,9 @@ Some of the public datasets that we successfully applied TTS:
## Example: Synthesizing Speech on Terminal Using the Released Models.
After the installation, TTS provides a CLI interface for synthesizing speech using pre-trained models. You can either use your own model or the release models under the TTS project.
After the installation, :frog: TTS provides a CLI interface for synthesizing speech using pre-trained models. You can either use your own model or the release models under :frog: TTS.
Listing released TTS models.
Listing released :frog: TTS models.
```bash
tts --list_models
```