mirror of https://github.com/coqui-ai/TTS.git
🐸 fashion
This commit is contained in:
parent
e9e07844b7
commit
0f4f26c66c
75
README.md
75
README.md
|
@ -1,12 +1,9 @@
|
||||||
<img src="https://user-images.githubusercontent.com/1402048/104139991-3fd15e00-53af-11eb-8640-3a78a64641dd.png" data-canonical-src="
|
# <img src="images/coqui-log-green-TTS.png" height="56"/>
|
||||||
" width="256" height="256" align="right" />
|
|
||||||
|
|
||||||
# TTS: Text-to-Speech for all.
|
|
||||||
|
|
||||||
TTS is a library for advanced Text-to-Speech generation. It's built on the latest research, was designed to achieve the best trade-off among ease-of-training, speed and quality.
|
TTS is a library for advanced Text-to-Speech generation. It's built on the latest research, was designed to achieve the best trade-off among ease-of-training, speed and quality.
|
||||||
TTS comes with [pretrained models](https://github.com/mozilla/TTS/wiki/Released-Models), tools for measuring dataset quality and already used in **20+ languages** for products and research projects.
|
TTS comes with [pretrained models](https://github.com/coqui-ai/TTS/wiki/Released-Models), tools for measuring dataset quality and already used in **20+ languages** for products and research projects.
|
||||||
|
|
||||||
[]()
|
<!-- []() -->
|
||||||
[](https://opensource.org/licenses/MPL-2.0)
|
[](https://opensource.org/licenses/MPL-2.0)
|
||||||
[](https://badge.fury.io/py/TTS)
|
[](https://badge.fury.io/py/TTS)
|
||||||
|
|
||||||
|
@ -22,33 +19,32 @@ Please use our dedicated channels for questions and discussion. Help is much mor
|
||||||
| Type | Platforms |
|
| Type | Platforms |
|
||||||
| ------------------------------- | --------------------------------------- |
|
| ------------------------------- | --------------------------------------- |
|
||||||
| 🚨 **Bug Reports** | [GitHub Issue Tracker] |
|
| 🚨 **Bug Reports** | [GitHub Issue Tracker] |
|
||||||
| ❔ **FAQ** | [TTS/Wiki](https://github.com/mozilla/TTS/wiki/FAQ) |
|
| ❔ **FAQ** | [TTS/Wiki](https://github.com/coqui-ai/TTS/wiki/FAQ) |
|
||||||
| 🎁 **Feature Requests & Ideas** | [GitHub Issue Tracker] |
|
| 🎁 **Feature Requests & Ideas** | [GitHub Issue Tracker] |
|
||||||
| 👩💻 **Usage Questions** | [Discourse Forum] |
|
| 👩💻 **Usage Questions** | [Github Discussions] |
|
||||||
| 🗯 **General Discussion** | [Discourse Forum] and [Matrix Channel] |
|
| 🗯 **General Discussion** | [Github Discussions] and [Matrix Channel] |
|
||||||
|
|
||||||
[github issue tracker]: https://github.com/mozilla/tts/issues
|
[github issue tracker]: https://github.com/coqui-ai/tts/issues
|
||||||
[discourse forum]: https://discourse.mozilla.org/c/tts/
|
[github discussions]: https://github.com/coqui-ai/TTS/discussions
|
||||||
[matrix channel]: https://matrix.to/#/!KTePhNahjgiVumkqca:matrix.org?via=matrix.org
|
[matrix channel]: https://matrix.to/#/!KTePhNahjgiVumkqca:matrix.org?via=matrix.org
|
||||||
[Tutorials and Examples]: https://github.com/mozilla/TTS/wiki/TTS-Notebooks-and-Tutorials
|
[Tutorials and Examples]: https://github.com/coqui-ai/TTS/wiki/TTS-Notebooks-and-Tutorials
|
||||||
|
|
||||||
|
|
||||||
## 🔗 Links and Resources
|
## 🔗 Links and Resources
|
||||||
| Type | Links |
|
| Type | Links |
|
||||||
| ------------------------------- | --------------------------------------- |
|
| ------------------------------- | --------------------------------------- |
|
||||||
| 💾 **Installation** | [TTS/README.md](https://github.com/mozilla/TTS/tree/dev#install-tts)|
|
| 💾 **Installation** | [TTS/README.md](https://github.com/coqui-ai/TTS/tree/dev#install-tts)|
|
||||||
| 👩🏾🏫 **Tutorials and Examples** | [TTS/Wiki](https://github.com/mozilla/TTS/wiki/TTS-Notebooks-and-Tutorials) |
|
| 👩🏾🏫 **Tutorials and Examples** | [TTS/Wiki](https://github.com/coqui-ai/TTS/wiki/TTS-Notebooks-and-Tutorials) |
|
||||||
| 🚀 **Released Models** | [TTS/Wiki](https://github.com/mozilla/TTS/wiki/Released-Models)|
|
| 🚀 **Released Models** | [TTS/Wiki](https://github.com/coqui-ai/TTS/wiki/Released-Models)|
|
||||||
| 💻 **Docker Image** | [Repository by @synesthesiam](https://github.com/synesthesiam/docker-mozillatts)|
|
| 💻 **Docker Image** | [Repository by @synesthesiam](https://github.com/synesthesiam/docker-coqui-aitts)|
|
||||||
| 🖥️ **Demo Server** | [TTS/server](https://github.com/mozilla/TTS/tree/master/TTS/server)|
|
| 🖥️ **Demo Server** | [TTS/server](https://github.com/coqui-ai/TTS/tree/master/TTS/server)|
|
||||||
| 🤖 **Running TTS on Terminal** | [TTS/README.md](https://github.com/mozilla/TTS#example-synthesizing-speech-on-terminal-using-the-released-models)|
|
| 🤖 **Running TTS on Terminal** | [TTS/README.md](https://github.com/coqui-ai/TTS#example-synthesizing-speech-on-terminal-using-the-released-models)|
|
||||||
| ✨ **How to contribute** |[TTS/README.md](#contribution-guidelines)|
|
|
||||||
|
|
||||||
## 🥇 TTS Performance
|
<!-- ## TODO -- 🥇 TTS Performance
|
||||||
<p align="center"><img src="https://discourse-prod-uploads-81679984178418.s3.dualstack.us-west-2.amazonaws.com/optimized/3X/6/4/6428f980e9ec751c248e591460895f7881aec0c6_2_1035x591.png" width="800" /></p>
|
<p align="center"><img src="" width="800" /></p>
|
||||||
|
|
||||||
"Mozilla*" and "Judy*" are our models.
|
"TTS*" and "Judy*" are our models.
|
||||||
[Details...](https://github.com/mozilla/TTS/wiki/Mean-Opinion-Score-Results)
|
[Details...](https://github.com/coqui-ai/TTS/wiki/Mean-Opinion-Score-Results) -->
|
||||||
|
|
||||||
## Features
|
## Features
|
||||||
- High performance Deep Learning models for Text2Speech tasks.
|
- High performance Deep Learning models for Text2Speech tasks.
|
||||||
|
@ -94,9 +90,9 @@ Please use our dedicated channels for questions and discussion. Help is much mor
|
||||||
You can also help us implement more models. Some TTS related work can be found [here](https://github.com/erogol/TTS-papers).
|
You can also help us implement more models. Some TTS related work can be found [here](https://github.com/erogol/TTS-papers).
|
||||||
|
|
||||||
## Install TTS
|
## Install TTS
|
||||||
TTS supports **python >= 3.6, <3.9**.
|
TTS is tested on Ubuntu 18.04 with **python >= 3.6, < 3.9**.
|
||||||
|
|
||||||
If you are only interested in [synthesizing speech](https://github.com/mozilla/TTS/tree/dev#example-synthesizing-speech-on-terminal-using-the-released-models) with the released TTS models, installing from PyPI is the easiest option.
|
If you are only interested in [synthesizing speech](https://github.com/coqui-ai/TTS/tree/dev#example-synthesizing-speech-on-terminal-using-the-released-models) with the released TTS models, installing from PyPI is the easiest option.
|
||||||
|
|
||||||
```bash
|
```bash
|
||||||
pip install TTS
|
pip install TTS
|
||||||
|
@ -105,7 +101,7 @@ pip install TTS
|
||||||
If you plan to code or train models, clone TTS and install it locally.
|
If you plan to code or train models, clone TTS and install it locally.
|
||||||
|
|
||||||
```bash
|
```bash
|
||||||
git clone https://github.com/mozilla/TTS
|
git clone https://github.com/coqui-ai/TTS
|
||||||
pip install -e .
|
pip install -e .
|
||||||
```
|
```
|
||||||
|
|
||||||
|
@ -233,25 +229,16 @@ In case of any error or intercepted execution, if there is no checkpoint yet und
|
||||||
|
|
||||||
You can also enjoy Tensorboard, if you point Tensorboard argument```--logdir``` to the experiment folder.
|
You can also enjoy Tensorboard, if you point Tensorboard argument```--logdir``` to the experiment folder.
|
||||||
|
|
||||||
## Contribution Guidelines
|
## Contribution guidelines
|
||||||
This repository is governed by Mozilla's code of conduct and etiquette guidelines. For more details, please read the [Mozilla Community Participation Guidelines.](https://www.mozilla.org/about/governance/policies/participation/)
|
Please send your Pull Request to ```dev``` branch. Before making a Pull Request, check your changes for basic mistakes and style problems by using a linter. We have cardboardlinter setup in this repository, so for example, if you've made some changes and would like to run the linter on just the changed code, you can use the follow command:
|
||||||
|
|
||||||
1. Create a new branch.
|
|
||||||
2. Implement your changes.
|
|
||||||
3. (if applicable) Add [Google Style](https://google.github.io/styleguide/pyguide.html#381-docstrings) docstrings.
|
|
||||||
4. (if applicable) Implement a test case under ```tests``` folder.
|
|
||||||
5. (Optional but Prefered) Run tests.
|
|
||||||
```bash
|
|
||||||
./run_tests.sh
|
|
||||||
```
|
|
||||||
6. Run the linter.
|
|
||||||
```bash
|
```bash
|
||||||
pip install pylint cardboardlint
|
pip install pylint cardboardlint
|
||||||
cardboardlinter --refspec master
|
cardboardlinter --refspec master
|
||||||
```
|
```
|
||||||
7. Send a PR to ```dev``` branch, explain what the change is about.
|
7. Send a PR to ```dev``` branch, explain what the change is about.
|
||||||
8. Let us discuss until we make it perfect :).
|
8. Let us discuss until we make it perfect :).
|
||||||
9. We merge it to the ```dev``` branch once things look good.
|
9. We merge it to the ```dev``` branch once things look good.
|
||||||
|
|
||||||
Feel free to ping us at any step you need help using our communication channels.
|
Feel free to ping us at any step you need help using our communication channels.
|
||||||
|
|
||||||
|
@ -263,16 +250,6 @@ If you like to use TTS to try a new idea and like to share your experiments with
|
||||||
- Explain your idea and experiment.
|
- Explain your idea and experiment.
|
||||||
- Share your results regularly. (Tensorboard log files, audio results, visuals etc.)
|
- Share your results regularly. (Tensorboard log files, audio results, visuals etc.)
|
||||||
|
|
||||||
## Major TODOs
|
|
||||||
- [x] Implement the model.
|
|
||||||
- [x] Generate human-like speech on LJSpeech dataset.
|
|
||||||
- [x] Generate human-like speech on a different dataset (Nancy) (TWEB).
|
|
||||||
- [x] Train TTS with r=1 successfully.
|
|
||||||
- [x] Enable process based distributed training. Similar to (https://github.com/fastai/imagenet-fast/).
|
|
||||||
- [x] Adapting Neural Vocoder. TTS works with WaveRNN and ParallelWaveGAN (https://github.com/erogol/WaveRNN and https://github.com/erogol/ParallelWaveGAN)
|
|
||||||
- [x] Multi-speaker embedding.
|
|
||||||
- [x] Model optimization (model export, model pruning etc.)
|
|
||||||
|
|
||||||
### Acknowledgement
|
### Acknowledgement
|
||||||
- https://github.com/keithito/tacotron (Dataset pre-processing)
|
- https://github.com/keithito/tacotron (Dataset pre-processing)
|
||||||
- https://github.com/r9y9/tacotron_pytorch (Initial Tacotron architecture)
|
- https://github.com/r9y9/tacotron_pytorch (Initial Tacotron architecture)
|
||||||
|
|
Binary file not shown.
After Width: | Height: | Size: 60 KiB |
Loading…
Reference in New Issue