🐸 fashion

Eren Gölge 2021-03-05 02:46:33 +01:00
parent e9e07844b7
commit 0f4f26c66c
2 changed files with 26 additions and 49 deletions

@ -1,12 +1,9 @@
<img src="https://user-images.githubusercontent.com/1402048/104139991-3fd15e00-53af-11eb-8640-3a78a64641dd.png" data-canonical-src="![TTS banner](https://user-images.githubusercontent.com/1402048/104139991-3fd15e00-53af-11eb-8640-3a78a64641dd.png =250x250) # <img src="images/coqui-log-green-TTS.png" height="56"/>
" width="256" height="256" align="right" />
# TTS: Text-to-Speech for all.
TTS is a library for advanced Text-to-Speech generation. It's built on the latest research, was designed to achieve the best trade-off among ease-of-training, speed and quality. TTS is a library for advanced Text-to-Speech generation. It's built on the latest research, was designed to achieve the best trade-off among ease-of-training, speed and quality.
TTS comes with [pretrained models](https://github.com/mozilla/TTS/wiki/Released-Models), tools for measuring dataset quality and already used in **20+ languages** for products and research projects. TTS comes with [pretrained models](https://github.com/coqui-ai/TTS/wiki/Released-Models), tools for measuring dataset quality and already used in **20+ languages** for products and research projects.
[![CircleCI](<https://circleci.com/gh/mozilla/TTS/tree/dev.svg?style=svg>)]() <!-- [![CircleCI](TODO)]() -->
[![License](<https://img.shields.io/badge/License-MPL%202.0-brightgreen.svg>)](https://opensource.org/licenses/MPL-2.0) [![License](<https://img.shields.io/badge/License-MPL%202.0-brightgreen.svg>)](https://opensource.org/licenses/MPL-2.0)
[![PyPI version](https://badge.fury.io/py/TTS.svg)](https://badge.fury.io/py/TTS) [![PyPI version](https://badge.fury.io/py/TTS.svg)](https://badge.fury.io/py/TTS)
@ -22,33 +19,32 @@ Please use our dedicated channels for questions and discussion. Help is much mor
| Type | Platforms | | Type | Platforms |
| ------------------------------- | --------------------------------------- | | ------------------------------- | --------------------------------------- |
| 🚨 **Bug Reports** | [GitHub Issue Tracker] | | 🚨 **Bug Reports** | [GitHub Issue Tracker] |
| ❔ **FAQ** | [TTS/Wiki](https://github.com/mozilla/TTS/wiki/FAQ) | | ❔ **FAQ** | [TTS/Wiki](https://github.com/coqui-ai/TTS/wiki/FAQ) |
| 🎁 **Feature Requests & Ideas** | [GitHub Issue Tracker] | | 🎁 **Feature Requests & Ideas** | [GitHub Issue Tracker] |
| 👩‍💻 **Usage Questions** | [Discourse Forum] | | 👩‍💻 **Usage Questions** | [Github Discussions] |
| 🗯 **General Discussion** | [Discourse Forum] and [Matrix Channel] | | 🗯 **General Discussion** | [Github Discussions] and [Matrix Channel] |
[github issue tracker]: https://github.com/mozilla/tts/issues [github issue tracker]: https://github.com/coqui-ai/tts/issues
[discourse forum]: https://discourse.mozilla.org/c/tts/ [github discussions]: https://github.com/coqui-ai/TTS/discussions
[matrix channel]: https://matrix.to/#/!KTePhNahjgiVumkqca:matrix.org?via=matrix.org [matrix channel]: https://matrix.to/#/!KTePhNahjgiVumkqca:matrix.org?via=matrix.org
[Tutorials and Examples]: https://github.com/mozilla/TTS/wiki/TTS-Notebooks-and-Tutorials [Tutorials and Examples]: https://github.com/coqui-ai/TTS/wiki/TTS-Notebooks-and-Tutorials
## 🔗 Links and Resources ## 🔗 Links and Resources
| Type | Links | | Type | Links |
| ------------------------------- | --------------------------------------- | | ------------------------------- | --------------------------------------- |
| 💾 **Installation** | [TTS/README.md](https://github.com/mozilla/TTS/tree/dev#install-tts)| | 💾 **Installation** | [TTS/README.md](https://github.com/coqui-ai/TTS/tree/dev#install-tts)|
| 👩🏾‍🏫 **Tutorials and Examples** | [TTS/Wiki](https://github.com/mozilla/TTS/wiki/TTS-Notebooks-and-Tutorials) | | 👩🏾‍🏫 **Tutorials and Examples** | [TTS/Wiki](https://github.com/coqui-ai/TTS/wiki/TTS-Notebooks-and-Tutorials) |
| 🚀 **Released Models** | [TTS/Wiki](https://github.com/mozilla/TTS/wiki/Released-Models)| | 🚀 **Released Models** | [TTS/Wiki](https://github.com/coqui-ai/TTS/wiki/Released-Models)|
| 💻 **Docker Image** | [Repository by @synesthesiam](https://github.com/synesthesiam/docker-mozillatts)| | 💻 **Docker Image** | [Repository by @synesthesiam](https://github.com/synesthesiam/docker-coqui-aitts)|
| 🖥️ **Demo Server** | [TTS/server](https://github.com/mozilla/TTS/tree/master/TTS/server)| | 🖥️ **Demo Server** | [TTS/server](https://github.com/coqui-ai/TTS/tree/master/TTS/server)|
| 🤖 **Running TTS on Terminal** | [TTS/README.md](https://github.com/mozilla/TTS#example-synthesizing-speech-on-terminal-using-the-released-models)| | 🤖 **Running TTS on Terminal** | [TTS/README.md](https://github.com/coqui-ai/TTS#example-synthesizing-speech-on-terminal-using-the-released-models)|
| ✨ **How to contribute** |[TTS/README.md](#contribution-guidelines)|
## 🥇 TTS Performance <!-- ## TODO -- 🥇 TTS Performance
<p align="center"><img src="https://discourse-prod-uploads-81679984178418.s3.dualstack.us-west-2.amazonaws.com/optimized/3X/6/4/6428f980e9ec751c248e591460895f7881aec0c6_2_1035x591.png" width="800" /></p> <p align="center"><img src="" width="800" /></p>
"Mozilla*" and "Judy*" are our models. "TTS*" and "Judy*" are our models.
[Details...](https://github.com/mozilla/TTS/wiki/Mean-Opinion-Score-Results) [Details...](https://github.com/coqui-ai/TTS/wiki/Mean-Opinion-Score-Results) -->
## Features ## Features
- High performance Deep Learning models for Text2Speech tasks. - High performance Deep Learning models for Text2Speech tasks.
@ -94,9 +90,9 @@ Please use our dedicated channels for questions and discussion. Help is much mor
You can also help us implement more models. Some TTS related work can be found [here](https://github.com/erogol/TTS-papers). You can also help us implement more models. Some TTS related work can be found [here](https://github.com/erogol/TTS-papers).
## Install TTS ## Install TTS
TTS supports **python >= 3.6, <3.9**. TTS is tested on Ubuntu 18.04 with **python >= 3.6, < 3.9**.
If you are only interested in [synthesizing speech](https://github.com/mozilla/TTS/tree/dev#example-synthesizing-speech-on-terminal-using-the-released-models) with the released TTS models, installing from PyPI is the easiest option. If you are only interested in [synthesizing speech](https://github.com/coqui-ai/TTS/tree/dev#example-synthesizing-speech-on-terminal-using-the-released-models) with the released TTS models, installing from PyPI is the easiest option.
```bash ```bash
pip install TTS pip install TTS
@ -105,7 +101,7 @@ pip install TTS
If you plan to code or train models, clone TTS and install it locally. If you plan to code or train models, clone TTS and install it locally.
```bash ```bash
git clone https://github.com/mozilla/TTS git clone https://github.com/coqui-ai/TTS
pip install -e . pip install -e .
``` ```
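The interpreter range quoted in the install section (python >= 3.6, < 3.9) can be checked before installing. The following is a minimal sketch, not part of the TTS repository: the helper name `is_supported` and the two constants are hypothetical, and the bounds are taken only from the README text above.

```python
import sys

# Bounds copied from the README text: python >= 3.6, < 3.9.
# These constants and the helper below are illustrative assumptions,
# not part of the TTS package.
MIN_VERSION = (3, 6)
MAX_VERSION_EXCLUSIVE = (3, 9)


def is_supported(version_info=None):
    """Return True if (major, minor) lies in the range [3.6, 3.9)."""
    if version_info is None:
        version_info = sys.version_info
    major_minor = tuple(version_info[:2])
    return MIN_VERSION <= major_minor < MAX_VERSION_EXCLUSIVE


if __name__ == "__main__":
    if is_supported():
        print("Python version is within the tested range.")
    else:
        print("Python %d.%d is outside the tested range (>= 3.6, < 3.9)."
              % sys.version_info[:2])
```

Running the script with the interpreter you intend to use for `pip install TTS` reports whether that interpreter falls inside the stated range. It does not verify anything else about the environment.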
@ -233,25 +229,16 @@ In case of any error or intercepted execution, if there is no checkpoint yet und
You can also enjoy Tensorboard, if you point Tensorboard argument```--logdir``` to the experiment folder. You can also enjoy Tensorboard, if you point Tensorboard argument```--logdir``` to the experiment folder.
## Contribution Guidelines ## Contribution guidelines
This repository is governed by Mozilla's code of conduct and etiquette guidelines. For more details, please read the [Mozilla Community Participation Guidelines.](https://www.mozilla.org/about/governance/policies/participation/) Please send your Pull Request to ```dev``` branch. Before making a Pull Request, check your changes for basic mistakes and style problems by using a linter. We have cardboardlinter setup in this repository, so for example, if you've made some changes and would like to run the linter on just the changed code, you can use the following command:
1. Create a new branch.
2. Implement your changes.
3. (if applicable) Add [Google Style](https://google.github.io/styleguide/pyguide.html#381-docstrings) docstrings.
4. (if applicable) Implement a test case under ```tests``` folder.
5. (Optional but Preferred) Run tests.
```bash
./run_tests.sh
```
6. Run the linter.
```bash ```bash
pip install pylint cardboardlint pip install pylint cardboardlint
cardboardlinter --refspec master cardboardlinter --refspec master
``` ```
7. Send a PR to ```dev``` branch, explain what the change is about. 7. Send a PR to ```dev``` branch, explain what the change is about.
8. Let us discuss until we make it perfect :). 8. Let us discuss until we make it perfect :).
9. We merge it to the ```dev``` branch once things look good. 9. We merge it to the ```dev``` branch once things look good.
Feel free to ping us at any step you need help using our communication channels. Feel free to ping us at any step you need help using our communication channels.
@ -263,16 +250,6 @@ If you like to use TTS to try a new idea and like to share your experiments with
- Explain your idea and experiment. - Explain your idea and experiment.
- Share your results regularly. (Tensorboard log files, audio results, visuals etc.) - Share your results regularly. (Tensorboard log files, audio results, visuals etc.)
## Major TODOs
- [x] Implement the model.
- [x] Generate human-like speech on LJSpeech dataset.
- [x] Generate human-like speech on a different dataset (Nancy) (TWEB).
- [x] Train TTS with r=1 successfully.
- [x] Enable process based distributed training. Similar to (https://github.com/fastai/imagenet-fast/).
- [x] Adapting Neural Vocoder. TTS works with WaveRNN and ParallelWaveGAN (https://github.com/erogol/WaveRNN and https://github.com/erogol/ParallelWaveGAN)
- [x] Multi-speaker embedding.
- [x] Model optimization (model export, model pruning etc.)
### Acknowledgement ### Acknowledgement
- https://github.com/keithito/tacotron (Dataset pre-processing) - https://github.com/keithito/tacotron (Dataset pre-processing)
- https://github.com/r9y9/tacotron_pytorch (Initial Tacotron architecture) - https://github.com/r9y9/tacotron_pytorch (Initial Tacotron architecture)

Binary file not shown. Size: 60 KiB