coqui-tts/README.md

31 lines
855 B
Markdown

# Tacotron (Work in Progress...)
Here we have pytorch implementation of:
- Tacotron: [A Fully End-to-End Text-To-Speech Synthesis Model](https://arxiv.org/abs/1703.10135).
- Tacotron2 (TODO): [Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions](https://arxiv.org/pdf/1712.05884.pdf)
At the end, it should be easy to add new models and try different architectures.
You can find [here](https://www.evernote.com/shard/s146/sh/9544e7e9-d372-4610-a7b7-3ddcb63d5dac/d01d33837dab625229dec3cfb4cfb887) a brief note about possible TTS architectures and their comparisons.
## Requirements
Highly recommended to use [miniconda](https://conda.io/miniconda.html) for easier installation.
* python 3.6
* pytorch > 0.2.0
* TODO
## Data
TODO
## File description
TODO
## Training the network
TODO
## Generate TTS wav file
TODO