# Tacotron (Work in Progress...)

This repository contains a PyTorch implementation of:

- Tacotron: [A Fully End-to-End Text-To-Speech Synthesis Model](https://arxiv.org/abs/1703.10135)
- Tacotron2 (TODO): [Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions](https://arxiv.org/pdf/1712.05884.pdf)

Eventually, it should be easy to add new models and try different architectures.

A brief note on possible TTS architectures and how they compare is available [here](https://www.evernote.com/shard/s146/sh/9544e7e9-d372-4610-a7b7-3ddcb63d5dac/d01d33837dab625229dec3cfb4cfb887).

## Requirements

Using [miniconda](https://conda.io/miniconda.html) is highly recommended for easier installation.

* Python 3.6
* PyTorch > 0.2.0
* TODO

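
As a rough sketch, the version requirements listed above could be checked with a small script like the one below. It is not part of this repository: the helper names `parse_version`, `is_newer_than`, and `check_environment` are hypothetical, treating Python 3.6 as a minimum is an assumption, and the script assumes `torch.__version__` is a dotted version string, possibly with a suffix.

```python
import re
import sys


def parse_version(text, width=3):
    """Turn '0.3.1' or '1.0.0+cu92' into a tuple such as (0, 3, 1).

    Only the leading dotted numeric part is used; any suffix is ignored.
    The result is padded with zeros (or truncated) to `width` components.
    """
    match = re.match(r"\d+(?:\.\d+)*", text)
    if match is None:
        raise ValueError("no numeric version found in {!r}".format(text))
    parts = [int(p) for p in match.group(0).split(".")][:width]
    return tuple(parts + [0] * (width - len(parts)))


def is_newer_than(version, minimum):
    """True if `version` is strictly greater than `minimum` (the list says '> 0.2.0')."""
    return parse_version(version) > parse_version(minimum)


def check_environment():
    """Return a list of human-readable problems; an empty list means every check passed."""
    problems = []
    # Assumption: "python 3.6" in the requirements is read as a minimum version.
    if sys.version_info < (3, 6):
        problems.append("Python 3.6 or newer is expected")
    try:
        import torch  # imported lazily so the script still runs without PyTorch
    except ImportError:
        problems.append("PyTorch is not installed")
    else:
        if not is_newer_than(torch.__version__, "0.2.0"):
            problems.append(
                "PyTorch {} is not newer than 0.2.0".format(torch.__version__)
            )
    return problems


if __name__ == "__main__":
    for problem in check_environment():
        print("WARNING:", problem)
```

Running the script inside the conda environment prints one warning per unmet requirement and prints nothing when both checks pass.
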
## Data

TODO

## File description

TODO

## Training the network

TODO

## Generate TTS wav file

TODO