coqui-tts/notebooks/dataset_analysis
SanjaESC 841fb2159b multiprocessing 2020-08-03 16:55:38 +02:00
..
AnalyzeDataset-Copy1.ipynb Mass refactoring 2020-07-17 11:16:05 +02:00
AnalyzeDataset.ipynb Mass refactoring 2020-07-17 11:16:05 +02:00
CheckDatasetSNR.ipynb Mass refactoring 2020-07-17 11:16:05 +02:00
PhonemeCoverage.ipynb multiprocessing 2020-08-03 16:55:38 +02:00
README.md Mass refactoring 2020-07-17 11:16:05 +02:00
analyze.py Mass refactoring 2020-07-17 11:16:05 +02:00

README.md

Simple Notebook to Analyze a Dataset

By the use of this notebook, you can easily analyze a brand new dataset, find exceptional cases and define your training set.

What we are looking in here is reasonable distribution of instances in terms of sequence-length, audio-length and word-coverage.

This notebook is inspired from https://github.com/MycroftAI/mimic2