coqui-tts/notebooks/dataset_analysis
Guy Elsmore-Paddock 49484f0937 Clean-up `CheckSpectrograms` Notebook
- Fixes how parameters from the config are loaded while still making it
  possible for them to be overridden on the fly.
- Prints the list of audio files by index.
- Defines a constant to control which audio files gets loaded for all
  sections.
- Eliminates having to load the audio processor twice.
- Removes output (since it isn't relevant).
2021-03-20 22:17:37 -04:00
..
AnalyzeDataset.ipynb [ci skip] remove notebook outputs 2021-03-18 14:08:45 +01:00
CheckDatasetSNR.ipynb Merge branch 'dev' of https://github.com/coqui-ai/TTS into dev 2021-03-18 14:09:47 +01:00
CheckSpectrograms.ipynb Clean-up `CheckSpectrograms` Notebook 2021-03-20 22:17:37 -04:00
PhonemeCoverage.ipynb [ci skip] remove notebook outputs 2021-03-18 14:08:45 +01:00
README.md Mass refactoring 2020-07-17 11:16:05 +02:00
analyze.py mass linter fix 2020-08-04 14:07:47 +02:00

README.md

Simple Notebook to Analyze a Dataset

By the use of this notebook, you can easily analyze a brand new dataset, find exceptional cases and define your training set.

What we are looking in here is reasonable distribution of instances in terms of sequence-length, audio-length and word-coverage.

This notebook is inspired from https://github.com/MycroftAI/mimic2