From 0605411c2ec17adb4a231c8925fedd4900eb2312 Mon Sep 17 00:00:00 2001
From: erogol
Date: Mon, 9 Nov 2020 17:57:33 +0100
Subject: [PATCH] update readme add latest model updates

---
 README.md | 5 +++--
 1 file changed, 3 insertions(+), 2 deletions(-)

diff --git a/README.md b/README.md
index 472b504b..e3c24d3b 100644
--- a/README.md
+++ b/README.md
@@ -47,6 +47,7 @@ Speaker Encoder:
 Vocoders:
 - MelGAN: [paper](https://arxiv.org/abs/1710.10467)
 - MultiBandMelGAN: [paper](https://arxiv.org/abs/2005.05106)
+- ParallelWaveGAN: [paper](https://arxiv.org/abs/1910.11480)
 - GAN-TTS discriminators: [paper](https://arxiv.org/abs/1909.11646)
 - WaveRNN: [origin][https://github.com/fatchord/WaveRNN/]
 - WaveGrad: [paper][https://arxiv.org/abs/2009.00713]
@@ -203,7 +204,7 @@ If you like to use TTS to try a new idea and like to share your experiments with
 - [x] Train TTS with r=1 successfully.
 - [x] Enable process based distributed training. Similar to (https://github.com/fastai/imagenet-fast/).
 - [x] Adapting Neural Vocoder. TTS works with WaveRNN and ParallelWaveGAN (https://github.com/erogol/WaveRNN and https://github.com/erogol/ParallelWaveGAN)
-- [ ] Multi-speaker embedding.
+- [x] Multi-speaker embedding.
 - [x] Model optimization (model export, model pruning etc.)