Commit Graph

1185 Commits

Author SHA1 Message Date
Eren Gölge 5ee01e5624 implement DeepSpeechConfig 2021-10-01 12:42:46 +00:00
Eren Gölge e033b3d7bb Add mfcc to BaseAudioConfig 2021-10-01 12:42:46 +00:00
Eren Gölge aee483f0a9 Implement STT downloaders 2021-10-01 12:42:46 +00:00
Eren Gölge f60eea701f Implement STT tokenizer 2021-10-01 12:42:46 +00:00
Eren Gölge 1787b2303a Implement STTDataset 2021-10-01 12:42:46 +00:00
Eren Gölge 0c7a2eb948 Implement BaseSTT 2021-10-01 12:42:46 +00:00
Eren Gölge d2323f0d98 Implement DeepSpeech 2021-10-01 12:42:46 +00:00
Eren Gölge 89cbfbc829 Add initial data downloaders for stt 2021-10-01 12:42:46 +00:00
Eren Gölge 4157e99d2d Allow custom padding value 2021-10-01 12:42:46 +00:00
Eren Gölge 355dfee98d Add mfcc to AudioProcessor 2021-10-01 12:42:46 +00:00
Eren Gölge 21cc0517a3 Fix WaveRNN test 2021-10-01 10:21:37 +00:00
Eren Gölge 4dbe7ed0de Fix all-zero duration case for GlowTTS 2021-10-01 09:24:26 +00:00
Eren Gölge 37959ad0c7 Make linter 2021-09-30 23:02:16 +00:00
Eren Gölge 0b1986384f Make style 2021-09-30 16:21:18 +00:00
Eren Gölge 7edbe04fe0 Fix WaveRNN config and test 2021-09-30 16:20:12 +00:00
Eren Gölge 55d9209221 Remote STT tokenizer 2021-09-30 14:58:26 +00:00
Eren Gölge ba2b8c827f Update `train_tts.py` and `train_vocoder.py` 2021-09-30 14:47:56 +00:00
Eren Gölge 2e9b6b4f90 Refactor Speaker Encoder training 2021-09-30 14:47:56 +00:00
Eren Gölge 043dca61b4 Rename `load_meta_data` as `load_tts_data` 2021-09-30 14:47:56 +00:00
Eren Gölge 9f23ad6a0f Fix imports 2021-09-30 14:47:56 +00:00
Eren Gölge 16b70be0dd Add `_set_model_args` to BaseModel 2021-09-30 14:47:56 +00:00
Eren Gölge 9a0d8fa027 Update `copy_model_files()` 2021-09-30 14:47:56 +00:00
Eren Gölge 4163b4f2e4 Update Tacotron models 2021-09-30 14:47:56 +00:00
Eren Gölge e27feade38 Fixup wavernn 2021-09-30 14:47:56 +00:00
Eren Gölge 45889804c2 Update VITS 2021-09-30 14:47:56 +00:00
Eren Gölge 4f94f91305 Update WaveRNN 2021-09-30 14:47:56 +00:00
Eren Gölge 3d5205d66f Update WaveGrad 2021-09-30 14:47:56 +00:00
Eren Gölge fd95926009 Update GlowTTS 2021-09-30 14:47:56 +00:00
Eren Gölge 4baecdf92a Update GAN for Trainer_v2 2021-09-30 14:47:56 +00:00
Eren Gölge a156a40b47 Update ForwardTTS for Trainer_v2 2021-09-30 14:19:19 +00:00
Eren Gölge d9df33f837 Update `align_tts` for trainer_v2 2021-09-30 14:18:10 +00:00
Eren Gölge 8ada870a57 Refactor `trainer.py` for v2 2021-09-30 14:16:34 +00:00
Eren Gölge 7f388f26e3 Bump up to v0.3.1 2021-09-17 23:53:22 +00:00
Eren Gölge 2766dd1d6e
Fix #813 - GlowTTS training (#814)
* Fix #813

* Update glow_tts recipe

* Fix glow-tts test

* Linter fix

* Run data dep init only in training
2021-09-17 20:06:55 +02:00
Eren Gölge f563415052 Bump up to v0.3.0 2021-09-13 09:40:38 +00:00
Eren Gölge a97dc8d09f Fix trainer malformatted print 2021-09-13 08:32:02 +00:00
Eren Gölge 91bebebe18 Add new models to `.models.json`
SpeedySpeech model using `ForwardTTS`
UnivNet model fine-tuned on TacotronDDC_ph spectrograms
2021-09-13 08:22:14 +00:00
Eren Gölge 1ea011571a Update SpeedySpeech config 2021-09-12 15:33:27 +00:00
Eren Gölge cbbc9e0172 Add FastSpeechConfig 2021-09-11 10:20:37 +00:00
Eren Gölge 26f76fce22 Remove SpeedySpeech from .models.json 2021-09-10 17:47:27 +00:00
Eren Gölge d97952611d Remove unused import 2021-09-10 17:31:41 +00:00
Eren Gölge 7d8f77385a Use `glow-tts` in synthesis tests 2021-09-10 17:27:33 +00:00
Eren Gölge d5f256b34c Update tacotron `r` init 2021-09-10 17:26:23 +00:00
Eren Gölge ab37fa9c39 Edit AlignTTS 2021-09-10 17:25:00 +00:00
Eren Gölge 66732025e1 Add `base_model` field to `forward_tts` configs 2021-09-10 17:23:48 +00:00
Eren Gölge d6e29ef98a Style update 2021-09-10 08:30:33 +00:00
Eren Gölge a89eb12aca Fix glow_tts imports 2021-09-10 08:29:51 +00:00
Eren Gölge 570d5971be Implement `ForwardTTSLoss` 2021-09-10 08:29:12 +00:00
Eren Gölge 0541a25e90 Remove `fastpitch.py` and `speedy_speech.py` 2021-09-10 08:27:48 +00:00
Eren Gölge 3c16013199 Fix Vits imports 2021-09-10 08:26:34 +00:00