Commit Graph

41 Commits

Author SHA1 Message Date
Enno Hermann 3c2d5a9e03
Remove duplicate AudioProcessor code and fix ExtractTTSpectrogram.ipynb (#3230)
* chore: remove unused argument

* refactor(audio.processor): remove duplicate stft+griffin_lim

* chore(audio.processor): remove unused compute_stft_paddings

Same function available in numpy_transforms

* refactor(audio.processor): remove duplicate db_to_amp

* refactor(audio.processor): remove duplicate amp_to_db

* refactor(audio.processor): remove duplicate linear_to_mel

* refactor(audio.processor): remove duplicate mel_to_linear

* refactor(audio.processor): remove duplicate build_mel_basis

* refactor(audio.processor): remove duplicate stft_parameters

* refactor(audio.processor): use pre-/deemphasis from numpy_transforms

* refactor(audio.processor): use rms_volume_norm from numpy_transforms

* chore(audio.processor): remove duplicate assert

Already checked in numpy_transforms.compute_f0

* refactor(audio.processor): use find_endpoint from numpy_transforms

* refactor(audio.processor): use trim_silence from numpy_transforms

* refactor(audio.processor): use volume_norm from numpy_transforms

* refactor(audio.processor): use load_wav from numpy_transforms

* fix(bin.extract_tts_spectrograms): set quantization bits

* fix(ExtractTTSpectrogram.ipynb): adapt to current TTS code

Fixes #2447, #2574

* refactor(audio.processor): remove duplicate quantization methods
2023-11-16 10:57:06 +01:00
Eren Gölge 914280a556
Bump up to v0.11.0 (#2329)
* Make style

* Bump up to v0.11.0
2023-02-08 13:58:49 +01:00
Stanislav Kachnov 2c9f00a808
Fix tune wavegrad (#1844)
* fix imports in tune_wavegrad

* load_config returns Coqpit object instead None

* set action (store true) for flag "--use_cuda"; start to tune if module is running as the main program

* fix var order in the result of batch collating

* make style

* make style with black and isort
2022-08-22 09:55:32 +02:00
Edresson Casanova 60034674f9 Remove audio padding before mel spec extraction 2022-05-07 13:12:09 +02:00
Eren Gölge 7058fcc3ff Take file extension as an argument 2022-02-25 11:26:59 +01:00
Eren Gölge 127118c637
Update TTS.tts formatters (#1228)
* Return Dict from tts formatters

* Make style
2022-02-11 23:03:43 +01:00
Eren Gölge e15bc157d8 Fix #873 2021-10-14 14:39:45 +00:00
Eren Gölge 21cc0517a3 Fix WaveRNN test 2021-10-01 10:21:37 +00:00
Eren Gölge 9f23ad6a0f Fix imports 2021-09-30 14:47:56 +00:00
Eren Gölge e20ea57c87 Update comment and add a warning 2021-09-07 12:23:32 +00:00
Eren Gölge 58cc414477 Fix WaveGrad `test_run` 2021-07-16 13:02:25 +02:00
Eren Gölge d7225eedb0 Update `vocoder` datasets and `setup_dataset` 2021-06-28 17:03:20 +02:00
Eren Gölge 8cb27267a4 formatting 2021-05-03 14:26:35 +02:00
Eren Gölge a7f6045644 Merge branch 'reformat' into hifigan-reformat 2021-04-12 12:00:17 +02:00
Eren Gölge f519012dea reformatting and styling 2021-04-12 11:47:39 +02:00
Eren Gölge e5b9607bc3 isort all imports 2021-04-09 00:45:20 +02:00
Eren Gölge 0e79fa86ad format with black and pylint 2.7.3 2021-04-09 00:38:08 +02:00
Eren Gölge cd69da4868 linter fixes #2 2021-04-08 16:57:46 +02:00
Eren Gölge aee24b0704 set different seed in gan_dataset when it is multi-workers 2021-04-08 11:28:30 +02:00
Eren Gölge 57f6bd1afa make using different samples for G and D networks optional 2021-04-08 11:26:01 +02:00
Eren Gölge f890454de3 linter fixes 2021-04-07 12:36:03 +02:00
Eren Gölge b86e7fb2e8 pad short samples when loading precomputed features in vocoder trainign 2021-04-06 16:24:50 +02:00
Eren Gölge 594d8d8f09 linter fixes 2021-03-08 11:22:59 +01:00
Eren Gölge 00b5090974 linter fix 2021-03-08 11:05:30 +01:00
Eren Gölge e15734c3fc linter fix 2021-03-08 05:29:43 +01:00
erogol e3eda159d1 wavegrad_dataset update 2020-11-25 14:50:50 +01:00
erogol c76a617072 linter updates 2020-11-09 13:18:35 +01:00
erogol c80225544e tune wavegrad to fine the best noise schedule for inferece 2020-11-06 13:04:46 +01:00
erogol dc2825dfb2 wavegrad dataset update 2020-10-29 12:31:43 +01:00
erogol 7bcdb7ac35 wavegrad updates 2020-10-29 12:31:43 +01:00
erogol e02cd6a220 initial wavegrad layers model and trainig script 2020-10-29 12:30:37 +01:00
erogol 9d0ae2bfb4 wavernn dataloader handling for short samples and mixed precision training 2020-10-28 12:31:01 +01:00
sanjaesc 2ee47e9568 fix pylint once again 2020-10-27 12:17:38 +01:00
sanjaesc bef3f2020b compute audio feat on dataload 2020-10-27 12:17:38 +01:00
sanjaesc 91e5f8b63d added to device cpu/gpu + formatting 2020-10-27 12:17:38 +01:00
sanjaesc e8294cb9db fixing pylint errors 2020-10-27 12:17:38 +01:00
sanjaesc 878b7c373e added feature preprocessing if not set in config 2020-10-27 12:17:38 +01:00
sanjaesc e495e03ea1 some minor changes to wavernn 2020-10-27 12:17:38 +01:00
Alex K 9c3c7ce2f8 wavernn stuff... 2020-10-27 12:17:38 +01:00
Alex K 6378fa2b07 add initial wavernn support 2020-10-27 12:17:38 +01:00
erogol df19428ec6 rename the project to old TTS 2020-09-09 12:27:23 +02:00