Edresson Casanova
11283fce07
Ensures that only GPT model is in training mode during XTTS GPT training ( #3241 )
...
* Ensures that only GPT model is in training mode during training
* Fix parallel wavegan unit test
2023-11-17 15:13:46 +01:00
Enno Hermann
3c2d5a9e03
Remove duplicate AudioProcessor code and fix ExtractTTSpectrogram.ipynb ( #3230 )
...
* chore: remove unused argument
* refactor(audio.processor): remove duplicate stft+griffin_lim
* chore(audio.processor): remove unused compute_stft_paddings
Same function available in numpy_transforms
* refactor(audio.processor): remove duplicate db_to_amp
* refactor(audio.processor): remove duplicate amp_to_db
* refactor(audio.processor): remove duplicate linear_to_mel
* refactor(audio.processor): remove duplicate mel_to_linear
* refactor(audio.processor): remove duplicate build_mel_basis
* refactor(audio.processor): remove duplicate stft_parameters
* refactor(audio.processor): use pre-/deemphasis from numpy_transforms
* refactor(audio.processor): use rms_volume_norm from numpy_transforms
* chore(audio.processor): remove duplicate assert
Already checked in numpy_transforms.compute_f0
* refactor(audio.processor): use find_endpoint from numpy_transforms
* refactor(audio.processor): use trim_silence from numpy_transforms
* refactor(audio.processor): use volume_norm from numpy_transforms
* refactor(audio.processor): use load_wav from numpy_transforms
* fix(bin.extract_tts_spectrograms): set quantization bits
* fix(ExtractTTSpectrogram.ipynb): adapt to current TTS code
Fixes #2447 , #2574
* refactor(audio.processor): remove duplicate quantization methods
2023-11-16 10:57:06 +01:00
Matthew Boakes
1b9c400bca
PyTorch 2.1 Updates (Weight Norm and TorchAudio I/O) ( #3176 )
...
* Replaced PyTorch weight_norm With parametrizations.weight_norm
* TorchAudio: Migrating The I/O Functions To Use The Dispatcher Mechanism
* Corrected Code Style
---------
Co-authored-by: Eren Gölge <erogol@hotmail.com>
2023-11-09 16:31:03 +01:00
Eren Gölge
cc6e9fcaa7
Fix #3153 ( #3169 )
2023-11-08 11:13:58 +01:00
Rajiv P
c2d15cd413
[minor] hifigan_generator.py typo ( #2462 )
...
resblock2 description updated.
2023-03-28 12:43:36 +02:00
Eren Gölge
d309f50e53
Implement FreeVC ( #2451 )
...
* Update .gitignore
* Draft FreeVC implementation
* Tests and relevant updates
* Update API tests
* Add missings
* Update requirements
* :(
* Lazy handle for vc
* Update docs for voice conversion
* Make style
2023-03-25 18:33:23 +01:00
Eren Gölge
914280a556
Bump up to v0.11.0 ( #2329 )
...
* Make style
* Bump up to v0.11.0
2023-02-08 13:58:49 +01:00
Eren Gölge
8cb1433e6e
Cache fsspec downloads ( #2132 )
...
* Cache fsspec downloaded files
* Use diff paths for test
* Make fsspec caching optional
* Decom GPU docker tests
* Make progress bar optional for better CI log
* Check path local
2022-11-09 22:12:48 +01:00
Eren Gölge
b686c09704
Fix #2062
2022-11-07 09:22:43 +01:00
Stanislav Kachnov
2c9f00a808
Fix tune wavegrad ( #1844 )
...
* fix imports in tune_wavegrad
* load_config returns Coqpit object instead of None
* set action (store true) for flag "--use_cuda"; start to tune if module is running as the main program
* fix var order in the result of batch collating
* make style
* make style with black and isort
2022-08-22 09:55:32 +02:00
Eren Gölge
d46fbc240c
Introduce numpy and torch transforms ( #1705 )
...
* Refactor audio processing functions
* Add tests for numpy transforms
* Fix imports
* Fix imports2
2022-08-08 11:57:50 +02:00
vanIvan
5094499eba
Fix & update WaveRNN vocoder model ( #1749 )
...
* Fixes KeyError bug. Adding logging to dashboard.
* Make pep8 compliant
* Make style compliant
* Still fixing style
2022-07-26 15:05:11 +02:00
manmay nakhashi
577ec406f4
Fix checkpointing GAN models ( #1641 )
...
* checkpoint sae step crash fix
* checkpoint save step crash fix
* Update gan.py
updated requested changes
* crash fix
2022-06-22 12:07:46 +02:00
Eren Gölge
a0a9279e4b
Fix GAN optimizer order
...
commit 212d330929
Author: Edresson Casanova <edresson1@gmail.com>
Date: Fri Apr 29 16:29:44 2022 -0300
Fix unit test
commit 44456b0483
Author: Edresson Casanova <edresson1@gmail.com>
Date: Fri Apr 29 07:28:39 2022 -0300
Fix style
commit d545beadb9
Author: Edresson Casanova <edresson1@gmail.com>
Date: Thu Apr 28 17:08:04 2022 -0300
Change order of HiFi-GAN optimizers to match the original repository
commit 657c5442e5
Author: Edresson Casanova <edresson1@gmail.com>
Date: Thu Apr 28 15:40:16 2022 -0300
Remove audio padding before mel spec extraction
commit 76b274e690
Merge: 379ccd7b 6233f4fc
Author: Edresson Casanova <edresson1@gmail.com>
Date: Wed Apr 27 07:28:48 2022 -0300
Merge pull request #1541 from coqui-ai/comp_emb_fix
Bug fix in compute embedding without eval partition
commit 379ccd7ba6
Author: WeberJulian <julian.weber@hotmail.fr>
Date: Wed Apr 27 10:42:26 2022 +0200
returns y_mask in VITS inference (#1540 )
* returns y_mask
* make style
2022-05-07 13:29:11 +02:00
Edresson Casanova
60034674f9
Remove audio padding before mel spec extraction
2022-05-07 13:12:09 +02:00
Eren Gölge
72d85e53c9
Update model file extension ( #1422 )
...
* Update model file ext to `.pth`
* Update docs
* Rename more
* Find model files
2022-03-22 17:55:00 +01:00
Eren Gölge
1425a023fe
Make style and lint
2022-03-02 13:25:35 +01:00
Eren Gölge
1e414b3a09
Make style
2022-02-25 11:31:56 +01:00
Eren Gölge
e0f9be76c0
Update test_run in wavernn and wavegrad
2022-02-25 11:31:56 +01:00
Eren Gölge
83c5ddc5b7
Update imports
2022-02-25 11:31:56 +01:00
Eren Gölge
424d04e4f6
Make style
2022-02-25 11:31:56 +01:00
Eren Gölge
833de62e30
Update base_vocoder
2022-02-25 11:28:14 +01:00
Eren Gölge
fc3b6d2861
Update gan
2022-02-25 11:28:14 +01:00
Eren Gölge
20a677c623
Update test_run in wavernn and wavegrad
2022-02-25 11:28:14 +01:00
Eren Gölge
1f0c8179da
Make style
2022-02-25 11:26:59 +01:00
Eren Gölge
7058fcc3ff
Take file extension as an argument
2022-02-25 11:26:59 +01:00
Eren Gölge
13482dde1f
Update GAN model
2022-02-25 11:26:59 +01:00
Eren Gölge
c9972e6f14
Make lint
2022-02-25 11:07:34 +01:00
Eren Gölge
30cfafce56
Add init_from_config
2022-02-25 11:05:54 +01:00
Eren Gölge
a71a013276
Fix the wrong default loss name for GAN models
2022-02-25 10:48:03 +01:00
Eren Gölge
3b63d713b9
Fix espeak wrapper cmd call
2022-02-25 10:48:03 +01:00
Eren Gölge
acc6eef625
Update for tokenizer API
2022-02-25 10:48:02 +01:00
Eren Gölge
e1b4c4ca43
Add init_from_config to GAN
2022-02-25 10:48:02 +01:00
Eren Gölge
353f913efc
Fix #985
2022-02-25 10:48:02 +01:00
Eren Gölge
127118c637
Update TTS.tts formatters ( #1228 )
...
* Return Dict from tts formatters
* Make style
2022-02-11 23:03:43 +01:00
Eren Gölge
5e3f499a69
Fix #1187 ( #1227 )
2022-02-11 13:27:59 +01:00
Edresson Casanova
0860d73cf8
Remove Tensorflow requirement ( #1225 )
...
* Remove TF modules
* Remove TF unit tests
* Remove TF vocoder modules
* Remove TF convert scripts
* Remove TF requirement
* Remove the Docs TF instructions
* Remove TF inference support
2022-02-10 16:14:54 +01:00
Eren Gölge
6189fdfaea
Fix Training HiFiGan -- avg loss not decreasing #1003
2021-12-30 10:48:55 +00:00
Eren Gölge
275c759993
Fix #1037
2021-12-23 15:57:10 +00:00
Eren Gölge
ce45d9e1af
Make style and lint
2021-12-01 10:42:52 +00:00
Eren Gölge
40cb8ac966
Fix #958
2021-12-01 10:33:34 +00:00
Eren Gölge
e15bc157d8
Fix #873
2021-10-14 14:39:45 +00:00
Eren Gölge
21cc0517a3
Fix WaveRNN test
2021-10-01 10:21:37 +00:00
Eren Gölge
37959ad0c7
Make linter
2021-09-30 23:02:16 +00:00
Eren Gölge
7edbe04fe0
Fix WaveRNN config and test
2021-09-30 16:20:12 +00:00
Eren Gölge
9f23ad6a0f
Fix imports
2021-09-30 14:47:56 +00:00
Eren Gölge
e27feade38
Fixup wavernn
2021-09-30 14:47:56 +00:00
Eren Gölge
4f94f91305
Update WaveRNN
2021-09-30 14:47:56 +00:00
Eren Gölge
3d5205d66f
Update WaveGrad
2021-09-30 14:47:56 +00:00
Eren Gölge
4baecdf92a
Update GAN for Trainer_v2
2021-09-30 14:47:56 +00:00