coqui-tts

Commit Graph

Author	SHA1	Message	Date
Enno Hermann	2d06aeb79b	chore: remove unused TTS.utils.io module All uses of these methods were replaced with the equivalents from coqui-tts-trainer	2024-06-29 15:07:10 +02:00
Enno Hermann	da82d55329	refactor: use load_fsspec from trainer Made automatically with: rg "from TTS.utils.io import load_fsspec" --files-with-matches \| xargs sed -i 's/from TTS.utils.io import load_fsspec/from trainer.io import load_fsspec/g'	2024-06-29 15:07:10 +02:00
Enno Hermann	4bd3df2607	refactor: remove duplicate get_padding	2024-06-26 11:54:36 +02:00
Enno Hermann	4bc0e75a08	build: add numpy2 support Identified necessary code changes with the NPY201 ruff rule. Gruut is the only dependency that doesn't support numpy2 yet. NB: At build time numpy>=2.0.0 should be required to be able to build wheels compatible with both numpy1+2: https://numpy.org/devdocs/dev/depending_on_numpy.html#numpy-2-abi-handling	2024-06-16 22:10:33 +02:00
Enno Hermann	b711e19cb6	refactor: remove verbose arguments Can be handled by adjusting logging levels instead.	2024-04-03 15:19:45 +02:00
Enno Hermann	b6ab85a050	fix: use logging instead of print statements Fixes #1691	2024-04-03 15:19:45 +02:00
Aarni Koskela	08fa5d4098	Fix implicitly concatenated docstring	2023-12-13 14:56:41 +02:00
Aarni Koskela	449820ec7d	Ruff autofix E71*	2023-12-13 14:56:41 +02:00
Edresson Casanova	11283fce07	Ensures that only GPT model is in training mode during XTTS GPT training (#3241 ) * Ensures that only GPT model is in training mode during training * Fix parallel wavegan unit test	2023-11-17 15:13:46 +01:00
Enno Hermann	3c2d5a9e03	Remove duplicate AudioProcessor code and fix ExtractTTSpectrogram.ipynb (#3230 ) * chore: remove unused argument * refactor(audio.processor): remove duplicate stft+griffin_lim * chore(audio.processor): remove unused compute_stft_paddings Same function available in numpy_transforms * refactor(audio.processor): remove duplicate db_to_amp * refactor(audio.processor): remove duplicate amp_to_db * refactor(audio.processor): remove duplicate linear_to_mel * refactor(audio.processor): remove duplicate mel_to_linear * refactor(audio.processor): remove duplicate build_mel_basis * refactor(audio.processor): remove duplicate stft_parameters * refactor(audio.processor): use pre-/deemphasis from numpy_transforms * refactor(audio.processor): use rms_volume_norm from numpy_transforms * chore(audio.processor): remove duplicate assert Already checked in numpy_transforms.compute_f0 * refactor(audio.processor): use find_endpoint from numpy_transforms * refactor(audio.processor): use trim_silence from numpy_transforms * refactor(audio.processor): use volume_norm from numpy_transforms * refactor(audio.processor): use load_wav from numpy_transforms * fix(bin.extract_tts_spectrograms): set quantization bits * fix(ExtractTTSpectrogram.ipynb): adapt to current TTS code Fixes #2447, #2574 * refactor(audio.processor): remove duplicate quantization methods	2023-11-16 10:57:06 +01:00
Matthew Boakes	1b9c400bca	PyTorch 2.1 Updates (Weight Norm and TorchAudio I/O) (#3176 ) * Replaced PyTorch weight_norm With parametrizations.weight_norm * TorchAudio: Migrating The I/O Functions To Use The Dispatcher Mechanism * Corrected Code Style --------- Co-authored-by: Eren Gölge <erogol@hotmail.com>	2023-11-09 16:31:03 +01:00
Eren Gölge	cc6e9fcaa7	Fix #3153 (#3169 )	2023-11-08 11:13:58 +01:00
Rajiv P	c2d15cd413	[minor] hifigan_generator.py typo (#2462 ) resblock2 description updated.	2023-03-28 12:43:36 +02:00
Eren Gölge	d309f50e53	Implement FreeVC (#2451 ) * Update .gitignore * Draft FreeVC implementation * Tests and relevant updates * Update API tests * Add missings * Update requirements * :( * Lazy handle for vc * Update docs for voice conversion * Make style	2023-03-25 18:33:23 +01:00
Eren Gölge	914280a556	Bump up to v0.11.0 (#2329 ) * Make style * Bump up to v0.11.0	2023-02-08 13:58:49 +01:00
Eren Gölge	8cb1433e6e	Cache fsspec downloads (#2132 ) * Cache fsspec downloaded files * Use diff paths for test * Make fsspec caching optional * Decom GPU docker tests * Make progress bar optional for better CI log * Check path local	2022-11-09 22:12:48 +01:00
Eren G??lge	b686c09704	Fix #2062	2022-11-07 09:22:43 +01:00
Stanislav Kachnov	2c9f00a808	Fix tune wavegrad (#1844 ) * fix imports in tune_wavegrad * load_config returns Coqpit object instead None * set action (store true) for flag "--use_cuda"; start to tune if module is running as the main program * fix var order in the result of batch collating * make style * make style with black and isort	2022-08-22 09:55:32 +02:00
Eren Gölge	d46fbc240c	Introduce numpy and torch transforms (#1705 ) * Refactor audio processing functions * Add tests for numpy transforms * Fix imports * Fix imports2	2022-08-08 11:57:50 +02:00
vanIvan	5094499eba	Fix & update WaveRNN vocoder model (#1749 ) * Fixes KeyError bug. Adding logging to dashboard. * Make pep8 compliant * Make style compliant * Still fixing style	2022-07-26 15:05:11 +02:00
manmay nakhashi	577ec406f4	Fix checkpointing GAN models (#1641 ) * checkpoint sae step crash fix * checkpoint save step crash fix * Update gan.py updated requested changes * crash fix	2022-06-22 12:07:46 +02:00
Eren Gölge	a0a9279e4b	Fix GAN optimizer order commit `212d330929` Author: Edresson Casanova <edresson1@gmail.com> Date: Fri Apr 29 16:29:44 2022 -0300 Fix unit test commit `44456b0483` Author: Edresson Casanova <edresson1@gmail.com> Date: Fri Apr 29 07:28:39 2022 -0300 Fix style commit `d545beadb9` Author: Edresson Casanova <edresson1@gmail.com> Date: Thu Apr 28 17:08:04 2022 -0300 Change order of HIFI-GAN optimizers to be equal than the original repository commit `657c5442e5` Author: Edresson Casanova <edresson1@gmail.com> Date: Thu Apr 28 15:40:16 2022 -0300 Remove audio padding before mel spec extraction commit `76b274e690` Merge: `379ccd7b` `6233f4fc` Author: Edresson Casanova <edresson1@gmail.com> Date: Wed Apr 27 07:28:48 2022 -0300 Merge pull request #1541 from coqui-ai/comp_emb_fix Bug fix in compute embedding without eval partition commit `379ccd7ba6` Author: WeberJulian <julian.weber@hotmail.fr> Date: Wed Apr 27 10:42:26 2022 +0200 returns y_mask in VITS inference (#1540) * returns y_mask * make style	2022-05-07 13:29:11 +02:00
Edresson Casanova	60034674f9	Remove audio padding before mel spec extraction	2022-05-07 13:12:09 +02:00
Eren Gölge	72d85e53c9	Update model file extension (#1422 ) * Update model file ext to ```.pth``` * Update docs * Rename more * Find model files	2022-03-22 17:55:00 +01:00
Eren Gölge	1425a023fe	Make style and lint	2022-03-02 13:25:35 +01:00
Eren Gölge	1e414b3a09	Make stlye	2022-02-25 11:31:56 +01:00
Eren Gölge	e0f9be76c0	Update test_run in wavernn and wavegrad	2022-02-25 11:31:56 +01:00
Eren Gölge	83c5ddc5b7	Update imports	2022-02-25 11:31:56 +01:00
Eren Gölge	424d04e4f6	Make stlye	2022-02-25 11:31:56 +01:00
Eren Gölge	833de62e30	Update base_vocoder	2022-02-25 11:28:14 +01:00
Eren Gölge	fc3b6d2861	Update gan	2022-02-25 11:28:14 +01:00
Eren Gölge	20a677c623	Update test_run in wavernn and wavegrad	2022-02-25 11:28:14 +01:00
Eren Gölge	1f0c8179da	Make style	2022-02-25 11:26:59 +01:00
Eren Gölge	7058fcc3ff	Take file extension as an argument	2022-02-25 11:26:59 +01:00
Eren Gölge	13482dde1f	Update GAN model	2022-02-25 11:26:59 +01:00
Eren Gölge	c9972e6f14	Make lint	2022-02-25 11:07:34 +01:00
Eren Gölge	30cfafce56	Add init_from_config	2022-02-25 11:05:54 +01:00
Eren Gölge	a71a013276	Fix the wrong default loss name for GAN models	2022-02-25 10:48:03 +01:00
Eren Gölge	3b63d713b9	Fix espeak wrapper cmd call	2022-02-25 10:48:03 +01:00
Eren Gölge	acc6eef625	Update for tokenizer API	2022-02-25 10:48:02 +01:00
Eren Gölge	e1b4c4ca43	Add init_from_config to GAN	2022-02-25 10:48:02 +01:00
Eren Gölge	353f913efc	Fix #985	2022-02-25 10:48:02 +01:00
Eren Gölge	127118c637	Update TTS.tts formatters (#1228 ) * Return Dict from tts formatters * Make style	2022-02-11 23:03:43 +01:00
Eren Gölge	5e3f499a69	Fix #1187 (#1227 )	2022-02-11 13:27:59 +01:00
Edresson Casanova	0860d73cf8	Remove Tensorflow requeriment (#1225 ) * Remove TF modules * Remove TF unit tests * Remove TF vocoder modules * Remove TF convert scripts * Remove TF requirement * Remove the Docs TF instructions * Remove TF inference support	2022-02-10 16:14:54 +01:00
Eren Gölge	6189fdfaea	Fix Training HiFiGan -- avg loss not decreasing #1003	2021-12-30 10:48:55 +00:00
Eren Gölge	275c759993	Fix #1037	2021-12-23 15:57:10 +00:00
Eren Gölge	ce45d9e1af	Make style and lint	2021-12-01 10:42:52 +00:00
Eren Gölge	40cb8ac966	Fix #958	2021-12-01 10:33:34 +00:00
Eren Gölge	e15bc157d8	Fix #873	2021-10-14 14:39:45 +00:00

1 2 3 4 5

201 Commits