coqui-tts

Commit Graph

Author	SHA1	Message	Date
Enno Hermann	849e75e967	docs: improve documentation	2024-12-12 18:23:17 +01:00
Enno Hermann	fe14ca6b68	refactor(xtts): remove duplicate xtts audio config	2024-12-05 15:46:28 +01:00
Enno Hermann	ce202532cf	fix(xtts): clearer error message when file given to checkpoint_dir	2024-12-02 16:54:11 +01:00
Enno Hermann	7330ad8854	refactor: move duplicate alignment functions into helpers	2024-11-24 19:57:14 +01:00
Enno Hermann	76df6421de	refactor: move more audio processing into torch_transforms	2024-11-24 19:57:14 +01:00
Enno Hermann	b1ac884e07	refactor: move shared function into dataset.py	2024-11-24 19:57:14 +01:00
Enno Hermann	6ecf47312c	refactor(xtts): use tortoise conditioning encoder	2024-11-24 19:57:14 +01:00
Enno Hermann	e63962c226	refactor(losses): move shared losses into losses.py	2024-11-23 01:04:17 +01:00
Enno Hermann	6f25c2b904	refactor(delightful_tts): remove unused classes	2024-11-23 01:04:17 +01:00
Enno Hermann	7cdfde226b	refactor: move amp_to_db/db_to_amp into torch_transforms	2024-11-23 01:04:17 +01:00
Enno Hermann	627bbe4150	fix(xtts): more helpful error message when vocab.json not found	2024-11-17 00:30:32 +01:00
Enno Hermann	2df9bfa78e	refactor: handle deprecation of torch.cuda.amp.autocast (#144) torch.cuda.amp.autocast(args...) and torch.cpu.amp.autocast(args...) will be deprecated. Please use torch.autocast("cuda", args...) or torch.autocast("cpu", args...) instead. https://pytorch.org/docs/stable/amp.html	2024-11-09 18:37:08 +01:00
Enno Hermann	0971bc236e	refactor: use external package for monotonic alignment	2024-11-08 12:35:04 +01:00
Enno Hermann	6314032fd7	Merge pull request #113 from idiap/pytorch fix: only enable load with weights_only in pytorch>=2.4	2024-11-04 22:14:42 +01:00
Enno Hermann	8e66be2c32	fix: only enable load with weights_only in pytorch>=2.4 Allows moving the minimum Pytorch version back to 2.1	2024-11-04 18:39:51 +01:00
Enno Hermann	964b813235	fix(gpt): set attention mask and address other warnings	2024-10-25 18:49:04 +02:00
Enno Hermann	659b4852ba	chore(bark): remove manual download of hubert model Bark was previously adapted to download Hubert from HuggingFace, so the manual download is superfluous.	2024-09-12 23:37:19 +02:00
shavit	17ca24c3d6	fix: load weights only in torch.load	2024-09-12 23:37:19 +02:00
Enno Hermann	20583a496e	Merge pull request #57 from idiap/xtts-vocab fix(xtts): load tokenizer file based on config as last resort	2024-07-25 13:26:28 +01:00
Enno Hermann	9192ef1aa6	fix(xtts): load tokenizer file based on config as last resort	2024-07-05 13:52:01 +02:00
Abraham Mathews	6ea3b75b84	Update xtts.py (#53) docs(xtts): fix typo in example	2024-07-02 13:43:52 +02:00
Enno Hermann	da82d55329	refactor: use load_fsspec from trainer Made automatically with: rg "from TTS.utils.io import load_fsspec" --files-with-matches | xargs sed -i 's/from TTS.utils.io import load_fsspec/from trainer.io import load_fsspec/g'	2024-06-29 15:07:10 +02:00
Enno Hermann	0fb26f97df	refactor: use get_user_data_dir from trainer	2024-06-29 15:07:10 +02:00
Enno Hermann	c30fb0f56b	chore: remove duplicate init_weights	2024-06-26 11:46:37 +02:00
Enno Hermann	cd7b6daf46	fix: clarify types, fix missing functions	2024-06-26 10:17:04 +02:00
Enno Hermann	03de4b889e	docs: fix readthedocs links [ci skip]	2024-06-13 22:48:34 +02:00
Enno Hermann	018f1e6453	docs(bark): update docstrings and type hints	2024-05-15 22:56:55 +02:00
Enno Hermann	59a6c9fdf2	fix(bark): add missing argument for load_voice() Fixes https://github.com/coqui-ai/TTS/issues/2795	2024-05-15 22:56:28 +02:00
Enno Hermann	b711e19cb6	refactor: remove verbose arguments Can be handled by adjusting logging levels instead.	2024-04-03 15:19:45 +02:00
Enno Hermann	b6ab85a050	fix: use logging instead of print statements Fixes #1691	2024-04-03 15:19:45 +02:00
Enno Hermann	d772724125	fix: update repository links, package names, metadata	2024-04-03 12:02:44 +02:00
Enno Hermann	e5c6da1c98	Merge pull request #20 from eginhard/return-complex fix: torch.stft will soon require return_complex=True	2024-03-13 13:50:21 +01:00
Enno Hermann	e95f8950eb	fix: torch.stft will soon require return_complex=True Refactor that removes the deprecation warning: torch.view_as_real(torch.stft(, return_complex=True)) is equal to torch.stft(, return_complex=False) https://pytorch.org/docs/stable/generated/torch.stft.html	2024-03-13 12:06:27 +01:00
Enno Hermann	89a061f1d1	docs(tts.models.vits): clarify use of discriminator/generator [ci skip]	2024-03-12 18:59:05 +01:00
Enno Hermann	efdafd5a7f	style: run black	2024-03-07 11:46:51 +01:00
Enno Hermann	24298da5fc	Merge pull request #1 from eginhard/lint-overhaul Lint overhaul (pylint to ruff)	2024-03-06 16:10:26 +01:00
Edresson Casanova	5dcc16d193	Bug fix in MP3 and FLAC compute length on TTSDataset (#3092) * Bug Fix on XTTS load * Bug fix in MP3 length on TTSDataset * Update TTS/tts/datasets/dataset.py Co-authored-by: Aarni Koskela <akx@iki.fi> * Uses mutagen for all audio formats * Add dataloader test wit hall supported audio formats * Use mutagen.File * Update * Fix aux unit tests * Bug fixe on unit tests --------- Co-authored-by: Aarni Koskela <akx@iki.fi>	2023-12-27 13:23:43 -03:00
Aarni Koskela	d6ea806469	Run `make style`	2023-12-13 14:56:41 +02:00
Aarni Koskela	aa549e9028	Fix trailing whitespace	2023-12-13 14:56:41 +02:00
Aarni Koskela	64bb41f4fa	Ruff autofix C41	2023-12-13 14:56:41 +02:00
Aarni Koskela	449820ec7d	Ruff autofix E71*	2023-12-13 14:56:41 +02:00
Aarni Koskela	90991e89b4	Ruff autofix unused imports and import order	2023-12-13 14:56:41 +02:00
Eren Gölge	8c1a8b522b	Merge pull request #3405 from coqui-ai/studio_speakers Add studio speakers to open source XTTS!	2023-12-12 16:10:09 +01:00
WeberJulian	5cd750ac7e	Fix API and CI	2023-12-11 20:21:53 +01:00
WeberJulian	e3c9dab7a3	Make CLI work	2023-12-11 18:49:18 +01:00
WeberJulian	36143fee26	Add basic speaker manager	2023-12-11 15:25:46 +01:00
Frederico S. Oliveira	163f9a3fdf	Merge branch 'coqui-ai:dev' into dev	2023-12-11 10:04:07 -03:00
Edresson Casanova	5f900f156a	Add XTTS Fine tuning gradio demo (#3296) * Add XTTS FT demo data processing pipeline * Add training and inference columns * Uses tabs instead of columns * Fix demo freezing issue * Update demo * Convert stereo to mono * Bug fix on XTTS inference * Update gradio demo * Update gradio demo * Update gradio demo * Update gradio demo * Add parameters to be able to set then on colab demo * Add erros messages * Add intuitive error messages * Update * Add max_audio_length parameter * Add XTTS fine-tuner docs * Update XTTS finetuner docs * Delete trainer to freeze memory * Delete unused variables * Add gc.collect() * Update xtts.md --------- Co-authored-by: Eren Gölge <erogol@hotmail.com>	2023-12-01 23:52:23 +01:00
Frederico S. Oliveira	bcd500fa7b	Fixing bug Correction in training the Fastspeech/Fastspeech2/FastPitch/SpeedySpeech model using external speaker embedding.	2023-11-30 17:27:05 -03:00
Eren Gölge	44880f09ed	Make style	2023-11-17 13:43:34 +01:00

433 Commits