coqui-tts

Commit Graph

Author	SHA1	Message	Date
Enno Hermann	7cdfde226b	refactor: move amp_to_db/db_to_amp into torch_transforms	2024-11-23 01:04:17 +01:00
Enno Hermann	2df9bfa78e	refactor: handle deprecation of torch.cuda.amp.autocast (#144) torch.cuda.amp.autocast(args...) and torch.cpu.amp.autocast(args...) will be deprecated. Please use torch.autocast("cuda", args...) or torch.autocast("cpu", args...) instead. https://pytorch.org/docs/stable/amp.html	2024-11-09 18:37:08 +01:00
Enno Hermann	da82d55329	refactor: use load_fsspec from trainer Made automatically with: rg "from TTS.utils.io import load_fsspec" --files-with-matches | xargs sed -i 's/from TTS.utils.io import load_fsspec/from trainer.io import load_fsspec/g'	2024-06-29 15:07:10 +02:00
Enno Hermann	c30fb0f56b	chore: remove duplicate init_weights	2024-06-26 11:46:37 +02:00
Enno Hermann	b711e19cb6	refactor: remove verbose arguments Can be handled by adjusting logging levels instead.	2024-04-03 15:19:45 +02:00
Enno Hermann	b6ab85a050	fix: use logging instead of print statements Fixes #1691	2024-04-03 15:19:45 +02:00
Enno Hermann	e95f8950eb	fix: torch.stft will soon require return_complex=True Refactor that removes the deprecation warning: torch.view_as_real(torch.stft(, return_complex=True)) is equal to torch.stft(, return_complex=False) https://pytorch.org/docs/stable/generated/torch.stft.html	2024-03-13 12:06:27 +01:00
Aarni Koskela	33a7c722f6	Merge duplicate on_train_step_start functions in delightful_tts	2023-09-27 01:10:44 +03:00
Aarni Koskela	861c68b0b8	Rename misnamed setter	2023-09-27 01:09:59 +03:00
Eren Gölge	69f080eb47	Fix DelightfulTTS (#2823) * Fix tests * Make style	2023-07-31 13:52:45 +02:00
logan hart	6fdb88f8e2	Add Delightful-TTS implementation (#2095) * add configs * Update config file * Add model configs * Add model layers * Add layer files * Add layer modules * change config names * Add emotion manager * fIX missing ap bug * Fix missing ap bug * Add base TTS e2e class * Fix wrong variable name in load_tts_samples * Add training script * Remove range predictor and gaussian upsampling * Add helper function * Add vctk recipe * Add conformer docs * Fix linting in conformer.py * Add Docs * remove duplicate import * refactor args * Fix bugs * Removew emotion embedding * remove unused arg * Remove emotion embedding arg * Remove emotion embedding arg * fix style issues * Fix bugs * Fix bugs * Add unittests * make style * fix formatter bug * fix test * Add pyworld compute pitch func * Update requirments.txt * Fix dataset Bug * Chnge layer norm to instance norm * Add missing import * Remove emotions.py * remove ssim loss * Add init layers func to aligner * refactor model layers * remove audio_config arg * Rename loss func * Rename to delightful-tts * Rename loss func * Remove unused modules * refactor imports * replace audio config with audio processor * Add change sample rate option * remove broken resample func * update recipe * fix style, add config docs * fix tests and multispeaker embd dim * remove pyworld * Make style and fix inference * Split tts tests * Fixup * Fixup * Fixup * Add argument names * Set "random" speaker in the model Tortoise/Bark * Use a diff f0_cache path for delightfull tts * Fix delightful speaker handling * Fix lint * Make style --------- Co-authored-by: loganhart420 <loganartpersonal@gmail.com> Co-authored-by: Eren Gölge <erogol@hotmail.com>	2023-07-24 13:41:26 +02:00

11 Commits
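
Note on b6ab85a050 and b711e19cb6: these two commits replace print statements with logging and drop the verbose arguments in favour of logging levels. The sketch below illustrates that general pattern using only the Python standard library. It is not taken from the repository: the logger name `tts_example` and the functions `load_model` and `set_verbosity` are hypothetical names chosen for illustration.

```python
import logging

# Hypothetical module-level logger; the real project may name its loggers differently.
logger = logging.getLogger("tts_example")


def load_model(name: str) -> str:
    """Hypothetical loader that reports progress through logging, not print()."""
    # Previously this might have been: if verbose: print(f"Loading model: {name}")
    logger.info("Loading model: %s", name)
    logger.debug("Extra detail for %s, shown only at DEBUG level", name)
    return name


def set_verbosity(level: int) -> None:
    """Stand-in for a removed verbose=... argument: adjust the logger level."""
    logger.setLevel(level)


if __name__ == "__main__":
    logging.basicConfig(format="%(levelname)s %(name)s: %(message)s")
    set_verbosity(logging.WARNING)  # quiet: neither message is emitted
    load_model("quiet-run")
    set_verbosity(logging.INFO)     # the INFO message is emitted, DEBUG is not
    load_model("verbose-run")
```

With this arrangement callers choose how much output they want by setting the level on the logger (or on a parent logger) instead of threading a verbose flag through every constructor.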