coqui-tts

Commit Graph

Author	SHA1	Message	Date
Edresson Casanova	11283fce07	Ensures that only GPT model is in training mode during XTTS GPT training (#3241 ) * Ensures that only GPT model is in training mode during training * Fix parallel wavegan unit test	2023-11-17 15:13:46 +01:00
Eren G??lge	44880f09ed	Make style	2023-11-17 13:43:34 +01:00
Eren G??lge	26efdf6ee7	Make k_diffusion optional	2023-11-17 13:42:33 +01:00
Julian Weber	fbc18b8c34	Fix zh bug (#3238 )	2023-11-16 17:51:37 +01:00
Julian Weber	675f983550	Add sentence splitting (#3227 ) * Add sentence spliting * update requirements * update default args v2 * Add spanish * Fix return gpt_latents * Update requirements * Fix requirements	2023-11-16 11:01:11 +01:00
Edresson Casanova	73a5bd08c0	Fix XTTS GPT padding and inference issues (#3216 ) * Fix end artifact for fine tuning models * Bug fix on zh-cn inference * Remove ununsed code	2023-11-15 14:02:05 +01:00
Julian Weber	04901fb2e4	Add speed control for inference (#3214 ) * Add speed control for inference * Fix XTTS tests * Add speed control tests	2023-11-14 16:07:17 +01:00
Eren Gölge	ac3df409a6	Merge pull request #3208 from coqui-ai/fix_max_mel_len fix max generation length for XTTS	2023-11-13 14:32:56 +01:00
Eren G??lge	92fa988aec	Fixup	2023-11-13 13:44:06 +01:00
WeberJulian	b85536b23f	fix max generation length	2023-11-13 13:18:45 +01:00
Eren G??lge	b2682d39c5	Make style	2023-11-13 13:01:01 +01:00
Eren G??lge	a16360af85	Implement chunking gpt_cond	2023-11-13 13:00:08 +01:00
Enno Hermann	3b1e7038bc	fix(formatters): set missing root_path attribute (#3182 ) Fixes #2778	2023-11-09 16:49:52 +01:00
Aarni Koskela	a8e9163fb3	xtts/tokenizer: merge duplicate implementations of preprocess_text (#3170 ) This was found via ruff: > F811 Redefinition of unused `preprocess_text` from line 570	2023-11-09 16:32:12 +01:00
Matthew Boakes	1b9c400bca	PyTorch 2.1 Updates (Weight Norm and TorchAudio I/O) (#3176 ) * Replaced PyTorch weight_norm With parametrizations.weight_norm * TorchAudio: Migrating The I/O Functions To Use The Dispatcher Mechanism * Corrected Code Style --------- Co-authored-by: Eren Gölge <erogol@hotmail.com>	2023-11-09 16:31:03 +01:00
Gorkem	66a1e248d0	torchaudio should use proper backend to load audio (#3179 )	2023-11-09 16:28:39 +01:00
Julian Weber	03ad90135b	Add lang code in XTTS doc (#3158 ) * Add lang code in XTTS doc * Remove ununsed config and args * update docs * woops	2023-11-08 13:47:33 +01:00
Gorkem	78a596618a	Fix for exception on streaming if last chunk empty (#3160 )	2023-11-08 11:32:02 +01:00
Julian Weber	ce1a39a9a4	Add char limit warn (#3130 ) * Add char limit warning * Adding v2 langs * cached_property for cutlet * Fix import	2023-11-08 10:24:23 +01:00
Edresson Casanova	5f9ab6cfaa	Fix style Co-authored-by: Aarni Koskela <akx@iki.fi>	2023-11-06 19:22:34 -03:00
Edresson Casanova	09fb317e6d	Remove unused code	2023-11-06 17:36:32 -03:00
Edresson Casanova	b146de4ce8	Bug fix on XTTS v2.0 Trainer	2023-11-06 20:26:01 +01:00
Edresson Casanova	1b6f8d0e46	Update unit tests and recipes	2023-11-06 20:25:06 +01:00
Edresson Casanova	72b2bac0f8	Load reference in 24khz to avoid issued with multiple sr references	2023-11-06 20:25:06 +01:00
Edresson Casanova	00294ffdf6	Update XTTS docs	2023-11-06 20:24:06 +01:00
Edresson Casanova	459ad70dc8	Add support for multiples speaker references on XTTS inference	2023-11-06 20:22:35 +01:00
Eren Gölge	f0cb19ecca	Drop diffusion from XTTS (#3150 ) * Drop diffusion for XTTS * Make style * Drop diffusion deps in code * Restore thrashed	2023-11-06 20:15:49 +01:00
Eren G??lge	5d418bb84a	Update docs	2023-11-06 18:48:41 +01:00
Eren G??lge	9bbf6eb8dd	Drop use_ne_hifigan	2023-11-06 18:43:38 +01:00
Eren G??lge	9d54bd7655	Fixup XTTS	2023-11-06 18:13:58 +01:00
Edresson Casanova	e45227d9ff	XTTS v2.0 (#3137 ) * Implement most similar ref training approach * Use non-enhanced hifigan for test samples * Add Perceiver * Update GPT Trainer for perceiver support * Update XTTS docs * Bug fix masking with XTTS perceiver * Bug fix on gpt forward * Bug Fix on XTTS v2.0 training * Add XTTS v2.0 unit tests * Add XTTS v2.0 inference unit tests * Bug Fix on diffusion inference * Add XTTS v2.0 training recipe * Placeholder model entry * Add cloning params to config * Make prompt embedding configurable * Make cloning configurable * Cheap fix for a cheaper fix * Prevent resampling * Update model entry * Update docs * Update requirements * Code linting * Add xtts v2 to sep tests * Bug fix on XTTS get_gpt_cond_latents * Bug fix on rebase * Make style * Bug fix in Japenese tokenizer * Add num2words to deps * Remove unused kwarg and added num_beams=1 as default --------- Co-authored-by: Eren G??lge <egolge@coqui.ai>	2023-11-06 14:58:18 +01:00
Aarni Koskela	38f6f8f0bb	Run `make style` & re-enable it in CI (#3127 )	2023-11-06 11:36:37 +01:00
WeberJulian	1c98821359	Remove unused load_audio function	2023-10-27 22:27:18 +02:00
WeberJulian	d4e08c8d6c	Add features to get_conditioning_latents	2023-10-26 14:57:33 +02:00
WeberJulian	c1133724a1	Move lang token add to tokenizer	2023-10-26 14:52:13 +02:00
WeberJulian	6fa46d197d	Fix get_conditioning_latents when using only ne	2023-10-26 14:51:35 +02:00
Edresson Casanova	01839af926	Bug fix on XTTS masking training	2023-10-24 18:30:14 -03:00
Edresson Casanova	0f96abb5ec	Add FT inference example on XTTS docs	2023-10-23 13:23:30 -03:00
Edresson Casanova	37b7945474	Update XTTS train not implemented error to point to the XTTS docs	2023-10-23 11:39:17 -03:00
Edresson Casanova	ec7f54768a	Rebase bug fix and update recipe	2023-10-21 17:37:51 -03:00
Edresson Casanova	affaf11148	Add XTTS training unit test	2023-10-21 13:41:12 -03:00
Edresson Casanova	1f92741d6a	Fix issue #2971	2023-10-21 13:37:21 -03:00
Edresson Casanova	5f98dbeec9	Update Ljspeech XTTS recipe	2023-10-21 13:37:21 -03:00
Edresson Casanova	9e3598c3b7	Bug Fix on inference using XTTS trainer checkpoint	2023-10-21 13:37:21 -03:00
Edresson Casanova	c4ceaabe2c	Add test sentences during the training	2023-10-21 13:33:56 -03:00
Edresson Casanova	2f868dd5c2	Bug fix on reproducible evaluation	2023-10-21 13:33:56 -03:00
Edresson Casanova	bafab049c2	Add prompting masking	2023-10-21 13:33:56 -03:00
Edresson Casanova	47d613df3a	Add reproducible evaluation	2023-10-21 13:33:56 -03:00
Edresson Casanova	40a4e631ea	Update mel spectrogram for the style encoder	2023-10-21 13:33:56 -03:00
Edresson Casanova	a32961bcb4	Add XTTS base training code	2023-10-21 13:33:56 -03:00
Julian Weber	dad6a7b0b6	Preserve [ja] token of the text processing	2023-10-21 11:26:03 +02:00
Julian Weber	c7a16042e3	Remove global cutlet import	2023-10-21 11:18:58 +02:00
Edresson Casanova	59576fc0ec	Bug fix on XTTS v1.1 inference (#3093 ) * Bug fix on XTTS v1.1 inference * Update .models.json --------- Co-authored-by: Julian Weber <julian.weber@hotmail.fr>	2023-10-20 17:29:43 -03:00
Julian Weber	cf97116185	XTTS v1.1 (#3089 ) * Add support for ne_hifigan * Update model.json * Update hash * Fix model loading * Enhance text_normalization * Add xtts to zoo test exception * Add model hash check * Add get_number_tokens	2023-10-20 16:02:08 +02:00
Aya Jafari	ffddf10458	unit test fix	2023-10-13 10:56:47 -03:00
Aya Jafari	6eaecab0ca	fixed bugs in fastpitch tts synthesis	2023-10-10 23:02:31 -03:00
Julian Weber	e5e0cbffc9	Streaming inference for XTTS 🚀 (#3035 )	2023-10-06 18:34:06 +02:00
Edresson Casanova	4c3c11c958	Tortoise inference fix and fix zoo unit tests (#3010 )	2023-09-29 13:40:57 +02:00
Aarni Koskela	33a7c722f6	Merge duplicate on_train_step_start functions in delightful_tts	2023-09-27 01:10:44 +03:00
Aarni Koskela	861c68b0b8	Rename misnamed setter	2023-09-27 01:09:59 +03:00
Aarni Koskela	09e14e68db	Remove duplicate get_named_beta_schedules	2023-09-27 01:09:59 +03:00
Aarni Koskela	59f85a7122	Remove duplicate code from xtts.tokenizer	2023-09-27 01:09:59 +03:00
loupzeur	da8b6bbce1	fix: xtts not taking into account device flag (#2951 ) * fix: xtts not taking into account device flag * Style changes --------- Co-authored-by: Julian Weber <julian.weber@hotmail.fr>	2023-09-20 09:57:02 +02:00
Eren Gölge	4033db5f4b	🔥 XTTS implementation	2023-09-13 17:51:24 +02:00
Edresson Casanova	4d3f23b5d3	Add CML-TTS dataset YourTTS training recipe (#2934 )	2023-09-12 11:49:14 +02:00
Aleś Bułojčyk	fead04f779	Add phonemizer for Belarusian language (#2856 )	2023-08-28 11:20:45 +02:00
Eren Gölge	a7a96d08dd	Fix loading Bark (#2893 ) * Fixup hubert path * Make style	2023-08-26 11:59:00 +02:00
Jake Tae	409db505d2	Add device support in TTS and Synthesizer (#2855 ) * fix: resolve merge conflicts * fix: retain backwards compatability in functions * feature: utilize device for voice transfer * feature: use device for vocoder * chore: cleanup vocoder cpu logic * fix: add necessary vocoder output device check * fix: add necessary vocoder output device check * fix: indentation * fix: check if waveform is pt tensor before cpu conversion --------- Co-authored-by: Jake Tae <jaketae@Jakes-MacBook-Pro-2.local>	2023-08-14 21:04:44 +02:00
Eren Gölge	3a104d5c49	Update Studio API for XTTS (#2861 ) * Update Studio API for XTTS * Update the docs * Update README.md * Update README.md Update README	2023-08-13 12:04:12 +02:00
Eren G??lge	37b558ccb9	Make style	2023-08-11 12:55:23 +02:00
Eren G??lge	9a8352b8da	Fix import error with Bark	2023-08-11 03:33:59 +02:00
Eren Gölge	4186f42b21	Handle missing JA phonemizer (#2843 ) * Handle missing JA phonemizer * Make style	2023-08-07 13:19:38 +02:00
Javier	4e7f8cd021	Add fairseq onnx support and strict configuration, fixes some onnx errors (#2831 )	2023-08-04 11:02:59 +02:00
Eren Gölge	69f080eb47	Fix DelightfulTTS (#2823 ) * Fix tests * Make style	2023-07-31 13:52:45 +02:00
Eren Gölge	483888b9d8	Add kwargs to ignore extra arguments w/o error (#2822 )	2023-07-31 11:37:35 +02:00
Aleś Bułojčyk	d124f78430	Recipe for Belarusian TTS (#2756 ) * Changes from jhlfrfufyfn <jhlfrfufyfn@gmail.com> * Recipe for Belarusian TTS --------- Co-authored-by: jhlfrfufyfn <jhlfrfufyfn@gmail.com>	2023-07-31 10:26:21 +02:00
Javier	c140df5a58	Adds multi-language support for VITS onnx, fixes onnx inference error when speaker_id is None or not passed, fixes onnx exporting for models with init_discriminator=false (#2816 )	2023-07-31 10:19:49 +02:00
Eren Gölge	8aacb81849	Fix Tortoise load (#2791 ) * Remove key prunning in tortoise * Make lint	2023-07-24 13:42:47 +02:00
logan hart	6fdb88f8e2	Add Delightful-TTS implementation (#2095 ) * add configs * Update config file * Add model configs * Add model layers * Add layer files * Add layer modules * change config names * Add emotion manager * fIX missing ap bug * Fix missing ap bug * Add base TTS e2e class * Fix wrong variable name in load_tts_samples * Add training script * Remove range predictor and gaussian upsampling * Add helper function * Add vctk recipe * Add conformer docs * Fix linting in conformer.py * Add Docs * remove duplicate import * refactor args * Fix bugs * Removew emotion embedding * remove unused arg * Remove emotion embedding arg * Remove emotion embedding arg * fix style issues * Fix bugs * Fix bugs * Add unittests * make style * fix formatter bug * fix test * Add pyworld compute pitch func * Update requirments.txt * Fix dataset Bug * Chnge layer norm to instance norm * Add missing import * Remove emotions.py * remove ssim loss * Add init layers func to aligner * refactor model layers * remove audio_config arg * Rename loss func * Rename to delightful-tts * Rename loss func * Remove unused modules * refactor imports * replace audio config with audio processor * Add change sample rate option * remove broken resample func * update recipe * fix style, add config docs * fix tests and multispeaker embd dim * remove pyworld * Make style and fix inference * Split tts tests * Fixup * Fixup * Fixup * Add argument names * Set "random" speaker in the model Tortoise/Bark * Use a diff f0_cache path for delightfull tts * Fix delightful speaker handling * Fix lint * Make style --------- Co-authored-by: loganhart420 <loganartpersonal@gmail.com> Co-authored-by: Eren Gölge <erogol@hotmail.com>	2023-07-24 13:41:26 +02:00
Eren Gölge	0de12ec5aa	API tests (#2790 ) * Separate API tests and only run when uplifted * Make style	2023-07-24 12:14:21 +02:00
Paul O'Leary McCann	c0aabb8596	Make Japanese-specific dependencies optional (#2776 ) * Don't install MeCab by default * Add optional [ja] deps, like [dev] etc * Add JA requirements file * Add JA requirements to requirements_all This should help the tests run.	2023-07-24 11:28:27 +02:00
Eren Gölge	672ec3b35e	Fix #2749 (#2750 )	2023-07-08 11:40:44 +02:00
Eren Gölge	a2984fb435	Fix #2745 (#2748 )	2023-07-07 20:23:27 +02:00
Eren Gölge	7b5c8422c8	Export multispeaker onnx (#2743 )	2023-07-06 13:36:50 +02:00
ZhouGongZaiShi	d5f16d77c2	delete meaningless print() (#2662 )	2023-07-04 11:38:17 +02:00
Eren G??lge	cb9c320691	Fixup	2023-06-30 14:13:11 +02:00
Eren G??lge	91cc11d636	Remove commented codes	2023-06-28 12:14:37 +02:00
Eren G??lge	6b9ebf5aab	Merge branch 'p3_11' into dev	2023-06-28 12:13:04 +02:00
Eren Gölge	c844b6570a	Inference API for 🐶Bark (#2685 ) * Add bark requirements * Draft Bark implementation * Download HF models * Update synthesizer * Add bark model * Make style * Update pylintrc * Update model URLs * Update Bark Config * Fix here and ther * Make style * Make lint * Update requirements * Update requirements	2023-06-28 11:55:27 +02:00
Eren G??lge	a13b1352a4	Fixup	2023-06-26 19:30:26 +02:00
Eren G??lge	17ac188958	Drop fairseq for Hubert	2023-06-26 19:27:48 +02:00
Eren G??lge	c03768bb53	Make style	2023-06-26 17:16:26 +02:00
Eren G??lge	a1c431e6a9	Fixups	2023-06-26 12:55:18 +02:00
Eren Gölge	fff8b762bc	Merge branch 'dev' into bark	2023-06-21 15:49:05 +02:00
Eren Gölge	4cf8652392	Fix Tortoise load (#2697 ) * Handle missing gpt weights * Make style * Fix lint	2023-06-21 15:42:01 +02:00
Eren G??lge	cf98ae04df	Make lint	2023-06-21 12:05:08 +02:00
Eren G??lge	3b9fca2398	Make style	2023-06-21 12:02:06 +02:00
Eren G??lge	0f8932a6a9	Fix here and ther	2023-06-21 11:59:27 +02:00
Eren G??lge	03c347b7f3	Update Bark Config	2023-06-21 11:58:18 +02:00
Eren G??lge	f4c88ed677	Make style	2023-06-19 14:22:32 +02:00

1 2 3 4 5 ...

992 Commits