coqui-tts

Commit Graph

Author	SHA1	Message	Date
Jindrich Matousek	aa9dbca939	Merge branch 'coqui-ai:main' into main	2023-10-23 14:52:33 +02:00
Edresson Casanova	59576fc0ec	Bug fix on XTTS v1.1 inference (#3093 ) * Bug fix on XTTS v1.1 inference * Update .models.json --------- Co-authored-by: Julian Weber <julian.weber@hotmail.fr>	2023-10-20 17:29:43 -03:00
Julian Weber	cf97116185	XTTS v1.1 (#3089 ) * Add support for ne_hifigan * Update model.json * Update hash * Fix model loading * Enhance text_normalization * Add xtts to zoo test exception * Add model hash check * Add get_number_tokens	2023-10-20 16:02:08 +02:00
Jindrich Matousek	a235ef3a0f	Merge branch 'coqui-ai:dev' into dev	2023-10-18 07:35:01 +02:00
Aya Jafari	ffddf10458	unit test fix	2023-10-13 10:56:47 -03:00
Aya Jafari	6eaecab0ca	fixed bugs in fastpitch tts synthesis	2023-10-10 23:02:31 -03:00
Jindrich Matousek	326d663be3	Merge branch 'coqui-ai:dev' into dev	2023-10-06 19:07:51 +02:00
Julian Weber	e5e0cbffc9	Streaming inference for XTTS 🚀 (#3035 )	2023-10-06 18:34:06 +02:00
Jindrich Matousek	1fc2dce098	Merge branch 'coqui-ai:dev' into dev	2023-09-30 11:21:39 +02:00
Aarni Koskela	33a7c722f6	Merge duplicate on_train_step_start functions in delightful_tts	2023-09-27 01:10:44 +03:00
Aarni Koskela	861c68b0b8	Rename misnamed setter	2023-09-27 01:09:59 +03:00
Jindrich Matousek	0a89a43a77	Merge branch 'main' into dev Merge changes by JMa into dev	2023-09-21 08:10:47 +02:00
loupzeur	da8b6bbce1	fix: xtts not taking into account device flag (#2951 ) * fix: xtts not taking into account device flag * Style changes --------- Co-authored-by: Julian Weber <julian.weber@hotmail.fr>	2023-09-20 09:57:02 +02:00
Jindrich Matousek	030a6f6395	Merge branch 'coqui-ai:main' into main	2023-09-14 15:39:33 +02:00
Eren Gölge	4033db5f4b	🔥 XTTS implementation	2023-09-13 17:51:24 +02:00
Jindrich Matousek	a0db2eeee8	Fix: add `is_eval` when calling `get_sampler` to respect training/validation	2023-09-06 13:59:24 +02:00
Jindrich Matousek	5504e13570	Merge branch 'coqui-ai:main' into main	2023-08-26 16:34:47 +02:00
Eren Gölge	a7a96d08dd	Fix loading Bark (#2893 ) * Fixup hubert path * Make style	2023-08-26 11:59:00 +02:00
Jindrich Matousek	4085a229fe	Merge remote-tracking branch 'upstream/main'	2023-08-21 17:22:20 +01:00
Eren Gölge	3a104d5c49	Update Studio API for XTTS (#2861 ) * Update Studio API for XTTS * Update the docs * Update README.md * Update README.md Update README	2023-08-13 12:04:12 +02:00
Eren G??lge	37b558ccb9	Make style	2023-08-11 12:55:23 +02:00
Jindrich Matousek	874143bf04	Add support for phone (char) based length scale Remove length_scale from default aux_input	2023-08-06 13:17:53 +02:00
Javier	4e7f8cd021	Add fairseq onnx support and strict configuration, fixes some onnx errors (#2831 )	2023-08-04 11:02:59 +02:00
Eren Gölge	69f080eb47	Fix DelightfulTTS (#2823 ) * Fix tests * Make style	2023-07-31 13:52:45 +02:00
Eren Gölge	483888b9d8	Add kwargs to ignore extra arguments w/o error (#2822 )	2023-07-31 11:37:35 +02:00
Javier	c140df5a58	Adds multi-language support for VITS onnx, fixes onnx inference error when speaker_id is None or not passed, fixes onnx exporting for models with init_discriminator=false (#2816 )	2023-07-31 10:19:49 +02:00
Eren Gölge	8aacb81849	Fix Tortoise load (#2791 ) * Remove key prunning in tortoise * Make lint	2023-07-24 13:42:47 +02:00
logan hart	6fdb88f8e2	Add Delightful-TTS implementation (#2095 ) * add configs * Update config file * Add model configs * Add model layers * Add layer files * Add layer modules * change config names * Add emotion manager * fIX missing ap bug * Fix missing ap bug * Add base TTS e2e class * Fix wrong variable name in load_tts_samples * Add training script * Remove range predictor and gaussian upsampling * Add helper function * Add vctk recipe * Add conformer docs * Fix linting in conformer.py * Add Docs * remove duplicate import * refactor args * Fix bugs * Removew emotion embedding * remove unused arg * Remove emotion embedding arg * Remove emotion embedding arg * fix style issues * Fix bugs * Fix bugs * Add unittests * make style * fix formatter bug * fix test * Add pyworld compute pitch func * Update requirments.txt * Fix dataset Bug * Chnge layer norm to instance norm * Add missing import * Remove emotions.py * remove ssim loss * Add init layers func to aligner * refactor model layers * remove audio_config arg * Rename loss func * Rename to delightful-tts * Rename loss func * Remove unused modules * refactor imports * replace audio config with audio processor * Add change sample rate option * remove broken resample func * update recipe * fix style, add config docs * fix tests and multispeaker embd dim * remove pyworld * Make style and fix inference * Split tts tests * Fixup * Fixup * Fixup * Add argument names * Set "random" speaker in the model Tortoise/Bark * Use a diff f0_cache path for delightfull tts * Fix delightful speaker handling * Fix lint * Make style --------- Co-authored-by: loganhart420 <loganartpersonal@gmail.com> Co-authored-by: Eren Gölge <erogol@hotmail.com>	2023-07-24 13:41:26 +02:00
Eren Gölge	a2984fb435	Fix #2745 (#2748 )	2023-07-07 20:23:27 +02:00
Eren Gölge	7b5c8422c8	Export multispeaker onnx (#2743 )	2023-07-06 13:36:50 +02:00
ZhouGongZaiShi	d5f16d77c2	delete meaningless print() (#2662 )	2023-07-04 11:38:17 +02:00
Jindrich Matousek	b761d488a7	Merge branch 'coqui-ai:main' into main	2023-07-03 08:45:07 +02:00
Eren G??lge	cb9c320691	Fixup	2023-06-30 14:13:11 +02:00
Eren Gölge	4cf8652392	Fix Tortoise load (#2697 ) * Handle missing gpt weights * Make style * Fix lint	2023-06-21 15:42:01 +02:00
Eren Gölge	e785d101a1	Port Fairseq TTS models (#2628 ) * Load fairseq models * Add docs and missing files * Managing fairseq models and docs for API * Make style * Use scarf URL * Add tests * Fix URL * Pass cpu * Make lint * Fixup * Make lint * fixup * Fixup * Change tokenization order * Update README * Fixup * Fixup	2023-06-05 11:15:13 +02:00
Eren Gölge	9e99e0f42d	Disable reduction	2023-05-18 11:12:51 +02:00
Jindrich Matousek	1476c5203f	Merge branch 'coqui-ai:main' into main	2023-05-16 15:47:14 +02:00
Eren Gölge	4de797bb11	Draft ONNX export for VITS (#2563 ) * Draft ONNX export for VITS Could not get it work to output variable length sequence * Fixup for onnx constant output * Make style * Remove commented code	2023-05-16 01:07:56 +02:00
manmay nakhashi	a3d5801c44	Tortoise TTS inference (#2547 ) * initial commit * Tortoise inference * revert path change * style fix * remove accidental remove * style fixes * style fixes * removed unwanted assests and deps * remove changes * remove cvvp * style fix black * added tortoise config and updated config and args, refactoring the code * added tortoise to api * Pull mel_norm from url * Use TTS cleaners * Let download model files * add ability to pass tortoise presets through coqui api * fix tests * fix style and tests * fix tts commandline for tortoise * Add config.json to tortoise * Use kwargs * Use regular model api for loading tortoise * Add load from dir to synthesizer * Fix Tortoise floats * Use model_dir when there are multiple urls * Use `synthesize` when exists * lint fixes and resolve preset bug * resolve a download bug and update model link * fix json * do tortoise inference from voice dir * fix * fix test * fix speaker id and remove assests * update inference_tests.yml * replace inference_test.yml * fix extra dir as None * fix tests * remove space * Reformat docstring * Add docs * Update docs * lint fixes --------- Co-authored-by: Eren Gölge <egolge@coqui.ai> Co-authored-by: Eren Gölge <erogol@hotmail.com>	2023-05-16 00:58:21 +02:00
Jindrich Matousek	a60b423f76	Merge remote-tracking branch 'upstream/main'	2023-04-13 13:21:06 +02:00
Matthew Boakes	4c829e74a1	Update Librosa Version To V0.10.0	2023-04-05 00:59:20 +01:00
p0p	91cf1b2da9	[minor] batch["speaker_ids"] getting set two times (#2470 ) * [minor] batch["speaker_ids"] getting set two times just to make it consistent with language_ids * Update vits.py style.	2023-04-03 11:35:21 +02:00
Eren Gölge	d309f50e53	Implement FreeVC (#2451 ) * Update .gitignore * Draft FreeVC implementation * Tests and relevant updates * Update API tests * Add missings * Update requirements * :( * Lazy handle for vc * Update docs for voice conversion * Make style	2023-03-25 18:33:23 +01:00
Khalid Bashir	14c80dd1fd	vits.py training fixed due to return_complex (#2418 ) Torch set default value for `return_complex=True` for `torch.stft` method This turned warning into error:- ``` Traceback (most recent call last): File "/usr/local/lib/python3.10/dist-packages/trainer/trainer.py", line 1591, in fit self._fit() File "/usr/local/lib/python3.10/dist-packages/trainer/trainer.py", line 1544, in _fit self.train_epoch() File "/usr/local/lib/python3.10/dist-packages/trainer/trainer.py", line 1309, in train_epoch _, _ = self.train_step(batch, batch_num_steps, cur_step, loader_start_time) File "/usr/local/lib/python3.10/dist-packages/trainer/trainer.py", line 1162, in train_step outputs, loss_dict_new, step_time = self._optimize( File "/usr/local/lib/python3.10/dist-packages/trainer/trainer.py", line 1023, in _optimize outputs, loss_dict = self._model_train_step(batch, model, criterion, optimizer_idx=optimizer_idx) File "/usr/local/lib/python3.10/dist-packages/trainer/trainer.py", line 970, in _model_train_step return model.train_step(*input_args) File "/workspace/coqui-tts/TTS/tts/models/vits.py", line 1293, in train_step mel_slice_hat = wav_to_mel( File "/workspace/coqui-tts/TTS/tts/models/vits.py", line 191, in wav_to_mel spec = torch.stft( File "/usr/local/lib/python3.10/dist-packages/torch/functional.py", line 641, in stft return _VF.stft(input, n_fft, hop_length, win_length, window, # type: ignore[attr-defined] RuntimeError: stft requires the return_complex parameter be given for real inputs, and will further require that return_complex=True in a future PyTorch release. ```	2023-03-19 00:22:04 +01:00
Roee Shenberg	3c15f0619a	Bug fixes in OverFlow audio generation (#2380 )	2023-03-15 12:02:11 +01:00
Jindrich Matousek	027d69f48b	Merge branch 'coqui-ai:main' into main	2023-03-14 09:24:03 +01:00
Jindrich Matousek	67edc4e40f	Fix length scale handling and default value	2023-03-13 21:13:51 +01:00
Daniel Vera Nieto	dfb48737fb	Style fixed	2023-03-13 16:11:15 +01:00
Dani Vera	0d12229b64	Update vits.py This should fix the issue https://github.com/coqui-ai/TTS/issues/1986 without breaking batch data sampling.	2023-03-10 18:35:16 +01:00
Jindrich Matousek	fcfecf6310	Fix usage of `aux_input["min_input_length"]` when running `test_run()` during training	2023-03-09 16:32:29 +01:00

1 2 3 4 5 ...

371 Commits