coqui-tts

Commit Graph

Author	SHA1	Message	Date
manmay nakhashi	a3d5801c44	Tortoise TTS inference (#2547 ) * initial commit * Tortoise inference * revert path change * style fix * remove accidental remove * style fixes * style fixes * removed unwanted assests and deps * remove changes * remove cvvp * style fix black * added tortoise config and updated config and args, refactoring the code * added tortoise to api * Pull mel_norm from url * Use TTS cleaners * Let download model files * add ability to pass tortoise presets through coqui api * fix tests * fix style and tests * fix tts commandline for tortoise * Add config.json to tortoise * Use kwargs * Use regular model api for loading tortoise * Add load from dir to synthesizer * Fix Tortoise floats * Use model_dir when there are multiple urls * Use `synthesize` when exists * lint fixes and resolve preset bug * resolve a download bug and update model link * fix json * do tortoise inference from voice dir * fix * fix test * fix speaker id and remove assests * update inference_tests.yml * replace inference_test.yml * fix extra dir as None * fix tests * remove space * Reformat docstring * Add docs * Update docs * lint fixes --------- Co-authored-by: Eren Gölge <egolge@coqui.ai> Co-authored-by: Eren Gölge <erogol@hotmail.com>	2023-05-16 00:58:21 +02:00
Eren Gölge	dfb51e06b2	Add jenny model (#2603 )	2023-05-08 12:05:40 +02:00
Eren Gölge	a44a0e1fd2	Update model urls	2023-04-17 14:53:27 +02:00
Eren Gölge	36be05290d	Add models	2023-04-17 12:52:32 +02:00
Eren Gölge	d309f50e53	Implement FreeVC (#2451 ) * Update .gitignore * Draft FreeVC implementation * Tests and relevant updates * Update API tests * Add missings * Update requirements * :( * Lazy handle for vc * Update docs for voice conversion * Make style	2023-03-25 18:33:23 +01:00
Eren G??lge	713e8c8d04	Add pretrained model	2023-01-30 13:55:17 +01:00
Khalid Bashir	42afad5e79	Fixed bug related to yourtts speaker embeddings issue (#2234 ) * Fixed bug related to yourtts speaker embeddings issue * Reverted code for base_tts * Bug fix on VITS d_vector_file type * Ignore the test speakers on YourTTS recipe * Add speaker encoder model and config on YourTTS recipe to easily do zero-shot inference * Update YourTTS config file * Update ModelManager._update_path to deal with list attributes * Fix lint checks * Remove unused code * Fix unit tests * Reset name_to_id to get the right speaker ids on load_embeddings_from_list_of_files * Set weighted_sampler_multipliers as an empty dict to prevent users' mistakes Co-authored-by: Edresson Casanova <edresson1@gmail.com>	2023-01-02 14:20:02 +01:00
Eren G??lge	cf765cb3f2	Add ca and fa models	2022-12-26 14:29:10 +01:00
Eren Gölge	ecea43ec81	Adding pre-trained Overflow model (#2211 ) * Adding pretrained Overflow model * Stabilize HMM * Fixup model manager * Return `audio_unique_name` by default * Distribute max split size over datasets * Fixup eval_split_size * Make style	2022-12-14 16:55:48 +01:00
logan hart	ff9b63d02a	Add neon models (#2140 ) * Add neon ljspeech vits model * Add neon german model * Update .models.json * Add neon spanish model * Add french model * Add Dutch model * Add Hungarian model * Add Greek model * Remove uneeded description * Update .models.json * Update .models.json * Handling neon models * Add all neon models * Update .models.json * Split zoo_tests * Update test names * Update model testing Co-authored-by: Eren Gölge <erogol@hotmail.com>	2022-11-16 16:12:39 +01:00
Julian Weber	bb59718c03	Add capacitron v2 model (#1768 ) * Add capacitron v2 in .models.json * Put right commit hash	2022-09-08 09:43:56 +02:00
Eren Gölge	e5430a6519	Add new DE Thorsten models (#1898 ) - Tacotron2-DDC - HifiGAN vocoder	2022-08-22 11:27:39 +02:00
WeberJulian	30c72e0d05	Add Thorsten VITS model (#1675 ) Co-authored-by: Eren Gölge <egolge@coqui.ai>	2022-06-21 11:39:49 +02:00
a-froghyar	8be21ec387	Capacitron (#977 ) * new CI config * initial Capacitron implementation * delete old unused file * fix empty formatting changes * update losses and training script * fix previous commit * fix commit * Add Capacitron test and first round of test fixes * revert formatter change * add changes to the synthesizer * add stepwise gradual lr scheduler and changes to the recipe * add inference script for dev use * feat: add posterior inference arguments to synth methods - added reference wav and text args for posterior inference - some formatting * fix: add espeak flag to base_tts and dataset APIs - use_espeak_phonemes flag was not implemented in those APIs - espeak is now able to be utilised for phoneme generation - necessary phonemizer for the Capacitron model * chore: update training script and style - training script includes the espeak flag and other hyperparams - made style * chore: fix linting * feat: add Tacotron 2 support * leftover from dev * chore:rename parser args * feat: extract optimizers - created a separate optimizer class to merge the two optimizers * chore: revert arbitrary trainer changes * fmt: revert formatting bug * formatting again * formatting fixed * fix: log func * fix: update optimizer - Implemented load_state_dict for continuing training * fix: clean optimizer init for standard models * improvement: purge espeak flags and add training scripts * Delete capacitronT2.py delete old training script, new one is pushed * feat: capacitron trainer methods - extracted capacitron specific training operations from the trainer into custom methods in taco1 and taco2 models * chore: renaming and merging capacitron and gst style args * fix: bug fixes from the previous commit * fix: implement state_dict method on CapacitronOptimizer * fix: call method * fix: inference naming * Delete train_capacitron.py * fix: synthesize * feat: update tests * chore: fix style * Delete capacitron_inference.py * fix: fix train tts t2 capacitron tests * fix: double forward in T2 train step * fix: double forward in T1 train step * fix: run make style * fix: remove unused import * fix: test for T1 capacitron * fix: make lint * feat: add blizzard2013 recipes * make style * fix: update recipes * chore: make style * Plot test sentences in Tacotron * chore: make style and fix import * fix: call forward first before problematic floordiv op * fix: update recipes * feat: add min_audio_len to recipes * aux_input["style_mel"] * chore: make style * Make capacitron T2 recipe more stable * Remove T1 capacitron Ljspeech * feat: implement new grad clipping routine and update configs * make style * Add pretrained checkpoints * Add default vocoder * Change trainer package * Fix grad clip issue for tacotron * Fix scheduler issue with tacotron Co-authored-by: Eren Gölge <egolge@coqui.ai> Co-authored-by: WeberJulian <julian.weber@hotmail.fr> Co-authored-by: Eren Gölge <erogol@hotmail.com>	2022-05-20 16:17:11 +02:00
WeberJulian	4953636b14	Add African models (#1511 ) * Add african models * Set default license for all models	2022-04-19 14:18:30 +02:00
WeberJulian	1b22f03e98	Fix G2P backend of the released models (#1461 ) * Fix enforce phonemizer * Add new models * Fix .model.json	2022-03-30 12:47:11 +02:00
Eren Gölge	dc280819be	Add new models	2022-03-07 12:08:09 +01:00
Eren Gölge	dd4287de1f	Update models	2022-03-03 20:23:00 +01:00
Eren Gölge	6cb00be795	Update your_tts model URL	2022-03-02 18:04:49 +01:00
Eren Gölge	7ef458a59c	Updake default vocoder for uk model	2022-01-01 16:09:42 +00:00
Eren Gölge	c5512af82b	Update uk vocoder url	2022-01-01 15:38:21 +00:00
Eren Gölge	d37cfe474a	Merge branch 'pr/Edresson/731-rebased' into dev	2022-01-01 15:37:35 +00:00
Eren Gölge	33711afa01	Update yourTTS url	2022-01-01 15:37:08 +00:00
Eren Gölge	8100135a7e	Add the YourTTS entry to the models	2021-12-31 12:22:08 +00:00
Eren Gölge	8d2bb284ac	Add UK vocoder models	2021-12-21 13:13:35 +00:00
Eren Gölge	5ba47081ee	Use GL for VCTK FastPitch models	2021-11-01 16:39:03 +01:00
Eren Gölge	3ea1c2037b	Fix model entry in .models.json	2021-10-26 19:14:29 +02:00
Eren Gölge	7c10574931	Gateway for TTS models	2021-10-26 13:04:51 +02:00
Eren Gölge	027424dda8	Add VCTK fast_pitch and UK glow-tts	2021-10-25 19:29:16 +02:00
Eren Gölge	91bebebe18	Add new models to `.models.json` SpeedySpeech model using `ForwardTTS` UnivNet model fine-tuned on TacotronDDC_ph spectrograms	2021-09-13 08:22:14 +00:00
Eren Gölge	26f76fce22	Remove SpeedySpeech from .models.json	2021-09-10 17:47:27 +00:00
Eren Gölge	4cc544bc46	Add FastPitch model to `.models.json`	2021-09-06 16:59:22 +00:00
Katsuya Iida	165e5814af	Update Japanese phonemizer (#758 ) * Update default ja vocoder * update * Japanese phonemizer test * Run make style Co-authored-by: Eren Gölge <egolge@coqui.ai>	2021-09-01 09:33:15 +02:00
Eren Gölge	09ed8426e8	Add the models released with v0.2.0	2021-08-10 15:46:31 +00:00
Eren Gölge	7eb94f760b	Remove Ruslan model	2021-08-09 21:48:36 +00:00
Eren Gölge	6af03ac476	Fix `num_char` init in Tacotron models	2021-08-09 21:46:15 +00:00
Eren Gölge	febd6105b5	Update default vocoder for de-thorsten	2021-07-26 16:08:52 +02:00
Eren Gölge	4b7b88dd3d	Add fullband-melgan DE vocoder	2021-07-26 15:38:30 +02:00
Eren Gölge	270c3823eb	Fix #608	2021-07-04 11:19:31 +02:00
Eren Gölge	db47f4f105	Update `.models.json`	2021-07-02 10:43:00 +02:00
Eren Gölge	e66753bd0d	fixup! new japanese model placeholder in `.models.json`	2021-06-03 18:04:28 +02:00
Eren Gölge	bd434636a9	new japanese model placeholder in `.models.json`	2021-06-02 15:54:37 +02:00
Eren Gölge	ccfaa6b1d5	add `needs_phonemizer` field to models.json. If set true these models are only compatible with v0.0.13 or below.	2021-05-18 17:57:28 +02:00
Eren Gölge	f02f0338c2	fix .models.json and add testing to check released models availability	2021-04-29 09:32:36 +02:00
Eren Gölge	fd95e9b8a4	[ci skip] Add sam models	2021-04-28 21:57:31 +02:00
Eren Gölge	6bdd81667e	place holders for sc-glow and hifigan models	2021-04-26 19:53:12 +02:00
Eren Gölge	a53958ae3a	fix urls for the new models	2021-04-15 17:05:00 +02:00
Eren Gölge	1ad838bc83	add newly released models under .model.json	2021-04-15 16:06:10 +02:00
Eren Gölge	28a2fed8a3	update hifigan in .model.json	2021-04-12 16:48:05 +02:00
Eren Gölge	abaf36861a	aligntts model .model.json placeholder	2021-04-12 16:43:52 +02:00

1 2

69 Commits