coqui-tts

Commit Graph

Author	SHA1	Message	Date
code-review-doctor	fa887ef5f9	Fix issue probably-meant-fstring found at https://codereview.doctor (#1532 )	2022-05-07 13:33:40 +02:00
Eren Gölge	a0a9279e4b	Fix GAN optimizer order commit `212d330929` Author: Edresson Casanova <edresson1@gmail.com> Date: Fri Apr 29 16:29:44 2022 -0300 Fix unit test commit `44456b0483` Author: Edresson Casanova <edresson1@gmail.com> Date: Fri Apr 29 07:28:39 2022 -0300 Fix style commit `d545beadb9` Author: Edresson Casanova <edresson1@gmail.com> Date: Thu Apr 28 17:08:04 2022 -0300 Change order of HIFI-GAN optimizers to be equal than the original repository commit `657c5442e5` Author: Edresson Casanova <edresson1@gmail.com> Date: Thu Apr 28 15:40:16 2022 -0300 Remove audio padding before mel spec extraction commit `76b274e690` Merge: `379ccd7b` `6233f4fc` Author: Edresson Casanova <edresson1@gmail.com> Date: Wed Apr 27 07:28:48 2022 -0300 Merge pull request #1541 from coqui-ai/comp_emb_fix Bug fix in compute embedding without eval partition commit `379ccd7ba6` Author: WeberJulian <julian.weber@hotmail.fr> Date: Wed Apr 27 10:42:26 2022 +0200 returns y_mask in VITS inference (#1540) * returns y_mask * make style	2022-05-07 13:29:11 +02:00
Edresson Casanova	60034674f9	Remove audio padding before mel spec extraction	2022-05-07 13:12:09 +02:00
WeberJulian	fbdf76b2fc	returns y_mask in VITS inference (#1540 ) * returns y_mask * make style	2022-05-03 13:49:24 +02:00
Edresson Casanova	6233f4fcd7	Bug fix in compute embedding without eval partition	2022-04-26 13:58:03 -03:00
Edresson Casanova	8d228ab22a	Trick to Upsampling to High sampling rates using VITS model (#1456 ) * Add upsample VITS support * Fix the bug in inference * Fix lint checks * Add RMS based norm in save_wav method * Style fix * Add the period for VITS multi-period discriminator in model_args * Bug fix in speaker encoder load in inference time * Add unit tests * Remove useless detach_z_vocoder parameter * Add docs for VITS upsampling * Fix the docs * Rename TTS_part_sample_rate to encoder_sample_rate * Add upsampling_init and upsampling_z methods * Add asserts for encoder_sample_rate part * Move upsampling tests to test_vits.py	2022-04-26 11:47:46 +02:00
Eren Gölge	c410bc58ef	Bump to v0.6.2	2022-04-20 11:46:26 +02:00
WeberJulian	30bea7d53c	Update manage.py (#1514 )	2022-04-19 14:27:32 +02:00
Eren Gölge	7133f8f47d	Print Model's license when downloading (#1512 ) * Print model license while downloading * Make style * Add a new license link * Make style	2022-04-19 14:18:49 +02:00
WeberJulian	4953636b14	Add African models (#1511 ) * Add african models * Set default license for all models	2022-04-19 14:18:30 +02:00
Edresson Casanova	060e0f9368	Add EmbeddingManager and BaseIDManager (#1374 )	2022-03-31 13:41:16 +02:00
WeberJulian	1b22f03e98	Fix G2P backend of the released models (#1461 ) * Fix enforce phonemizer * Add new models * Fix .model.json	2022-03-30 12:47:11 +02:00
WeberJulian	c66a6241fd	Enforce phonemizer definition for synthesis (#1441 ) * Enforce phonemizer definition for synthesis * Fix train_tts, tokenizer init can now edit config * Add small change to trigger CI pipeline * fix wrong output path for one tts_test * Fix style * Test config overides by args and tokenizer * Fix style	2022-03-25 23:15:33 +01:00
Edresson Casanova	37896e1743	Bug fix in freeze encoder (#1391 ) * Fix the bug in freeze encoder * Remove emb_l definition for non-multilingual training * Fix unit tests	2022-03-24 18:16:04 +01:00
Edresson Casanova	3435bc8fca	Fix style tests	2022-03-23 15:05:32 -03:00
Edresson Casanova	0ae1e0248c	Fix the bug for emptly audio files	2022-03-23 14:39:31 -03:00
Edresson Casanova	ea53d6feb3	Replace webrtcvad by silero-vad	2022-03-23 14:39:31 -03:00
Eren Gölge	3af01cfe3b	Update base model wrt 👟 (#1406 )	2022-03-23 17:24:20 +01:00
Eren Gölge	1c3623af33	Fix model manager (#1436 ) * Fix manager * Make style	2022-03-23 12:57:14 +01:00
Eren Gölge	72d85e53c9	Update model file extension (#1422 ) * Update model file ext to ```.pth``` * Update docs * Rename more * Find model files	2022-03-22 17:55:00 +01:00
Eren Gölge	fd56fabb21	Fix #1380 (#1409 )	2022-03-16 12:38:27 +01:00
Eren Gölge	0870a4faa2	Make style (#1405 )	2022-03-16 12:13:55 +01:00
WeberJulian	690c96ed28	Fix default phonemizer for ja and zh (#1399 )	2022-03-16 12:13:22 +01:00
Edresson Casanova	f81892483d	REBASED: Transform Speaker Encoder in a Generic Encoder and Implement Emotion Encoder training support (#1349 ) * Rename Speaker encoder module to encoder * Add a generic emotion dataset formatter * Transform the Speaker Encoder dataset to a generic dataset and create emotion encoder config * Add class map in emotion config * Add Base encoder config * Add evaluation encoder script * Fix the bug in plot_embeddings * Enable Weight decay for encoder training * Add argumnet to disable storage * Add Perfect Sampler and remove storage * Add evaluation during encoder training * Fix lint checks * Remove useless config parameter * Active evaluation in speaker encoder test and use multispeaker dataset for this test * Unit tests fixs * Remove useless tests for speedup the aux_tests * Use get_optimizer in Encoder * Add BaseEncoder Class * Fix the unitests * Add Perfect Batch Sampler unit test * Add compute encoder accuracy in a function	2022-03-11 14:43:40 +01:00
Edresson Casanova	36e9ea2f97	Open bible dataset formatter (#1365 ) * Add support for voice conversion inference * Cache d_vectors_by_speaker for fast inference using a bigger speakers.json * Rebase bug fix * Use the average d-vector for inference * Fix the bug in find unique chars script * Add OpenBible formatter Co-authored-by: Eren Gölge <erogol@hotmail.com>	2022-03-11 10:43:31 +01:00
Edresson Casanova	dbe9da7f15	Add Voice conversion inference support (#1337 ) * Add support for voice conversion inference * Cache d_vectors_by_speaker for fast inference using a bigger speakers.json * Rebase bug fix * Use the average d-vector for inference	2022-03-10 14:57:12 +01:00
Edresson Casanova	917f417ac4	Add alphas to control language and speaker balancer (#1216 ) * Add alphas to control language and speaker balancer * Add docs for speaker and language samplers * Change the Samplers weights to float for save memory * Change the test_samplers to unittest format * Add get_sampler method in BaseTTS * Fix rebase issues * Add language and speaker samplers support for DDP training * Rename distributed sampler wrapper * Remove the DistributedSamplerWrapper and use the one from Trainer * Bugfix after rebase * Move the samplers config to tts config	2022-03-10 14:56:09 +01:00
Edresson Casanova	f381e29b91	REBASED: Add support for the speaker encoder training using torch spectrograms (#1348 ) * Add support for the speaker encoder training using torch spectrograms * Remove useless function in speaker encoder dataset class	2022-03-10 14:54:51 +01:00
Eren Gölge	c670365507	Fix VCTK recipe and formatter	2022-03-08 14:20:34 +01:00
Eren Gölge	8feb41d361	Bump up to v0.6.1	2022-03-07 15:57:44 +01:00
Eren Gölge	ee02bc3823	Bump up to v0.6.0	2022-03-07 12:08:22 +01:00
Eren Gölge	dc280819be	Add new models	2022-03-07 12:08:09 +01:00
Eren Gölge	e9d9028b4d	Revert cleaner name	2022-03-06 12:57:06 +01:00
Eren Gölge	764c7fa4a4	Rename phoneme_cleaners	2022-03-06 12:09:54 +01:00
Eren Gölge	dd4287de1f	Update models	2022-03-03 20:23:00 +01:00
Eren Gölge	6cb00be795	Update your_tts model URL	2022-03-02 18:04:49 +01:00
Eren Gölge	1425a023fe	Make style and lint	2022-03-02 13:25:35 +01:00
Eren Gölge	c68885b3fd	Update Vits speaker encoder init	2022-03-02 13:20:23 +01:00
Eren Gölge	27b67b7945	Fix import	2022-03-02 09:15:20 +01:00
Eren Gölge	942df0fb05	Update vits dataset	2022-03-02 09:14:32 +01:00
Eren Gölge	6a9f8074f0	Fix TTSDataset	2022-03-01 07:57:48 +01:00
Eren Gölge	690de1ab06	Update Characters and add more tests	2022-02-25 11:32:44 +01:00
Eren Gölge	9063397892	Fix FastSpeech config	2022-02-25 11:31:56 +01:00
Eren Gölge	1e414b3a09	Make stlye	2022-02-25 11:31:56 +01:00
Eren Gölge	acc83cd3e6	Update Vits model API	2022-02-25 11:31:56 +01:00
Eren Gölge	fe656659be	Implement BaseTTS	2022-02-25 11:31:56 +01:00
Eren Gölge	bed4afd4ee	Implement BaseVocabulary	2022-02-25 11:31:56 +01:00
Eren Gölge	e0f9be76c0	Update test_run in wavernn and wavegrad	2022-02-25 11:31:56 +01:00
Eren Gölge	bf540f4323	Update imports for trainer	2022-02-25 11:31:56 +01:00
Eren Gölge	4c43eda414	Update BaseTrainerModel	2022-02-25 11:31:56 +01:00

1 2 3 4 5 ...

1556 Commits