coqui-tts

Commit Graph

Author	SHA1	Message	Date
Eren Gölge	fba257104d	Compute F0 using librosa	2021-09-06 15:16:58 +00:00
Katsuya Iida	165e5814af	Update Japanese phonemizer (#758 ) * Update default ja vocoder * update * Japanese phonemizer test * Run make style Co-authored-by: Eren Gölge <egolge@coqui.ai>	2021-09-01 09:33:15 +02:00
Eren Gölge	f186856e5d	Add option to sort input sequnce by audio len	2021-08-30 08:10:35 +00:00
Eren Gölge	c312acac7d	Implement VITS model 🚀 VITS model implementation built on Glow TTS and HiFiGAN layers.	2021-08-09 18:02:36 +00:00
Eren Gölge	003e5579e8	Enable `custom_symbols` in text processing Models can define their own custom symbols lists with custom `make_symbols()`	2021-08-09 18:02:36 +00:00
Eren Gölge	e4648ffef1	Fix multi-speaker init of Tacotron models & tests	2021-08-09 18:02:36 +00:00
Agrin Hilmkil	ced4cfdbbf	Allow saving / loading checkpoints from cloud paths (#683 ) * Allow saving / loading checkpoints from cloud paths Allows saving and loading checkpoints directly from cloud paths like Amazon S3 (s3://) and Google Cloud Storage (gs://) by using fsspec. Note: The user will have to install the relevant dependency for each protocol. Otherwise fsspec will fail and specify which dependency is missing. * Append suffix _fsspec to save/load function names * Add a lower bound to the fsspec dependency Skips the 0 major version. * Add missing changes from refactor * Use fsspec for remaining artifacts * Add test case with path requiring fsspec * Avoid writing logs to file unless output_path is local * Document the possibility of using paths supported by fsspec * Fix style and lint * Add missing lint fixes * Add type annotations to new functions * Use Coqpit method for converting config to dict * Fix type annotation in semi-new function * Add return type for load_fsspec * Fix bug where fs not always created * Restore the experiment removal functionality	2021-08-09 18:02:36 +00:00
Eren Gölge	75b201c6c1	Merge pull request #673 from coqui-ai/fix_stopnet Fix stopnet training for Tacotron models	2021-07-24 12:25:38 +02:00
Eren Gölge	fc0c4600bd	Fix stopnet training	2021-07-24 11:39:54 +02:00
Eren Gölge	30eed347b6	Merge pull request #581 from Edresson/dev Compute speaker embeddings in batch for the LSTM Speaker Encoder and Compute embeddings/ finding chars using config file.	2021-07-23 17:22:51 +02:00
WeberJulian	c79a82ed07	refix linter	2021-07-13 23:12:18 +02:00
Edresson	2e5baffa9c	Merge fix and eval split as argparse	2021-07-13 01:47:32 -03:00
Edresson	4eac1c4651	bug fix on train_encoder and unit tests	2021-07-11 12:00:39 -03:00
Eren Gölge	1e9538aaef	Add more model tests to `test_synthesize`	2021-07-04 11:45:49 +02:00
Eren Gölge	47b3b10d6d	Bump up to v0.1.0 🚀	2021-06-29 13:07:59 +02:00
Eren Gölge	9790eddada	Fix wrong argument name 🛠️	2021-06-28 17:03:47 +02:00
Eren Gölge	626c9d41e6	Update tests for the new trainer API	2021-06-28 17:03:19 +02:00
Eren Gölge	fcfd95669a	Update model test configs	2021-06-28 17:03:19 +02:00
Eren Gölge	00c82c516d	rename to	2021-06-28 17:03:19 +02:00
Eren Gölge	fdfb18d230	downsize melgan test model size	2021-06-28 17:03:19 +02:00
Eren Gölge	25238e0658	fix glow-tts `inference()`	2021-06-28 17:03:19 +02:00
Eren Gölge	82582993cc	use one testing sentence in tts tests	2021-06-28 17:03:19 +02:00
Eren Gölge	419735f440	refactor and fix multi-speaker training in Trainer and Tacotron models	2021-06-28 17:03:19 +02:00
Eren Gölge	269e5a734e	add max_decoder_steps argument to tacotron models	2021-06-28 17:03:19 +02:00
Eren Gölge	304d60197b	reduce multiband melgan test model size	2021-06-28 17:03:19 +02:00
Eren Gölge	db6a97d1a2	rename external speaker embedding arguments as `d_vectors`	2021-06-28 17:03:19 +02:00
Eren Gölge	877bf66b61	reduce size of the metadata.csv used at testing	2021-06-28 17:03:19 +02:00
Eren Gölge	87c61d210a	update test to be less demanding	2021-06-28 17:03:19 +02:00
Eren Gölge	6d6896fd99	reduce fullband-melgan test model size	2021-06-28 17:03:19 +02:00
Eren Gölge	1443d03af1	update test for the new input output API of the tts models	2021-06-28 17:03:19 +02:00
Eren Gölge	ef4ea9e527	update imports for `formatters`	2021-06-28 17:03:19 +02:00
Eren Gölge	6c495c6a6e	fix glow-tts inference and forward functions for handling `cond_input` and refactor its test	2021-06-28 17:03:19 +02:00
Eren Gölge	b500338faa	make style	2021-06-28 17:03:19 +02:00
Eren Gölge	d25f017b42	update `setup_model.py` imports	2021-06-28 17:03:19 +02:00
Eren Gölge	7dff6be871	update tts training tests to use the trainer	2021-06-28 17:03:19 +02:00
Eren Gölge	9134c7dfb6	update `sequence_mask` import globally	2021-06-28 17:03:19 +02:00
Eren Gölge	8def3c87af	trainer-API updates	2021-06-28 17:03:19 +02:00
Eren Gölge	42554cc711	rename MyDataset -> TTSDataset	2021-06-28 17:03:19 +02:00
Edresson Casanova	eb84bb2bc8	Merge branch 'dev' into dev	2021-06-26 15:32:19 -03:00
Eren Gölge	6c7bbcaef0	Use `en-us` for testing phoneme models	2021-06-25 16:52:17 +02:00
Michael Hansen	a41f53fe72	Fix silly error in tests	2021-06-25 14:41:35 +02:00
Michael Hansen	3f172b84d8	Fix linting issues	2021-06-25 14:41:31 +02:00
Michael Hansen	4d8426fa0a	Use eSpeak IPA lexicons by default for phoneme models	2021-06-25 14:41:05 +02:00
Michael Hansen	47191f3ecc	Add tests for gruut phonemization	2021-06-25 14:41:05 +02:00
Edresson	28bec238ca	fix Lint checks	2021-06-18 14:33:50 -03:00
Edresson	83644056e3	fix Lint checks	2021-06-18 14:32:28 -03:00
Eren Gölge	db48c69f0f	reduce fullband melgan model size for testing	2021-06-02 11:44:53 +02:00
Eren Gölge	49c5e5d820	maket style japanese PR	2021-06-02 11:44:46 +02:00
Eren Gölge	0c14278c30	reorg test files	2021-06-02 11:40:26 +02:00
Eren Gölge	73b4083c6c	Merge pull request #502 from kaiidams/kaiidams/kokoro Japanese Tacotron 2 model	2021-06-02 10:20:08 +02:00
Katsuya Iida	1cc18d1972	Move unittest of Japanese phonemizer.	2021-06-01 18:51:34 +09:00
Eren Gölge	bec85ac58d	make style	2021-05-31 16:37:15 +02:00
Eren Gölge	301c516abd	Merge branch 'dev' of https://github.com/coqui-ai/TTS into dev	2021-05-31 15:46:25 +02:00
Edresson	cc192b6843	add resnet speaker encoder train unit test	2021-05-29 22:43:41 -03:00
Eren Gölge	925c08cf95	replace unidecode with anyascii	2021-05-27 14:02:44 +02:00
Edresson	bc5307caa0	add unit tests for SoftmaxAngleProtoLoss and ResnetSpeakerEncoder and bugfix	2021-05-26 20:35:58 -03:00
Edresson	7a9a27282a	fix unit tests	2021-05-26 18:14:06 -03:00
Eren Gölge	f5a9950a2e	phoneme files for testing	2021-05-19 03:16:57 +02:00
Eren Gölge	9b706c5583	enable phonemes in test to match the attention masks	2021-05-19 03:15:54 +02:00
Eren Gölge	8a7c40736c	set use_phonemes false	2021-05-19 01:27:26 +02:00
Eren Gölge	faedea4b60	set use_phonemes False in configs	2021-05-19 00:08:07 +02:00
Eren Gölge	ccfaa6b1d5	add `needs_phonemizer` field to models.json. If set true these models are only compatible with v0.0.13 or below.	2021-05-18 17:57:28 +02:00
Eren Gölge	a14fcf2a13	remove text_processing test	2021-05-18 17:57:28 +02:00
Eren Gölge	12722501bb	styling	2021-05-15 23:48:31 +02:00
Eren Gölge	7e02cff924	reduce pwgan test batch size	2021-05-12 16:10:08 +02:00
Eren Gölge	da49089a72	update melgan training test batch size	2021-05-12 10:12:11 +02:00
Eren Gölge	0213e1cbf4	update configs for tts models to match the field typed with the expected values	2021-05-12 00:57:38 +02:00
Eren Gölge	715b0a65a0	update main.yml for python x64 fix test	2021-05-12 00:57:29 +02:00
Eren Gölge	843d1b3d98	linter fixes	2021-05-11 11:30:00 +02:00
Eren Gölge	19fb1d743d	style update	2021-05-11 11:30:00 +02:00
Eren Gölge	5aee30443f	refactoring tests after Coqpit	2021-05-11 11:30:00 +02:00
Eren Gölge	87384c6008	get_device_id() for tests	2021-05-11 11:29:18 +02:00
Eren Gölge	21dd4d7960	fix load_config imports for Coqpit	2021-05-11 11:29:18 +02:00
Eren Gölge	9f2d2d2081	add speaker encoder train test	2021-05-11 11:29:18 +02:00
Eren Gölge	9ee70af9bb	code styling	2021-05-11 11:29:18 +02:00
Eren Gölge	70fc7a7e71	remove bash running tests	2021-05-11 11:29:18 +02:00
Eren Gölge	1be45eae38	add new tests for vocoder trainings using coqpit	2021-05-11 11:29:18 +02:00
Eren Gölge	045f1c3e76	add hifigan train test	2021-05-11 11:29:18 +02:00
Eren Gölge	e6f45b9eb7	update train_vocoder_gan.py for coqpit	2021-05-11 11:29:18 +02:00
Eren Gölge	bcebd69d09	remove bash tts training tests	2021-05-11 11:29:17 +02:00
Eren Gölge	51a7e06945	glow_tts_config.py and train test on python	2021-05-11 11:29:17 +02:00
Eren Gölge	c6df8de80a	remove output train folder at the end of the test	2021-05-11 11:29:17 +02:00
Eren Gölge	35341d5482	move bash script based tests to python with coqpit	2021-05-11 11:29:17 +02:00
Eren Gölge	a21c0b5585	config update 2 WIP	2021-05-11 11:28:35 +02:00
Eren Gölge	f7582107da	Merge pull request #453 from Edresson/dev Script for spectrogram extraction using teacher forcing and Glow-TTS inference with MAS.	2021-05-06 17:53:28 +02:00
Edresson	070227d2ab	change python3 to python in the extract tts espectrograms script to avoid incompatibility	2021-05-06 11:32:42 -03:00
Edresson	65860a954a	remove unused vars on test glow tts	2021-05-05 07:15:36 -03:00
Edresson	d78f27ea41	bugfix on tacotron unit test	2021-05-05 06:38:01 -03:00
Edresson	e3f56b613b	update unit test for extract tts spectrograms script	2021-05-04 20:45:07 -03:00
Eren Gölge	8cb27267a4	formatting	2021-05-03 14:26:35 +02:00
Edresson	3ecd556bbe	add unit test for extract tts spectrograms script	2021-05-01 13:41:56 -03:00
Edresson	bb82f4ae8b	add unit test for GlowTTS inference with MAS	2021-04-29 19:39:09 -03:00
Eren Gölge	f02f0338c2	fix .models.json and add testing to check released models availability	2021-04-29 09:32:36 +02:00
Eren Gölge	6353e87166	fix test	2021-04-27 15:04:20 +02:00
Eren Gölge	628abfe644	remove test	2021-04-27 14:35:39 +02:00
Eren Gölge	1235e54738	test for synthesize.py	2021-04-27 14:17:38 +02:00
Eren Gölge	19d9f58009	create dummy model on the fly	2021-04-27 13:27:24 +02:00
Eren Gölge	2f0716073e	enable multi-speaker CoquiTTS models for synthesize.py	2021-04-26 19:36:53 +02:00
Eren Gölge	f37b488876	Merge branch 'speaker-manager' of https://github.com/coqui-ai/TTS into speaker-manager	2021-04-26 15:25:25 +02:00
Eren Gölge	b82daa5e86	style and linter fixes	2021-04-26 15:22:24 +02:00
Eren Gölge	4cf211348d	styling and linting	2021-04-23 18:04:37 +02:00
Eren Gölge	a878d8fb42	update tests	2021-04-23 18:04:37 +02:00
Eren Gölge	c80d21f311	load speaker_encoder_ap and compute x_vector directly from the input file in speaker manager	2021-04-23 18:04:37 +02:00
Eren Gölge	32e6afc009	add `SpeakerManager` tests	2021-04-23 18:04:37 +02:00
Eren Gölge	757dfb9289	add `SpeakerManager` tests	2021-04-21 13:28:43 +02:00
Eren Gölge	a7f6045644	Merge branch 'reformat' into hifigan-reformat	2021-04-12 12:00:17 +02:00
Eren Gölge	f519012dea	reformatting and styling	2021-04-12 11:47:39 +02:00
Eren Gölge	3c0d1d066d	update permissions of all the shell scripts	2021-04-10 20:04:27 +02:00
Eren Gölge	5fd2333c91	move bash based tests to a separate folder and rename the calling script file. Fix align_tts training test.	2021-04-09 03:07:58 +02:00
Eren Gölge	87ee6ceb57	style update #3	2021-04-09 01:17:15 +02:00
Eren Gölge	e5b9607bc3	isort all imports	2021-04-09 00:45:20 +02:00
Eren Gölge	0e79fa86ad	format with black and pylint 2.7.3	2021-04-09 00:38:08 +02:00
Eren Gölge	5de7eb708b	fix the config	2021-04-09 00:36:49 +02:00
Eren Gölge	cd69da4868	linter fixes #2	2021-04-08 16:57:46 +02:00
Eren Gölge	4d3e1e9d9a	linter fix	2021-04-08 14:57:46 +02:00
Eren Gölge	53f54898bc	small fixes	2021-04-08 14:22:47 +02:00
Eren Gölge	a7fb49843b	update test config	2021-04-08 13:18:03 +02:00
Eren Gölge	ba80e82520	udpate gan_datasets tests	2021-04-08 11:52:35 +02:00
Eren Gölge	a669a492c6	test melgan feature loss	2021-04-07 12:36:56 +02:00
Eren Gölge	c8953f4da9	l1_spec_loss parameters in melgan test config	2021-04-07 12:36:36 +02:00
Eren Gölge	7a382a5c2b	stowed aligntts commit and small refactoring with feed_forward layers	2021-03-30 14:39:16 +02:00
Eren Gölge	2c364c0df8	test fixes	2021-03-30 14:39:16 +02:00
Eren Gölge	2b3e12ea49	correct imports after refactoring, add AlignTTS (old SSMAS) and some formatting	2021-03-30 14:39:16 +02:00
Eren Gölge	ecb6b0d6ad	rename GlowTtts as GlowTTS	2021-03-30 14:39:16 +02:00
Eren Gölge	d9c405f0c3	create feedforward folder for SS layers	2021-03-30 14:39:16 +02:00
Eren Gölge	fd5c24c3b8	add missing config file	2021-03-18 13:33:23 +01:00
Eren Gölge	0514330869	fix mozilla/TTS#685	2021-03-18 13:33:23 +01:00
Eren Gölge	43672e206c	test config for tacotron model	2021-03-18 13:33:00 +01:00
Eren Gölge	65533f33e9	fix #374	2021-03-18 13:33:00 +01:00
Eren Gölge	d790d2fccb	linter fix	2021-03-18 13:33:00 +01:00
WeberJulian	f6cd8e0ecc	test case	2021-03-18 13:33:00 +01:00
Eren Gölge	4cc162a8b5	enable model manager test	2021-03-10 18:20:06 +01:00
Eren Gölge	f24d48c245	linter fix	2021-03-08 16:08:23 +01:00
Eren Gölge	94805236fb	Merge branch 'dev' of https://github.com/coqui-ai/TTS into dev	2021-03-08 15:21:06 +01:00
Eren Gölge	1e169a5ba5	bunch of short fixes	2021-03-08 14:51:27 +01:00
Eren Gölge	947e3d6a93	rename test	2021-03-08 14:50:54 +01:00
Eren Gölge	00bfa54e21	update server tests	2021-03-08 14:07:22 +01:00
Eren Gölge	dd4f46b60f	disable model manager test until moving models to github releases	2021-03-08 14:06:58 +01:00
Eren Gölge	9a48ba3821	a ton of linter updates	2021-03-08 05:06:54 +01:00
gerazov	2451a813a2	refactored keep_all_best	2021-03-08 02:57:11 +01:00
gerazov	62147994d4	added keep_best and keep_after to test configs.	2021-03-08 02:56:36 +01:00
Eren Gölge	ee71eb4eb7	linter fixes	2021-03-08 02:54:47 +01:00
Eren Gölge	55fc50b26d	update test_text_processing for espeak-ng	2021-03-08 02:54:47 +01:00
Eren Gölge	8ec28b1ac2	linterup date	2021-03-08 02:54:47 +01:00
Eren Gölge	e06c93fe81	model_manager tests	2021-03-08 02:54:47 +01:00
Eren Gölge	6cd642c2e1	add missing phonemes to test_config.json	2021-03-08 02:54:47 +01:00
Eren Gölge	a2e7403ae8	test_compute_statistics.sh	2021-03-08 02:54:47 +01:00
Eren Gölge	3c961370e7	linter fixes	2021-03-08 02:54:21 +01:00
Eren Gölge	83e50757ae	use single process dataloder in tests	2021-03-08 02:54:21 +01:00
Eren Gölge	d57560dcd3	reduce glowtts size for testing	2021-03-08 02:54:21 +01:00

1 2 3 4 5 ...

433 Commits