coqui-tts

Commit Graph

Author	SHA1	Message	Date
WeberJulian	6f01eed672	Add test for language_weighted_sampler	2021-12-20 11:54:10 +00:00
Edresson	a57ddfb4ec	Add remove silence vad script Unit test	2021-12-20 11:54:10 +00:00
Edresson	e068fab6b2	Add find unique phonemes unit tests	2021-12-20 11:54:10 +00:00
WeberJulian	54e33bff61	Make a multilingual test use chars	2021-12-20 11:54:10 +00:00
WeberJulian	09eda31a3f	Fix tests	2021-12-20 11:54:10 +00:00
Edresson	06d89f93a8	Add VITS multilingual d-vectors unit test	2021-12-20 11:54:10 +00:00
Edresson	f394d60695	Fix the bug in multispeaker vits	2021-12-20 11:54:10 +00:00
WeberJulian	1472b6df49	make style	2021-12-20 11:54:10 +00:00
WeberJulian	3b5592abcf	fix test vits	2021-12-20 11:54:10 +00:00
Edresson	bbdb5c38e6	Add VITS multispeaker train unit test	2021-12-20 11:54:09 +00:00
Edresson	92f7f4f400	Active the multispeaker mode in multilingual training	2021-12-20 11:54:09 +00:00
Edresson	e68b042493	Add VITS d-vector unit test	2021-12-20 11:54:09 +00:00
Edresson	959cc8f03c	Add VITS multilingual unit test	2021-12-20 11:54:09 +00:00
Edresson	3fbbebd74d	Fix pylint issues	2021-12-20 11:54:09 +00:00
Michael Hansen	3bc043faeb	Upgrade to gruut 2.0 (#882 )	2021-10-31 11:41:55 +01:00
Eren Gölge	2df0752e73	Model zoo tests (#900 ) * Fix VITS model multi-speaker init * Remove gdrive support in model manager * Add model zoo tests	2021-10-29 17:54:16 +02:00
Eren Gölge	25759d6a61	Split tests	2021-10-21 17:30:15 +00:00
Eren Gölge	e62d3c5cf7	Use absolute imports for tts configs and models	2021-10-21 16:29:06 +00:00
Eren Gölge	4dbe7ed0de	Fix all-zero duration case for GlowTTS	2021-10-01 09:24:26 +00:00
Eren Gölge	7edbe04fe0	Fix WaveRNN config and test	2021-09-30 16:20:12 +00:00
Eren Gölge	4cacbf0d45	Fix WaveRNN test	2021-09-30 14:47:56 +00:00
Eren Gölge	2766dd1d6e	Fix #813 - GlowTTS training (#814 ) * Fix #813 * Update glow_tts recipe * Fix glow-tts test * Linter fix * Run data dep init only in training	2021-09-17 20:06:55 +02:00
Eren Gölge	1e7db32e90	Test FastPitch train	2021-09-11 10:19:47 +00:00
Eren Gölge	26f76fce22	Remove SpeedySpeech from .models.json	2021-09-10 17:47:27 +00:00
Eren Gölge	7ec23e69d4	Skip TF tests on GPU	2021-09-10 17:28:58 +00:00
Eren Gölge	1ebf9ec6bf	Remove speedy_speech implementation	2021-09-10 17:28:20 +00:00
Eren Gölge	7d8f77385a	Use `glow-tts` in synthesis tests	2021-09-10 17:27:33 +00:00
Eren Gölge	d6e29ef98a	Style update	2021-09-10 08:30:33 +00:00
Eren Gölge	3abc3a1d32	Fix GPU init in tests	2021-09-10 08:28:10 +00:00
Eren Gölge	ed4b1d8514	Test `TTS.tts.utils.helpers`	2021-09-10 08:25:21 +00:00
Eren Gölge	8b7e094bde	Implement `forward_tts` - Generic API for feed-forward TTS models (FastPitch, SpeedySpeech) - Tests for `forward-tts` - Edit FastPitchConfig and SpeedySpeechConfig to use `forward_tts`	2021-09-10 08:24:33 +00:00
Eren Gölge	4761853c5c	Fix imports	2021-09-08 13:34:40 +00:00
Eren Gölge	e72c265cd4	Fix linter issues	2021-09-06 15:16:58 +00:00
Eren Gölge	fd287aa438	Update loader tests for dict return	2021-09-06 15:16:58 +00:00
Eren Gölge	2c4bbbf9b9	Use pyworld for pitch	2021-09-06 15:16:58 +00:00
Eren Gölge	076d0cb258	Add tests for certain FastPitch functions	2021-09-06 15:16:58 +00:00
Eren Gölge	d63a6bb690	Set BaseDatasetConfig for tests	2021-09-06 15:16:58 +00:00
Eren Gölge	fba257104d	Compute F0 using librosa	2021-09-06 15:16:58 +00:00
Katsuya Iida	165e5814af	Update Japanese phonemizer (#758 ) * Update default ja vocoder * update * Japanese phonemizer test * Run make style Co-authored-by: Eren Gölge <egolge@coqui.ai>	2021-09-01 09:33:15 +02:00
Eren Gölge	f186856e5d	Add option to sort input sequnce by audio len	2021-08-30 08:10:35 +00:00
Eren Gölge	c312acac7d	Implement VITS model 🚀 VITS model implementation built on Glow TTS and HiFiGAN layers.	2021-08-09 18:02:36 +00:00
Eren Gölge	003e5579e8	Enable `custom_symbols` in text processing Models can define their own custom symbols lists with custom `make_symbols()`	2021-08-09 18:02:36 +00:00
Eren Gölge	e4648ffef1	Fix multi-speaker init of Tacotron models & tests	2021-08-09 18:02:36 +00:00
Agrin Hilmkil	ced4cfdbbf	Allow saving / loading checkpoints from cloud paths (#683 ) * Allow saving / loading checkpoints from cloud paths Allows saving and loading checkpoints directly from cloud paths like Amazon S3 (s3://) and Google Cloud Storage (gs://) by using fsspec. Note: The user will have to install the relevant dependency for each protocol. Otherwise fsspec will fail and specify which dependency is missing. * Append suffix _fsspec to save/load function names * Add a lower bound to the fsspec dependency Skips the 0 major version. * Add missing changes from refactor * Use fsspec for remaining artifacts * Add test case with path requiring fsspec * Avoid writing logs to file unless output_path is local * Document the possibility of using paths supported by fsspec * Fix style and lint * Add missing lint fixes * Add type annotations to new functions * Use Coqpit method for converting config to dict * Fix type annotation in semi-new function * Add return type for load_fsspec * Fix bug where fs not always created * Restore the experiment removal functionality	2021-08-09 18:02:36 +00:00
Eren Gölge	75b201c6c1	Merge pull request #673 from coqui-ai/fix_stopnet Fix stopnet training for Tacotron models	2021-07-24 12:25:38 +02:00
Eren Gölge	fc0c4600bd	Fix stopnet training	2021-07-24 11:39:54 +02:00
Eren Gölge	30eed347b6	Merge pull request #581 from Edresson/dev Compute speaker embeddings in batch for the LSTM Speaker Encoder and Compute embeddings/ finding chars using config file.	2021-07-23 17:22:51 +02:00
WeberJulian	c79a82ed07	refix linter	2021-07-13 23:12:18 +02:00
Edresson	2e5baffa9c	Merge fix and eval split as argparse	2021-07-13 01:47:32 -03:00
Edresson	4eac1c4651	bug fix on train_encoder and unit tests	2021-07-11 12:00:39 -03:00
Eren Gölge	1e9538aaef	Add more model tests to `test_synthesize`	2021-07-04 11:45:49 +02:00
Eren Gölge	47b3b10d6d	Bump up to v0.1.0 🚀	2021-06-29 13:07:59 +02:00
Eren Gölge	9790eddada	Fix wrong argument name 🛠️	2021-06-28 17:03:47 +02:00
Eren Gölge	626c9d41e6	Update tests for the new trainer API	2021-06-28 17:03:19 +02:00
Eren Gölge	fcfd95669a	Update model test configs	2021-06-28 17:03:19 +02:00
Eren Gölge	00c82c516d	rename to	2021-06-28 17:03:19 +02:00
Eren Gölge	fdfb18d230	downsize melgan test model size	2021-06-28 17:03:19 +02:00
Eren Gölge	25238e0658	fix glow-tts `inference()`	2021-06-28 17:03:19 +02:00
Eren Gölge	82582993cc	use one testing sentence in tts tests	2021-06-28 17:03:19 +02:00
Eren Gölge	419735f440	refactor and fix multi-speaker training in Trainer and Tacotron models	2021-06-28 17:03:19 +02:00
Eren Gölge	269e5a734e	add max_decoder_steps argument to tacotron models	2021-06-28 17:03:19 +02:00
Eren Gölge	304d60197b	reduce multiband melgan test model size	2021-06-28 17:03:19 +02:00
Eren Gölge	db6a97d1a2	rename external speaker embedding arguments as `d_vectors`	2021-06-28 17:03:19 +02:00
Eren Gölge	877bf66b61	reduce size of the metadata.csv used at testing	2021-06-28 17:03:19 +02:00
Eren Gölge	87c61d210a	update test to be less demanding	2021-06-28 17:03:19 +02:00
Eren Gölge	6d6896fd99	reduce fullband-melgan test model size	2021-06-28 17:03:19 +02:00
Eren Gölge	1443d03af1	update test for the new input output API of the tts models	2021-06-28 17:03:19 +02:00
Eren Gölge	ef4ea9e527	update imports for `formatters`	2021-06-28 17:03:19 +02:00
Eren Gölge	6c495c6a6e	fix glow-tts inference and forward functions for handling `cond_input` and refactor its test	2021-06-28 17:03:19 +02:00
Eren Gölge	b500338faa	make style	2021-06-28 17:03:19 +02:00
Eren Gölge	d25f017b42	update `setup_model.py` imports	2021-06-28 17:03:19 +02:00
Eren Gölge	7dff6be871	update tts training tests to use the trainer	2021-06-28 17:03:19 +02:00
Eren Gölge	9134c7dfb6	update `sequence_mask` import globally	2021-06-28 17:03:19 +02:00
Eren Gölge	8def3c87af	trainer-API updates	2021-06-28 17:03:19 +02:00
Eren Gölge	42554cc711	rename MyDataset -> TTSDataset	2021-06-28 17:03:19 +02:00
Edresson Casanova	eb84bb2bc8	Merge branch 'dev' into dev	2021-06-26 15:32:19 -03:00
Eren Gölge	6c7bbcaef0	Use `en-us` for testing phoneme models	2021-06-25 16:52:17 +02:00
Michael Hansen	a41f53fe72	Fix silly error in tests	2021-06-25 14:41:35 +02:00
Michael Hansen	3f172b84d8	Fix linting issues	2021-06-25 14:41:31 +02:00
Michael Hansen	4d8426fa0a	Use eSpeak IPA lexicons by default for phoneme models	2021-06-25 14:41:05 +02:00
Michael Hansen	47191f3ecc	Add tests for gruut phonemization	2021-06-25 14:41:05 +02:00
Edresson	28bec238ca	fix Lint checks	2021-06-18 14:33:50 -03:00
Edresson	83644056e3	fix Lint checks	2021-06-18 14:32:28 -03:00
Eren Gölge	db48c69f0f	reduce fullband melgan model size for testing	2021-06-02 11:44:53 +02:00
Eren Gölge	49c5e5d820	maket style japanese PR	2021-06-02 11:44:46 +02:00
Eren Gölge	0c14278c30	reorg test files	2021-06-02 11:40:26 +02:00
Eren Gölge	73b4083c6c	Merge pull request #502 from kaiidams/kaiidams/kokoro Japanese Tacotron 2 model	2021-06-02 10:20:08 +02:00
Katsuya Iida	1cc18d1972	Move unittest of Japanese phonemizer.	2021-06-01 18:51:34 +09:00
Eren Gölge	bec85ac58d	make style	2021-05-31 16:37:15 +02:00
Eren Gölge	301c516abd	Merge branch 'dev' of https://github.com/coqui-ai/TTS into dev	2021-05-31 15:46:25 +02:00
Edresson	cc192b6843	add resnet speaker encoder train unit test	2021-05-29 22:43:41 -03:00
Eren Gölge	925c08cf95	replace unidecode with anyascii	2021-05-27 14:02:44 +02:00
Edresson	bc5307caa0	add unit tests for SoftmaxAngleProtoLoss and ResnetSpeakerEncoder and bugfix	2021-05-26 20:35:58 -03:00
Edresson	7a9a27282a	fix unit tests	2021-05-26 18:14:06 -03:00
Eren Gölge	f5a9950a2e	phoneme files for testing	2021-05-19 03:16:57 +02:00
Eren Gölge	9b706c5583	enable phonemes in test to match the attention masks	2021-05-19 03:15:54 +02:00
Eren Gölge	8a7c40736c	set use_phonemes false	2021-05-19 01:27:26 +02:00
Eren Gölge	faedea4b60	set use_phonemes False in configs	2021-05-19 00:08:07 +02:00
Eren Gölge	ccfaa6b1d5	add `needs_phonemizer` field to models.json. If set true these models are only compatible with v0.0.13 or below.	2021-05-18 17:57:28 +02:00
Eren Gölge	a14fcf2a13	remove text_processing test	2021-05-18 17:57:28 +02:00

1 2 3 4 5 ...

420 Commits