coqui-tts

Commit Graph

Author	SHA1	Message	Date
Eren Gölge	cbbc9e0172	Add FastSpeechConfig	2021-09-11 10:20:37 +00:00
Eren Gölge	d97952611d	Remove unused import	2021-09-10 17:31:41 +00:00
Eren Gölge	d5f256b34c	Update tacotron `r` init	2021-09-10 17:26:23 +00:00
Eren Gölge	ab37fa9c39	Edit AlignTTS	2021-09-10 17:25:00 +00:00
Eren Gölge	66732025e1	Add `base_model` field to `forward_tts` configs	2021-09-10 17:23:48 +00:00
Eren Gölge	a89eb12aca	Fix glow_tts imports	2021-09-10 08:29:51 +00:00
Eren Gölge	0541a25e90	Remove `fastpitch.py` and `speedy_speech.py`	2021-09-10 08:27:48 +00:00
Eren Gölge	3c16013199	Fix Vits imports	2021-09-10 08:26:34 +00:00
Eren Gölge	8b7e094bde	Implement `forward_tts` - Generic API for feed-forward TTS models (FastPitch, SpeedySpeech) - Tests for `forward-tts` - Edit FastPitchConfig and SpeedySpeechConfig to use `forward_tts`	2021-09-10 08:24:33 +00:00
Eren Gölge	bfc6ceac29	Move MAS to `TTS.tts.utils.helpers`	2021-09-09 10:57:19 +00:00
Eren Gölge	4761853c5c	Fix imports	2021-09-08 13:34:40 +00:00
Eren Gölge	c1513ec4cd	Plot pitch over spectrogram	2021-09-06 15:16:58 +00:00
Eren Gölge	d847a68e42	Reformat multi-speaker handling in GlowTTS	2021-09-06 15:16:58 +00:00
Eren Gölge	8d41060d36	Plot unnormalized pitch by `FastPitch`	2021-09-06 15:16:58 +00:00
Eren Gölge	2b59da802c	Fix loader setup in `base_tts`	2021-09-06 15:16:58 +00:00
Eren Gölge	2bf9e83c49	FastPitch refactor and commenting	2021-09-06 15:16:58 +00:00
Eren Gölge	648655fa03	Add `PitchExtractor` and return dict by `collate`	2021-09-06 15:16:58 +00:00
Eren Gölge	59d52a4cd8	Disable autcast for criterions	2021-09-06 15:16:58 +00:00
Eren Gölge	98a7271ce8	Refactor FastPitchv2	2021-09-06 15:16:58 +00:00
Eren Gölge	e429afbce4	Enable aligner for FastPitch	2021-09-06 15:16:58 +00:00
Eren Gölge	81c228a2d8	Update FastPitch don't detach duration network inputs	2021-09-06 15:16:58 +00:00
Eren Gölge	ca29033ef4	Refactor FastPitch model	2021-09-06 15:16:58 +00:00
Eren Gölge	5d59100a88	Don't use align_score for models with duration predictor	2021-09-06 15:16:58 +00:00
Eren Gölge	b7caad39e0	Make optional to detach duration predictor input	2021-09-06 15:16:58 +00:00
Eren Gölge	bc396c393f	Add FastPitch model and FastPitchconfig	2021-09-06 15:16:58 +00:00
Eren Gölge	e802b24ad0	Compute mean and std pitch	2021-09-06 15:16:58 +00:00
Eren Gölge	d085642ac1	Cache pitch features Cache the features at the beginning of `BaseTTS` training.	2021-09-06 15:16:58 +00:00
Eren Gölge	7590c7db7a	Fix `base_tacotron` `aux_input` handling	2021-09-06 15:16:58 +00:00
Eren Gölge	994f2be2c1	Add comput_f0 field	2021-09-06 15:16:58 +00:00
Eren Gölge	2b7e55f01f	Fix vits args types	2021-08-30 23:24:20 +00:00
Eren Gölge	18da8f5dbd	Update pylint 2.10.2 and fix lint issues	2021-08-30 08:10:35 +00:00
Eren Gölge	f186856e5d	Add option to sort input sequnce by audio len	2021-08-30 08:10:35 +00:00
Eren Gölge	2620f62ea8	Move duration_loss inside VitsGeneratorLoss	2021-08-27 07:07:07 +00:00
Eren Gölge	49e1181ea4	Fixes for the vits model	2021-08-26 17:15:09 +00:00
Eren Gölge	3ab8cef99e	Fix VITS model SPD	2021-08-18 14:55:46 +00:00
Eren Gölge	7c0d564965	Syncronize DDP processes	2021-08-13 10:40:50 +00:00
Eren Gölge	ecf5f17dca	Fix distribute.py and ddp training	2021-08-12 22:22:32 +00:00
Eren Gölge	c8b9ca3d71	Fix Tacotron num_char init	2021-08-10 08:56:34 +00:00
Eren Gölge	6af03ac476	Fix `num_char` init in Tacotron models	2021-08-09 21:46:15 +00:00
Eren Gölge	06018251e6	Add VITS and GlowTTS class docs 🗒️	2021-08-09 18:02:36 +00:00
Eren Gölge	f7a72552f1	Make duration predictor dropout configurable	2021-08-09 18:02:36 +00:00
Eren Gölge	c312acac7d	Implement VITS model 🚀 VITS model implementation built on Glow TTS and HiFiGAN layers.	2021-08-09 18:02:36 +00:00
Eren Gölge	232a5abb6a	Update `tts.setup_model` Run `model.make_symbols()` if availabe to set the symbol list	2021-08-09 18:02:36 +00:00
Eren Gölge	e4648ffef1	Fix multi-speaker init of Tacotron models & tests	2021-08-09 18:02:36 +00:00
Eren Gölge	01324c8e70	Update `base_tts.py` Enable calling `make_symbols()` from the model if defined. Compatibility changes for end2end `tts` models in batch formatting. Changes in multi-speaker initialization. Modify `test_run()` to work with dict output iof `synthesis`	2021-08-09 18:02:36 +00:00
Agrin Hilmkil	ced4cfdbbf	Allow saving / loading checkpoints from cloud paths (#683 ) * Allow saving / loading checkpoints from cloud paths Allows saving and loading checkpoints directly from cloud paths like Amazon S3 (s3://) and Google Cloud Storage (gs://) by using fsspec. Note: The user will have to install the relevant dependency for each protocol. Otherwise fsspec will fail and specify which dependency is missing. * Append suffix _fsspec to save/load function names * Add a lower bound to the fsspec dependency Skips the 0 major version. * Add missing changes from refactor * Use fsspec for remaining artifacts * Add test case with path requiring fsspec * Avoid writing logs to file unless output_path is local * Document the possibility of using paths supported by fsspec * Fix style and lint * Add missing lint fixes * Add type annotations to new functions * Use Coqpit method for converting config to dict * Fix type annotation in semi-new function * Add return type for load_fsspec * Fix bug where fs not always created * Restore the experiment removal functionality	2021-08-09 18:02:36 +00:00
Eren Gölge	d9e18e009b	Skip phoneme cache pre-compute if the path exists	2021-08-09 18:02:36 +00:00
Eren Gölge	fc0c4600bd	Fix stopnet training	2021-07-24 11:39:54 +02:00
WeberJulian	25832eb97b	Changes for review	2021-07-15 11:38:45 +02:00
WeberJulian	c79a82ed07	refix linter	2021-07-13 23:12:18 +02:00

1 2 3

143 Commits