Eren Gölge
cfaa51fddc
Update BaseTTS config
2022-02-25 11:11:35 +01:00
Eren Gölge
4cd690e4c1
Updates BaseTTS and configs
2022-02-25 10:57:35 +01:00
Eren Gölge
3eca5ad060
Update config fields for phonemizer
2022-02-25 10:48:03 +01:00
Edresson Casanova
28a7464975
Fix the bug in split dataset function (#1251)
* Fix the bug in split_dataset
* Make eval_split_size configurable
* Change test_loader to use load_tts_samples function
* Rename eval_split_portion to eval_split_size and allow setting the absolute number of samples in eval
* Fix samplers unit test
* Add data unit test on GitHub workflow
2022-02-21 11:59:36 +03:00
WeberJulian
9cfbacc622
Fix trailing space
2021-12-20 11:54:10 +00:00
WeberJulian
6b03943526
Move multilingual logic out of the trainer
2021-12-20 11:54:10 +00:00
Edresson
45d0b04179
Lint fixes
2021-12-20 11:54:10 +00:00
WeberJulian
da6c1e858c
Fix small issues
2021-12-20 11:54:10 +00:00
WeberJulian
3b5592abcf
Fix VITS test
2021-12-20 11:54:10 +00:00
Edresson
690b37d0ab
Add support to use the speaker encoder as loss function in VITS model
2021-12-20 11:54:09 +00:00
Edresson
3fbbebd74d
Fix pylint issues
2021-12-20 11:54:09 +00:00
Edresson
ac9416fb86
Add multilingual inference support
2021-12-20 11:54:09 +00:00
Eren Gölge
faafea4cf2
Fix style
2021-11-04 17:04:40 +01:00
Eren Gölge
d6d780e758
Fix FastSpeech config
2021-11-01 16:41:15 +01:00
Eren Gölge
00becf2671
Fix import statements
2021-10-25 19:29:16 +02:00
Eren Gölge
e62d3c5cf7
Use absolute imports for tts configs and models
2021-10-21 16:29:06 +00:00
Eren Gölge
82fed4add2
Make style
2021-10-21 16:05:51 +00:00
Eren Gölge
3ab009ca8d
Edit model configs for multi-speaker
2021-10-21 13:51:37 +00:00
Eren Gölge
a0a5d580e9
Approximate audio length from file size
2021-10-18 08:54:02 +00:00
Eren Gölge
073a2d2eb0
Refactor VITS multi-speaker initialization
2021-10-15 10:20:00 +00:00
Eren Gölge
2766dd1d6e
Fix #813 - GlowTTS training (#814)
* Fix #813
* Update glow_tts recipe
* Fix glow-tts test
* Linter fix
* Run data dep init only in training
2021-09-17 20:06:55 +02:00
Eren Gölge
1ea011571a
Update SpeedySpeech config
2021-09-12 15:33:27 +00:00
Eren Gölge
cbbc9e0172
Add FastSpeechConfig
2021-09-11 10:20:37 +00:00
Eren Gölge
66732025e1
Add `base_model` field to `forward_tts` configs
2021-09-10 17:23:48 +00:00
Eren Gölge
8b7e094bde
Implement `forward_tts`
- Generic API for feed-forward TTS models (FastPitch, SpeedySpeech)
- Tests for `forward-tts`
- Edit FastPitchConfig and SpeedySpeechConfig to use `forward_tts`
2021-09-10 08:24:33 +00:00
Eren Gölge
91a70e80b2
Refactor TTSDataset
Return a dict by `collate`
Refactor batch handling in `collate`
A couple of bug fixes
2021-09-06 15:16:58 +00:00
Eren Gölge
debf772ec5
Implement binary alignment loss
2021-09-06 15:16:58 +00:00
Eren Gölge
6e9d4062f2
Add `sort_by_audio_len` option
2021-09-06 15:16:58 +00:00
Eren Gölge
e429afbce4
Enable aligner for FastPitch
2021-09-06 15:16:58 +00:00
Eren Gölge
81c228a2d8
Update FastPitch: don't detach duration network inputs
2021-09-06 15:16:58 +00:00
Eren Gölge
57b3aec1b9
Update docstring format
2021-09-06 15:16:58 +00:00
Eren Gölge
7692bfe7f8
Update FastPitch config
2021-09-06 15:16:58 +00:00
Eren Gölge
bc396c393f
Add FastPitch model and FastPitchConfig
2021-09-06 15:16:58 +00:00
Eren Gölge
f186856e5d
Add option to sort input sequence by audio len
2021-08-30 08:10:35 +00:00
Eren Gölge
2620f62ea8
Move duration_loss inside VitsGeneratorLoss
2021-08-27 07:07:07 +00:00
Eren Gölge
49e1181ea4
Fixes for the vits model
2021-08-26 17:15:09 +00:00
Eren Gölge
3ab8cef99e
Fix VITS model SDP
2021-08-18 14:55:46 +00:00
Eren Gölge
6a7275881d
Add VitsConfig docstring
2021-08-09 18:02:36 +00:00
Eren Gölge
c312acac7d
Implement VITS model 🚀
VITS model implementation built on Glow TTS and HiFiGAN layers.
2021-08-09 18:02:36 +00:00
Eren Gölge
bd4e29b4dd
Add `compute_linear_spec=False` to `BaseTTSConfig`
2021-08-09 18:02:36 +00:00
Eren Gölge
0fa6a8c9b8
Fix glow tts default parameters
2021-07-02 10:44:23 +02:00
Eren Gölge
2e1a428b83
Update glowtts docstrings and docs
2021-06-30 14:30:55 +02:00
Eren Gölge
786170fe7d
Update tts model configs
2021-06-28 17:03:19 +02:00
Eren Gölge
269e5a734e
add max_decoder_steps argument to tacotron models
2021-06-28 17:03:19 +02:00
Eren Gölge
b500338faa
make style
2021-06-28 17:03:19 +02:00
Eren Gölge
fc9a0fb8ce
update align_tts_config for the trainer
2021-06-28 17:03:19 +02:00
Eren Gölge
06ee57d816
update `speedy_speech_config.py` for the trainer
2021-06-28 17:03:19 +02:00
Eren Gölge
b2218e882a
update `glow_tts_config.py` for setting the optimizer and the scheduler
2021-06-28 17:03:19 +02:00
Eren Gölge
535a458f40
update Tacotron models for the trainer
2021-06-28 17:03:19 +02:00
Eren Gölge
d09385808a
set test_sentences in config
2021-06-28 17:03:19 +02:00