p0p4k
903a77c197
Update wavenet.py ( #1796 )
...
* Update wavenet.py
Current version does not use "in_channels" argument.
In glowTTS, we use normalizing flows and so "input dim" == "ouput dim" (channels and length). So, the existing code just uses hidden_channel sized tensor as input to first layer as well as outputs hidden_channel sized tensor.
However, since it is a generic implementation, I believe it is better to update it for a more general use.
* "in_channels -> hidden_channels"
2022-08-01 12:20:37 +02:00
p0p4k
10195c4eba
Update decoder.py ( #1792 )
...
Minor comment correction.
2022-07-26 13:06:06 +02:00
Eren Gölge
127118c637
Update TTS.tts formatters ( #1228 )
...
* Return Dict from tts formatters
* Make style
2022-02-11 23:03:43 +01:00
Edresson
12968532fe
Add the language embedding dim in the duration predictor class
2021-12-20 11:54:10 +00:00
Edresson
dcb2374bc9
Add multilingual training support to the VITS model
2021-12-20 11:54:09 +00:00
Eren Gölge
fd95926009
Update GlowTTS
2021-09-30 14:47:56 +00:00
Eren Gölge
2766dd1d6e
Fix #813 - GlowTTS training ( #814 )
...
* Fix #813
* Update glow_tts recipe
* Fix glow-tts test
* Linter fix
* Run data dep init only in training
2021-09-17 20:06:55 +02:00
Eren Gölge
bfc6ceac29
Move MAS to `TTS.tts.utils.helpers`
2021-09-09 10:57:19 +00:00
Eren Gölge
4761853c5c
Fix imports
2021-09-08 13:34:40 +00:00
Eren Gölge
e429afbce4
Enable aligner for FastPitch
2021-09-06 15:16:58 +00:00
Eren Gölge
b81560607b
Update docstrings
2021-09-06 15:16:58 +00:00
Eren Gölge
c312acac7d
Implement VITS model 🚀
...
VITS model implementation built on Glow TTS and HiFiGAN
layers.
2021-08-09 18:02:36 +00:00
Eren Gölge
2e1a428b83
Update glowtts docstrings and docs
2021-06-30 14:30:55 +02:00
Eren Gölge
d42d1c02ea
Use `torch.linalg.qr` for pytorch > `v1.9.0`
2021-06-28 17:03:47 +02:00
Eren Gölge
9134c7dfb6
update `sequence_mask` import globally
2021-06-28 17:03:19 +02:00
Eren Gölge
f519012dea
reformatting and styling
2021-04-12 11:47:39 +02:00
Eren Gölge
2b3e12ea49
correct imports after refactoring, add AlignTTS (old SSMAS) and some formatting
2021-03-30 14:39:16 +02:00
Eren Gölge
9a48ba3821
a ton of linter updates
2021-03-08 05:06:54 +01:00
Eren Gölge
b464cab9b8
setup.py update and pylint fixes
2021-01-26 02:57:50 +01:00
Eren Gölge
660d61aeeb
maximum_path_numpy and CYTHON adabtable import
2021-01-26 02:57:07 +01:00
root
5c87753e88
glow-tts fix for saving inverse weight
2021-01-20 02:09:42 +00:00
erogol
79c841ccd3
mass refactoring and update
2021-01-11 17:26:58 +01:00
erogol
1d961d6f8a
cladd renaming
2021-01-11 17:26:11 +01:00
erogol
b206162d11
more docstrings
2021-01-11 17:25:04 +01:00
erogol
de2a542f83
glow-tts bug fix
2021-01-07 13:40:32 +01:00
erogol
5a45af48f1
fix
2021-01-06 13:19:40 +01:00
erogol
e7fad928e7
doc strings for the all glow-tts layers
2021-01-06 13:19:40 +01:00
erogol
d3b7284be4
glow-tts comments and refactoring
2021-01-06 13:19:40 +01:00
erogol
e82d31b6ac
glow ttss refactoring
2021-01-06 13:19:40 +01:00
erogol
29f4329d7f
update glow-tts layers and add some comments
2021-01-06 13:19:40 +01:00
erogol
4ef083f0f1
select decoder type for SS
2021-01-06 13:19:40 +01:00
erogol
fede46e96e
pylint and test fixes
2021-01-06 13:19:40 +01:00
erogol
d62cac7252
fix glow-tts prenet bug fix
2021-01-06 13:19:40 +01:00
erogol
fa6907fa0e
update glow-tts parameters and fix rel-attn-win size
2021-01-06 13:19:40 +01:00
erogol
7b20d8cbd3
implement residual BN convolution and add it as an alternative encoder for glow-tts. also generic layers to layers/generic
2021-01-06 13:19:40 +01:00
erogol
665f7ca714
linter fix
2020-09-24 12:57:54 +02:00
erogol
10258724d1
linter fixes
2020-09-22 03:54:16 +02:00
erogol
e0b9fa887f
glow-tts modules added
2020-09-21 14:15:40 +02:00
erogol
e4c6386603
change import for normalization layer
2020-09-21 13:09:52 +02:00
erogol
c008003506
do not check sample rate as loading stats file for normalization to enable interpolation for different sample rate vocoder
2020-09-18 12:52:19 +02:00
erogol
3660c57f1e
time seperable convolution encoder, huber loss for duration predictor
2020-09-17 03:10:58 +02:00
erogol
45fbc0d003
convolution encoder with GLU and res connections
2020-09-12 03:40:21 +02:00
erogol
15e6ab3912
glow-tts module renaming updates
2020-09-12 03:33:36 +02:00