Enno Hermann
d772724125
fix: update repository links, package names, metadata
2024-04-03 12:02:44 +02:00
Enno Hermann
7630abb43f
refactor(bin.find_unique_chars): use existing function
2024-03-30 22:22:40 +01:00
Enno Hermann
adbcba06da
refactor(dataset): get audio length with torchaudio
...
Removes a (GPL) dependency
2024-03-14 20:48:29 +01:00
Enno Hermann
e5c6da1c98
Merge pull request #20 from eginhard/return-complex
...
fix: torch.stft will soon require return_complex=True
2024-03-13 13:50:21 +01:00
Enno Hermann
e95f8950eb
fix: torch.stft will soon require return_complex=True
...
Refactor that removes the deprecation warning:
torch.view_as_real(torch.stft(*, return_complex=True)) is equal to
torch.stft(*, return_complex=False)
https://pytorch.org/docs/stable/generated/torch.stft.html
2024-03-13 12:06:27 +01:00
Enno Hermann
89a061f1d1
docs(tts.models.vits): clarify use of discriminator/generator
...
[ci skip]
2024-03-12 18:59:05 +01:00
Enno Hermann
2e8f47a33d
Merge pull request #10 from eginhard/fix-pinyin
...
fix chinese pinyin phonemes
2024-03-09 16:23:28 +01:00
Enno Hermann
309f39a45f
fix(xtts_manager): name_to_id() should return dict
...
This is how the other embedding managers work
2024-03-08 14:47:00 +01:00
Enno Hermann
efdafd5a7f
style: run black
2024-03-07 11:46:51 +01:00
Enno Hermann
017c84d005
style: make style && make lint
2024-03-06 22:45:35 +01:00
Enno Hermann
e05243c4c8
refactor: read/write csv files with standard library
2024-03-06 16:18:09 +01:00
Enno Hermann
24298da5fc
Merge pull request #1 from eginhard/lint-overhaul
...
Lint overhaul (pylint to ruff)
2024-03-06 16:10:26 +01:00
wangjie
b184e9f0fe
fix chinese pinyin phonemes
2024-01-12 09:11:56 +08:00
Edresson Casanova
5dcc16d193
Bug fix in MP3 and FLAC compute length on TTSDataset ( #3092 )
...
* Bug Fix on XTTS load
* Bug fix in MP3 length on TTSDataset
* Update TTS/tts/datasets/dataset.py
Co-authored-by: Aarni Koskela <akx@iki.fi>
* Uses mutagen for all audio formats
* Add dataloader test with all supported audio formats
* Use mutagen.File
* Update
* Fix aux unit tests
* Bug fix on unit tests
---------
Co-authored-by: Aarni Koskela <akx@iki.fi>
2023-12-27 13:23:43 -03:00
Aarni Koskela
d6ea806469
Run `make style`
2023-12-13 14:56:41 +02:00
Aarni Koskela
bd172dabbf
xtts/stream_generator: remove duplicate import + code
2023-12-13 14:56:41 +02:00
Aarni Koskela
32abb1a7c4
xtts/perceiver_encoder: Delete duplicate exists()
2023-12-13 14:56:41 +02:00
Aarni Koskela
aa549e9028
Fix trailing whitespace
2023-12-13 14:56:41 +02:00
Aarni Koskela
33b69c6c09
Add some noqa directives (for now)
2023-12-13 14:56:41 +02:00
Aarni Koskela
00f8f4892a
Ruff autofix unnecessary passes
2023-12-13 14:56:41 +02:00
Aarni Koskela
bc2cf296a3
Ruff autofix PLW3301
2023-12-13 14:56:41 +02:00
Aarni Koskela
64bb41f4fa
Ruff autofix C41
2023-12-13 14:56:41 +02:00
Aarni Koskela
449820ec7d
Ruff autofix E71*
2023-12-13 14:56:41 +02:00
Aarni Koskela
90991e89b4
Ruff autofix unused imports and import order
2023-12-13 14:56:41 +02:00
Eren Gölge
8c1a8b522b
Merge pull request #3405 from coqui-ai/studio_speakers
...
Add studio speakers to open source XTTS!
2023-12-12 16:10:09 +01:00
Eren Gölge
934b87bbd1
Merge pull request #3391 from aaron-lii/multi-gpu
...
support multiple GPU training for XTTS
2023-12-12 13:51:26 +01:00
WeberJulian
5cd750ac7e
Fix API and CI
2023-12-11 20:21:53 +01:00
WeberJulian
e3c9dab7a3
Make CLI work
2023-12-11 18:49:18 +01:00
WeberJulian
a5c0d9780f
rename manager
2023-12-11 18:48:31 +01:00
WeberJulian
36143fee26
Add basic speaker manager
2023-12-11 15:25:46 +01:00
Frederico S. Oliveira
163f9a3fdf
Merge branch 'coqui-ai:dev' into dev
2023-12-11 10:04:07 -03:00
Aaron-Li
b6e929696a
support multiple GPU training
2023-12-08 16:55:32 +08:00
Eren Gölge
e49c512d99
Merge pull request #3351 from aaron-lii/chinese-puncs
...
fix pause problem of Chinese speech
2023-12-04 15:57:42 +01:00
Edresson Casanova
5f900f156a
Add XTTS Fine tuning gradio demo ( #3296 )
...
* Add XTTS FT demo data processing pipeline
* Add training and inference columns
* Uses tabs instead of columns
* Fix demo freezing issue
* Update demo
* Convert stereo to mono
* Bug fix on XTTS inference
* Update gradio demo
* Update gradio demo
* Update gradio demo
* Update gradio demo
* Add parameters so they can be set in the Colab demo
* Add error messages
* Add intuitive error messages
* Update
* Add max_audio_length parameter
* Add XTTS fine-tuner docs
* Update XTTS finetuner docs
* Delete trainer to freeze memory
* Delete unused variables
* Add gc.collect()
* Update xtts.md
---------
Co-authored-by: Eren Gölge <erogol@hotmail.com>
2023-12-01 23:52:23 +01:00
Aaron-Li
7b8808186a
fix pause problem of Chinese speech
2023-12-01 23:30:03 +08:00
Frederico S. Oliveira
bcd500fa7b
Fixing bug
...
Correction in training the Fastspeech/Fastspeech2/FastPitch/SpeedySpeech model using external speaker embedding.
2023-11-30 17:27:05 -03:00
Enno Hermann
39321d02be
fix: correctly strip/restore initial punctuation ( #3336 )
...
* refactor(punctuation): remove orphan code for handling lone punctuation
The case of lone punctuation is already handled at the top of restore(). The
removed if statement would never be called and would in fact raise an
AttributeError because the _punc_index named tuple doesn't have the attribute
`mark`.
* refactor(punctuation): remove unused argument
* fix(punctuation): correctly handle initial punctuation
Stripping and restoring initial punctuation didn't work correctly because the
string-splitting caused an additional empty string to be inserted in the text
list (because `".A".split(".")` => `["", "A"]`). Now, an initial empty string is
skipped and relevant test cases are added.
Fixes #3333
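
The split behaviour described above can be reproduced with the standard library alone. This is a minimal sketch, not the project's actual punctuation code: the helper name `split_skipping_initial_empty` is hypothetical, and it only shows why a leading punctuation mark yields an empty first element and how skipping it avoids the extra entry.

```python
def split_skipping_initial_empty(text, mark):
    """Split `text` on `mark`, dropping the empty string that str.split
    produces when `text` starts with `mark`.

    Hypothetical helper for illustration only.
    """
    parts = text.split(mark)
    # ".A".split(".") gives ["", "A"]: the empty string stands for the
    # (non-existent) text before the leading mark.
    if parts and parts[0] == "":
        parts = parts[1:]
    return parts


raw = ".A".split(".")
cleaned = split_skipping_initial_empty(".A", ".")
print(raw)      # ['', 'A']
print(cleaned)  # ['A']
```

Text without initial punctuation is unaffected: `split_skipping_initial_empty("A.B", ".")` returns the same list as `"A.B".split(".")`.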
2023-11-30 13:03:16 +01:00
Eren Gölge
3b8894a3dd
Make style
2023-11-27 14:15:50 +01:00
Eren Gölge
11ec9f7471
Add hi in config defaults
2023-11-24 15:38:36 +01:00
Eren Gölge
32065139e7
Simple text cleaner for "hi"
2023-11-24 15:14:34 +01:00
Enno Hermann
2af0220996
fix: don't pass quotes to espeak ( #3286 )
...
Previously, the text was wrapped in an additional set of quotes that was passed
to Espeak. This could result in different phonemization in certain edge cases and
caused the insertion of an initial separator "_" that had to be removed.
Compare:
$ espeak-ng -q -b 1 -v en-us --ipa=1 '"A"'
_ˈɐ
$ espeak-ng -q -b 1 -v en-us --ipa=1 'A'
ˈeɪ
Fixes #2619
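
One way to avoid the extra quotes is to pass the text as its own element of an argument list instead of embedding it in a quoted shell string. The sketch below is an assumption about how such a call could be built, not the project's actual wrapper: `build_espeak_args` is a hypothetical helper, and only the flags shown in the commands above are used.

```python
import shlex


def build_espeak_args(text, voice="en-us"):
    """Build an espeak-ng argument list with the text passed verbatim.

    Hypothetical helper for illustration. Because the text is a separate
    list element, no additional quote characters reach espeak-ng.
    """
    return ["espeak-ng", "-q", "-b", "1", "-v", voice, "--ipa=1", text]


args = build_espeak_args("A")
# shlex.join renders the list as a shell command line for inspection.
print(shlex.join(args))
# To run it (requires espeak-ng to be installed):
# import subprocess
# subprocess.run(args, capture_output=True, text=True, check=True)
```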
2023-11-24 12:25:37 +01:00
Edresson Casanova
11283fce07
Ensures that only GPT model is in training mode during XTTS GPT training ( #3241 )
...
* Ensures that only GPT model is in training mode during training
* Fix parallel wavegan unit test
2023-11-17 15:13:46 +01:00
Eren Gölge
44880f09ed
Make style
2023-11-17 13:43:34 +01:00
Eren Gölge
26efdf6ee7
Make k_diffusion optional
2023-11-17 13:42:33 +01:00
Julian Weber
fbc18b8c34
Fix zh bug ( #3238 )
2023-11-16 17:51:37 +01:00
Julian Weber
675f983550
Add sentence splitting ( #3227 )
...
* Add sentence splitting
* update requirements
* update default args v2
* Add spanish
* Fix return gpt_latents
* Update requirements
* Fix requirements
2023-11-16 11:01:11 +01:00
Edresson Casanova
73a5bd08c0
Fix XTTS GPT padding and inference issues ( #3216 )
...
* Fix end artifact for fine tuning models
* Bug fix on zh-cn inference
* Remove unused code
2023-11-15 14:02:05 +01:00
Julian Weber
04901fb2e4
Add speed control for inference ( #3214 )
...
* Add speed control for inference
* Fix XTTS tests
* Add speed control tests
2023-11-14 16:07:17 +01:00
Eren Gölge
ac3df409a6
Merge pull request #3208 from coqui-ai/fix_max_mel_len
...
fix max generation length for XTTS
2023-11-13 14:32:56 +01:00
Eren Gölge
92fa988aec
Fixup
2023-11-13 13:44:06 +01:00