Enno Hermann
49fcbd908b
fix(espeak_wrapper): avoid stuck process on windows
...
Fixes #24
2024-05-29 07:39:03 +02:00
Enno Hermann
203f60f1e1
refactor(espeak_wrapper): remove sync argument
...
_espeak_exe is always called with sync=True, so remove code for sync==False
2024-05-28 21:30:55 +02:00
Enno Hermann
7df4c2fa47
fix: restore TTS.__version__ attribute
...
This is used by the TTS/bin/collect_env_info.py script with which users print
version information for bug reports. We restore the TTS.__version__ attribute so
that old versions of the script still work.
2024-05-28 09:35:55 +02:00
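When the version number is stored only in pyproject.toml, one common way to keep a `__version__` attribute available is to read it from the installed package metadata. The following is a minimal sketch of that idea and not necessarily what this commit does; the helper name `read_version` and the fallback string are hypothetical.

```python
from importlib.metadata import PackageNotFoundError, version


def read_version(dist_name: str, fallback: str = "unknown") -> str:
    """Return the installed version of `dist_name`, or `fallback` if it is not installed."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        # Happens e.g. when running from a source checkout without installing.
        return fallback


# In a package's __init__.py this would typically be used as:
# __version__ = read_version("<distribution name>")
```

Scripts that only read `package.__version__` keep working with this approach, wherever the version is actually stored.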
Enno Hermann
df088e99df
Merge pull request #19 from idiap/toml
...
Move from setup.py to pyproject.toml, simplify requirements
2024-05-27 08:59:09 +01:00
Enno Hermann
642cbd472f
Merge pull request #26 from idiap/server-output
...
fix(server): ensure logging output gets actually shown
2024-05-26 09:08:27 +01:00
Enno Hermann
ab7d84bf05
refactor(server): address linter issues
2024-05-23 08:42:21 +02:00
Enno Hermann
8503500d9d
chore(server): remove duplicate code
2024-05-20 12:45:47 +02:00
Enno Hermann
70bd84894d
fix(server): ensure logging output gets actually shown
2024-05-20 12:45:41 +02:00
Enno Hermann
018f1e6453
docs(bark): update docstrings and type hints
2024-05-15 22:56:55 +02:00
Enno Hermann
59a6c9fdf2
fix(bark): add missing argument for load_voice()
...
Fixes https://github.com/coqui-ai/TTS/issues/2795
2024-05-15 22:56:28 +02:00
Enno Hermann
6d563af623
chore: remove obsolete code for torch<2
...
Minimum torch version is 2.1 now.
2024-05-08 18:08:40 +02:00
Enno Hermann
865a48156d
fix: make korean g2p deps optional
2024-05-08 18:08:40 +02:00
Enno Hermann
55ed162f2a
fix: make chinese g2p deps optional
2024-05-08 18:08:40 +02:00
Enno Hermann
ea893c3795
fix: make bangla g2p deps optional
2024-05-08 18:08:40 +02:00
Enno Hermann
ec50006855
style: run pre-commit
...
Automatic changes from: pre-commit run --all-files
2024-05-08 12:17:47 +02:00
Enno Hermann
fb92e13ebb
build: remove unused/obsolete code
2024-05-08 12:13:41 +02:00
Enno Hermann
259d8fc40b
build: store version in pyproject.toml
2024-05-07 18:27:55 +02:00
Enno Hermann
962f9bbbcf
refactor(espeak_wrapper): fix ruff lint suggestions
2024-05-01 13:31:39 +02:00
Enno Hermann
7b2289a454
fix(espeak_wrapper): capture stderr separately
...
Fixes https://github.com/coqui-ai/TTS/issues/2728
Previously, error messages from espeak were treated as normal output and also
converted to phonemes. This captures and logs them separately.
2024-05-01 12:31:49 +02:00
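The idea of keeping a subprocess's error messages out of its normal output can be sketched with the standard library. This is an illustration only, not the wrapper's actual code: `run_and_split` is a hypothetical helper, and the child process is a Python one-liner standing in for espeak.

```python
import subprocess
import sys


def run_and_split(cmd: list[str]) -> tuple[str, str]:
    """Run `cmd` and return (stdout, stderr) as two separate decoded strings."""
    proc = subprocess.run(cmd, capture_output=True, text=True, check=False)
    return proc.stdout, proc.stderr


# Stand-in child process that writes to both streams (not espeak itself).
child = [
    sys.executable,
    "-c",
    "import sys; print('phonemes'); print('warning', file=sys.stderr)",
]
out, err = run_and_split(child)
# `out` can now be parsed as phonemes, while `err` can be logged separately.
```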
Enno Hermann
06304504d2
Merge pull request #11 from idiap/py312
...
build: add python 3.12 support
2024-04-23 13:52:00 +02:00
Enno Hermann
2675e743b0
chore: update version to 0.23.1
...
[ci skip]
2024-04-23 09:57:43 +02:00
Enno Hermann
52a52b5e21
fix(LanguageManager): allow initialisation from config with language ids file
...
Previously, running `LanguageManager.init_from_config(config)` would never use
the `language_ids_file` if that field is present because it was overwritten in
the next line with a new manager that manually parses languages from the
datasets in the config. Now that is only used as a fallback.
2024-04-19 11:57:27 +02:00
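The "explicit file first, dataset parsing only as a fallback" order described in this commit can be sketched as follows. This is a simplified illustration and does not mirror the real `LanguageManager` class; the function `load_language_ids` and the dataset dictionaries are hypothetical.

```python
import json
from pathlib import Path
from typing import Optional


def load_language_ids(language_ids_file: Optional[str], datasets: list[dict]) -> dict[str, int]:
    """Prefer an explicit language ids file; otherwise derive ids from the datasets."""
    if language_ids_file:
        # The configured file wins and is not overwritten afterwards.
        return json.loads(Path(language_ids_file).read_text(encoding="utf-8"))
    # Fallback: collect the languages named in the datasets and number them.
    languages = sorted({d["language"] for d in datasets if d.get("language")})
    return {lang: idx for idx, lang in enumerate(languages)}
```

The bug described above corresponds to running the fallback branch unconditionally after the file had already been loaded.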
Enno Hermann
f7d69cc1d7
chore: update version to 0.23.0
2024-04-11 17:01:09 +02:00
Enno Hermann
b3c9685aee
fix(tokenizer): add debug logging
2024-04-11 16:58:12 +02:00
Enno Hermann
2ad790d169
Merge pull request #4 from idiap/hindi
...
feat(xtts): support Hindi for sentence-splitting and fine-tuning
2024-04-11 16:49:44 +02:00
Enno Hermann
dfbe0168e9
Merge pull request #3 from idiap/logging
...
Use Python logging instead of print()
2024-04-11 08:34:44 +02:00
Enno Hermann
d41686502e
feat(xtts): support hindi for sentence-splitting and fine-tuning
...
The XTTS model itself already supports Hindi; it was just missing in these components.
2024-04-08 15:57:56 +02:00
Enno Hermann
aa40fd277b
docs: update links
2024-04-04 18:21:57 +02:00
Enno Hermann
e689fd1d4a
fix(utils.manage): remove bare except, improve messages
2024-04-03 15:19:45 +02:00
Enno Hermann
7dc5d1eb3d
fix: logging in executables
2024-04-03 15:19:45 +02:00
Enno Hermann
ab64844aba
feat(utils.generic_utils): add custom formatter for logging to console
2024-04-03 15:19:45 +02:00
Enno Hermann
9b2d48f8a6
feat(utils.generic_utils): improve setup_logger() arguments and output
2024-04-03 15:19:45 +02:00
Enno Hermann
b711e19cb6
refactor: remove verbose arguments
...
Can be handled by adjusting logging levels instead.
2024-04-03 15:19:45 +02:00
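Replacing a `verbose` argument with logging levels can look like the sketch below. The logger name and the `process` function are hypothetical and only illustrate the pattern; they are not taken from the repository.

```python
import logging

logger = logging.getLogger("demo.component")  # hypothetical logger name


def process(items: list[str]) -> int:
    """Count items, reporting progress via logging instead of a verbose flag."""
    for item in items:
        logger.debug("processing %s", item)
    logger.info("processed %d items", len(items))
    return len(items)


# Callers choose verbosity by level instead of passing verbose=True/False.
logging.basicConfig(level=logging.WARNING)
logger.setLevel(logging.INFO)  # show info messages, hide debug messages
```

With this pattern the function signature stays the same regardless of how much output the caller wants.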
Enno Hermann
b6ab85a050
fix: use logging instead of print statements
...
Fixes #1691
2024-04-03 15:19:45 +02:00
Enno Hermann
dd3768d4b1
chore: update version to v0.22.1
2024-04-03 12:31:39 +02:00
Enno Hermann
d772724125
fix: update repository links, package names, metadata
2024-04-03 12:02:44 +02:00
Enno Hermann
7630abb43f
refactor(bin.find_unique_chars): use existing function
2024-03-30 22:22:40 +01:00
Enno Hermann
adbcba06da
refactor(dataset): get audio length with torchaudio
...
Removes a (GPL) dependency
2024-03-14 20:48:29 +01:00
Enno Hermann
e5c6da1c98
Merge pull request #20 from eginhard/return-complex
...
fix: torch.stft will soon require return_complex=True
2024-03-13 13:50:21 +01:00
Enno Hermann
e95f8950eb
fix: torch.stft will soon require return_complex=True
...
Refactor that removes the deprecation warning:
torch.view_as_real(torch.stft(*, return_complex=True)) is equal to
torch.stft(*, return_complex=False)
https://pytorch.org/docs/stable/generated/torch.stft.html
2024-03-13 12:06:27 +01:00
Enno Hermann
89a061f1d1
docs(tts.models.vits): clarify use of discriminator/generator
...
[ci skip]
2024-03-12 18:59:05 +01:00
Enno Hermann
a7753708fb
refactor: remove duplicate methods available in Trainer
2024-03-12 15:06:42 +01:00
Enno Hermann
7673f282be
build: make dependencies for server optional
2024-03-10 20:16:00 +01:00
Enno Hermann
d80f7f4eba
Fix fairseq (#11)
...
* fix fairseq mode
* Added line to fix fairseq model issue and made code cleaner.
---------
Co-authored-by: akgupta1337 <akgupta1337@gmail.com>
2024-03-09 16:43:42 +01:00
Enno Hermann
2e8f47a33d
Merge pull request #10 from eginhard/fix-pinyin
...
fix chinese pinyin phonemes
2024-03-09 16:23:28 +01:00
Enno Hermann
309f39a45f
fix(xtts_manager): name_to_id() should return dict
...
This is how the other embedding managers work
2024-03-08 14:47:00 +01:00
Enno Hermann
1aef5ff091
Merge pull request #7 from eginhard/pin-black
...
Pin black for consistent outputs
2024-03-07 17:32:02 +01:00
Enno Hermann
ed8740a39b
Merge pull request #6 from eginhard/fix-bark-url
...
Fix bark model url
2024-03-07 11:50:46 +01:00
Enno Hermann
efdafd5a7f
style: run black
2024-03-07 11:46:51 +01:00
Enno Hermann
f6464d7682
Merge pull request #5 from eginhard/fix-list-models
...
Fix TTS().list_models()
2024-03-07 08:01:29 +01:00
Greer
02d88b5dec
Fix TTS().list_models()
2024-03-06 23:24:02 +01:00
Enno Hermann
017c84d005
style: make style && make lint
2024-03-06 22:45:35 +01:00
Enno Hermann
4e183c61df
fix(api): handle missing attribute in is_multilingual
2024-03-06 22:41:32 +01:00
Enno Hermann
e05243c4c8
refactor: read/write csv files with standard library
2024-03-06 16:18:09 +01:00
Enno Hermann
24298da5fc
Merge pull request #1 from eginhard/lint-overhaul
...
Lint overhaul (pylint to ruff)
2024-03-06 16:10:26 +01:00
Enno Hermann
04d8d4b09a
chore: remove unused imports
2024-03-06 13:27:43 +01:00
Nick Potafiy
dbf1a08a0d
Update generic_utils.py (#3561)
...
Handles cases when git branch produces no output or invalid output. Right now, it just crashes with `StopIteration`
2024-02-10 11:20:58 -03:00
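A `StopIteration` of this kind typically comes from calling `next()` on a generator that yields nothing. A minimal sketch of a tolerant parser, assuming the usual `git branch` output where the current branch is marked with `*`; the function `current_branch` is hypothetical and not the code from `generic_utils.py`.

```python
from typing import Optional


def current_branch(git_branch_output: str, default: Optional[str] = None) -> Optional[str]:
    """Return the branch marked with '*' in `git branch` output, or `default`."""
    lines = (line.strip() for line in git_branch_output.splitlines())
    # Passing a default to next() avoids StopIteration when no line matches.
    marked = next((line for line in lines if line.startswith("*")), None)
    if marked is None:
        return default
    return marked.lstrip("*").strip() or default
```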
wangjie
b184e9f0fe
fix chinese pinyin phonemes
2024-01-12 09:11:56 +08:00
Ivan Peevski
08e00e4b49
Fix bark model
2024-01-08 14:45:04 +10:30
Edresson Casanova
5dcc16d193
Bug fix in MP3 and FLAC compute length on TTSDataset (#3092)
...
* Bug Fix on XTTS load
* Bug fix in MP3 length on TTSDataset
* Update TTS/tts/datasets/dataset.py
Co-authored-by: Aarni Koskela <akx@iki.fi>
* Uses mutagen for all audio formats
* Add dataloader test with all supported audio formats
* Use mutagen.File
* Update
* Fix aux unit tests
* Bug fix on unit tests
---------
Co-authored-by: Aarni Koskela <akx@iki.fi>
2023-12-27 13:23:43 -03:00
Eren Gölge
55c7063724
Merge pull request #3423 from idiap/fix-aux-tests
...
Fix CI (save best model after 0 steps in tests)
2023-12-14 18:00:30 +01:00
Aarni Koskela
d6ea806469
Run `make style`
2023-12-13 14:56:41 +02:00
Aarni Koskela
bd172dabbf
xtts/stream_generator: remove duplicate import + code
2023-12-13 14:56:41 +02:00
Aarni Koskela
32abb1a7c4
xtts/perceiver_encoder: Delete duplicate exists()
2023-12-13 14:56:41 +02:00
Aarni Koskela
aa549e9028
Fix trailing whitespace
2023-12-13 14:56:41 +02:00
Aarni Koskela
4584ef6580
Simplify branch in TTS/bin/synthesize.py
2023-12-13 14:56:41 +02:00
Aarni Koskela
08fa5d4098
Fix implicitly concatenated docstring
2023-12-13 14:56:41 +02:00
Aarni Koskela
33b69c6c09
Add some noqa directives (for now)
2023-12-13 14:56:41 +02:00
Aarni Koskela
00f8f4892a
Ruff autofix unnecessary passes
2023-12-13 14:56:41 +02:00
Aarni Koskela
bc2cf296a3
Ruff autofix PLW3301
2023-12-13 14:56:41 +02:00
Aarni Koskela
64bb41f4fa
Ruff autofix C41
2023-12-13 14:56:41 +02:00
Aarni Koskela
449820ec7d
Ruff autofix E71*
2023-12-13 14:56:41 +02:00
Aarni Koskela
90991e89b4
Ruff autofix unused imports and import order
2023-12-13 14:56:41 +02:00
Aarni Koskela
72ac2bfa09
Get rid of some star imports
2023-12-13 14:56:41 +02:00
Eren Gölge
fa28f99f15
Update to v0.22.0
2023-12-12 16:10:46 +01:00
Eren Gölge
8c1a8b522b
Merge pull request #3405 from coqui-ai/studio_speakers
...
Add studio speakers to open source XTTS!
2023-12-12 16:10:09 +01:00
Enno Hermann
9f325b1f6c
fixup! Fix aux unit tests
2023-12-12 16:07:16 +01:00
Edresson Casanova
fc099218df
Fix aux unit tests
2023-12-12 16:07:16 +01:00
Eren Gölge
934b87bbd1
Merge pull request #3391 from aaron-lii/multi-gpu
...
support multiple GPU training for XTTS
2023-12-12 13:51:26 +01:00
Eren Gölge
8e6a7cbfbf
Update .models.json
2023-12-12 13:50:01 +01:00
Eren Gölge
4dc0722bbc
Update .models.json
2023-12-12 13:28:16 +01:00
WeberJulian
61b67ef16f
Fix read_json_with_comments
2023-12-11 23:58:52 +01:00
WeberJulian
d47b6df4e5
Make comments in .model.json valid
2023-12-11 23:35:27 +01:00
WeberJulian
b40750d1f5
Remove models that require app.coqui.ai
2023-12-11 23:17:54 +01:00
WeberJulian
5ab228dff2
Fix CI
2023-12-11 22:31:53 +01:00
WeberJulian
8c20a599d8
Remove coqui studio integration from TTS
2023-12-11 22:11:46 +01:00
WeberJulian
5cd750ac7e
Fix API and CI
2023-12-11 20:21:53 +01:00
WeberJulian
e3c9dab7a3
Make CLI work
2023-12-11 18:49:18 +01:00
WeberJulian
0a90359a42
rename speaker file
2023-12-11 18:48:49 +01:00
WeberJulian
a5c0d9780f
rename manager
2023-12-11 18:48:31 +01:00
WeberJulian
36143fee26
Add basic speaker manager
2023-12-11 15:25:46 +01:00
Frederico S. Oliveira
f9117918fe
Update .models.json
2023-12-11 10:47:31 -03:00
Frederico S. Oliveira
163f9a3fdf
Merge branch 'coqui-ai:dev' into dev
2023-12-11 10:04:07 -03:00
WeberJulian
0a136a8535
Download speaker file
2023-12-11 11:29:36 +01:00
Aaron-Li
b6e929696a
support multiple GPU training
2023-12-08 16:55:32 +08:00
Josh Meyer
759d9ab3ae
Print message for either commercial license or CPML
2023-12-07 13:54:48 +01:00
Eren Gölge
e49c512d99
Merge pull request #3351 from aaron-lii/chinese-puncs
...
fix pause problem of Chinese speech
2023-12-04 15:57:42 +01:00
Eren Gölge
2d02015978
Update to v0.21.3
2023-12-01 23:52:57 +01:00
Edresson Casanova
5f900f156a
Add XTTS Fine tuning gradio demo (#3296)
...
* Add XTTS FT demo data processing pipeline
* Add training and inference columns
* Uses tabs instead of columns
* Fix demo freezing issue
* Update demo
* Convert stereo to mono
* Bug fix on XTTS inference
* Update gradio demo
* Update gradio demo
* Update gradio demo
* Update gradio demo
* Add parameters to be able to set them on colab demo
* Add error messages
* Add intuitive error messages
* Update
* Add max_audio_length parameter
* Add XTTS fine-tuner docs
* Update XTTS finetuner docs
* Delete trainer to free memory
* Delete unused variables
* Add gc.collect()
* Update xtts.md
---------
Co-authored-by: Eren Gölge <erogol@hotmail.com>
2023-12-01 23:52:23 +01:00
Aaron-Li
7b8808186a
fix pause problem of Chinese speech
2023-12-01 23:30:03 +08:00