Commit Graph

2092 Commits

Author SHA1 Message Date
Enno Hermann 642cbd472f
Merge pull request #26 from idiap/server-output
fix(server): ensure logging output gets actually shown
2024-05-26 09:08:27 +01:00
Enno Hermann ab7d84bf05 refactor(server): address linter issues 2024-05-23 08:42:21 +02:00
Enno Hermann 8503500d9d chore(server): remove duplicate code 2024-05-20 12:45:47 +02:00
Enno Hermann 70bd84894d fix(server): ensure logging output gets actually shown 2024-05-20 12:45:41 +02:00
Enno Hermann 018f1e6453 docs(bark): update docstrings and type hints 2024-05-15 22:56:55 +02:00
Enno Hermann 59a6c9fdf2 fix(bark): add missing argument for load_voice()
Fixes https://github.com/coqui-ai/TTS/issues/2795
2024-05-15 22:56:28 +02:00
Enno Hermann 6d563af623 chore: remove obsolete code for torch<2
Minimum torch version is 2.1 now.
2024-05-08 18:08:40 +02:00
Enno Hermann 865a48156d fix: make korean g2p deps optional 2024-05-08 18:08:40 +02:00
Enno Hermann 55ed162f2a fix: make chinese g2p deps optional 2024-05-08 18:08:40 +02:00
Enno Hermann ea893c3795 fix: make bangla g2p deps optional 2024-05-08 18:08:40 +02:00
Enno Hermann ec50006855 style: run pre-commit
Automatic changes from: pre-commit run --all-files
2024-05-08 12:17:47 +02:00
Enno Hermann fb92e13ebb build: remove unused/obsolete code 2024-05-08 12:13:41 +02:00
Enno Hermann 259d8fc40b build: store version in pyproject.toml 2024-05-07 18:27:55 +02:00
Enno Hermann 962f9bbbcf refactor(espeak_wrapper): fix ruff lint suggestions 2024-05-01 13:31:39 +02:00
Enno Hermann 7b2289a454 fix(espeak_wrapper): capture stderr separately
Fixes https://github.com/coqui-ai/TTS/issues/2728

Previously, error messages from espeak were treated as normal output and also
converted to phonemes. This captures and logs them separately.
2024-05-01 12:31:49 +02:00
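The separation described in the espeak_wrapper fix above can be sketched roughly as follows. This is not the wrapper's actual code: `run_tool` is a hypothetical helper, and a Python child process stands in for espeak, so only the general technique (separate pipes for stdout and stderr) is illustrated.

```python
import logging
import subprocess
import sys

logger = logging.getLogger(__name__)


def run_tool(args):
    """Run a command, return its stdout lines, and log stderr separately.

    Hypothetical helper for illustration; not the actual espeak wrapper.
    """
    result = subprocess.run(args, capture_output=True, text=True, check=False)
    # Error messages arrive on their own pipe, so they never mix with the
    # output that is parsed downstream.
    for line in result.stderr.splitlines():
        logger.warning("stderr: %s", line)
    return result.stdout.splitlines()


# A Python child process stands in for espeak here (assumption for the demo).
child = "import sys; print('phoneme output'); print('a warning', file=sys.stderr)"
lines = run_tool([sys.executable, "-c", child])
```

With both streams captured independently, only the stdout lines would be passed on for phoneme conversion, while the warning ends up in the log.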
Enno Hermann 06304504d2
Merge pull request #11 from idiap/py312
build: add python 3.12 support
2024-04-23 13:52:00 +02:00
Enno Hermann 2675e743b0 chore: update version to 0.23.1
[ci skip]
2024-04-23 09:57:43 +02:00
Enno Hermann 52a52b5e21 fix(LanguageManager): allow initialisation from config with language ids file
Previously, running `LanguageManager.init_from_config(config)` would never use
the `language_ids_file` if that field is present because it was overwritten in
the next line with a new manager that manually parses languages from the
datasets in the config. Now that is only used as a fallback.
2024-04-19 11:57:27 +02:00
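The overwrite described in the LanguageManager fix above follows a common pattern, sketched here under assumptions. `Manager`, the dict-based config, and the `dataset_languages` key are hypothetical stand-ins and do not reproduce the real `LanguageManager` API.

```python
class Manager:
    """Hypothetical stand-in for a language manager (illustration only)."""

    def __init__(self, ids_file=None, languages=None):
        self.ids_file = ids_file
        self.languages = languages or []

    @staticmethod
    def init_from_config_buggy(config):
        manager = None
        if config.get("language_ids_file"):
            manager = Manager(ids_file=config["language_ids_file"])
        # Bug pattern: this unconditional assignment discards the manager above.
        manager = Manager(languages=config.get("dataset_languages", []))
        return manager

    @staticmethod
    def init_from_config_fixed(config):
        if config.get("language_ids_file"):
            return Manager(ids_file=config["language_ids_file"])
        # Fallback: only parse languages from the datasets when no file is set.
        return Manager(languages=config.get("dataset_languages", []))


config = {"language_ids_file": "language_ids.json", "dataset_languages": ["en", "de"]}
buggy = Manager.init_from_config_buggy(config)
fixed = Manager.init_from_config_fixed(config)
```

In this sketch, `buggy.ids_file` stays `None` even though the config names a file, while `fixed` keeps the file and only falls back to the dataset languages when no file is given.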
Enno Hermann f7d69cc1d7 chore: update version to 0.23.0 2024-04-11 17:01:09 +02:00
Enno Hermann b3c9685aee fix(tokenizer): add debug logging 2024-04-11 16:58:12 +02:00
Enno Hermann 2ad790d169
Merge pull request #4 from idiap/hindi
feat(xtts): support Hindi for sentence-splitting and fine-tuning
2024-04-11 16:49:44 +02:00
Enno Hermann dfbe0168e9
Merge pull request #3 from idiap/logging
Use Python logging instead of print()
2024-04-11 08:34:44 +02:00
Enno Hermann d41686502e feat(xtts): support hindi for sentence-splitting and fine-tuning
The XTTS model itself already supports Hindi; support was just missing in these components.
2024-04-08 15:57:56 +02:00
Enno Hermann aa40fd277b docs: update links 2024-04-04 18:21:57 +02:00
Enno Hermann e689fd1d4a fix(utils.manage): remove bare except, improve messages 2024-04-03 15:19:45 +02:00
Enno Hermann 7dc5d1eb3d fix: logging in executables 2024-04-03 15:19:45 +02:00
Enno Hermann ab64844aba feat(utils.generic_utils): add custom formatter for logging to console 2024-04-03 15:19:45 +02:00
Enno Hermann 9b2d48f8a6 feat(utils.generic_utils): improve setup_logger() arguments and output 2024-04-03 15:19:45 +02:00
Enno Hermann b711e19cb6 refactor: remove verbose arguments
Can be handled by adjusting logging levels instead.
2024-04-03 15:19:45 +02:00
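How logging levels can take over the role of a `verbose` argument, as the commit above suggests, might look like the following sketch. The logger name, `load_items`, and `ListHandler` are hypothetical and exist only for this illustration.

```python
import logging

logger = logging.getLogger("demo.component")  # hypothetical logger name


def load_items(items):
    """Report progress through logging rather than a `verbose` flag."""
    for item in items:
        logger.debug("loading %s", item)
    logger.info("loaded %d items", len(items))
    return list(items)


class ListHandler(logging.Handler):
    """Collects log messages so the effect of the level is visible."""

    def __init__(self):
        super().__init__()
        self.messages = []

    def emit(self, record):
        self.messages.append(record.getMessage())


handler = ListHandler()
logger.addHandler(handler)

logger.setLevel(logging.INFO)   # comparable to verbose=False
load_items(["a", "b"])
quiet = list(handler.messages)

handler.messages.clear()
logger.setLevel(logging.DEBUG)  # comparable to verbose=True
load_items(["a", "b"])
detailed = list(handler.messages)
```

The caller decides how much detail to see by setting the level once, instead of threading a flag through every function signature.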
Enno Hermann b6ab85a050 fix: use logging instead of print statements
Fixes #1691
2024-04-03 15:19:45 +02:00
Enno Hermann dd3768d4b1 chore: update version to v0.22.1 2024-04-03 12:31:39 +02:00
Enno Hermann d772724125 fix: update repository links, package names, metadata 2024-04-03 12:02:44 +02:00
Enno Hermann 7630abb43f refactor(bin.find_unique_chars): use existing function 2024-03-30 22:22:40 +01:00
Enno Hermann adbcba06da refactor(dataset): get audio length with torchaudio
Removes a (GPL) dependency
2024-03-14 20:48:29 +01:00
Enno Hermann e5c6da1c98
Merge pull request #20 from eginhard/return-complex
fix: torch.stft will soon require return_complex=True
2024-03-13 13:50:21 +01:00
Enno Hermann e95f8950eb fix: torch.stft will soon require return_complex=True
Refactor that removes the deprecation warning:
torch.view_as_real(torch.stft(*, return_complex=True)) is equal to
torch.stft(*, return_complex=False)

https://pytorch.org/docs/stable/generated/torch.stft.html
2024-03-13 12:06:27 +01:00
Enno Hermann 89a061f1d1 docs(tts.models.vits): clarify use of discriminator/generator
[ci skip]
2024-03-12 18:59:05 +01:00
Enno Hermann a7753708fb refactor: remove duplicate methods available in Trainer 2024-03-12 15:06:42 +01:00
Enno Hermann 7673f282be build: make dependencies for server optional 2024-03-10 20:16:00 +01:00
Enno Hermann d80f7f4eba
Fix fairseq (#11)
* fix fairseq mode

* Added line to fix fairseq model issue and made code cleaner.

---------

Co-authored-by: akgupta1337 <akgupta1337@gmail.com>
2024-03-09 16:43:42 +01:00
Enno Hermann 2e8f47a33d
Merge pull request #10 from eginhard/fix-pinyin
fix chinese pinyin phonemes
2024-03-09 16:23:28 +01:00
Enno Hermann 309f39a45f fix(xtts_manager): name_to_id() should return dict
This is how the other embedding managers work
2024-03-08 14:47:00 +01:00
Enno Hermann 1aef5ff091
Merge pull request #7 from eginhard/pin-black
Pin black for consistent outputs
2024-03-07 17:32:02 +01:00
Enno Hermann ed8740a39b
Merge pull request #6 from eginhard/fix-bark-url
Fix bark model url
2024-03-07 11:50:46 +01:00
Enno Hermann efdafd5a7f style: run black 2024-03-07 11:46:51 +01:00
Enno Hermann f6464d7682
Merge pull request #5 from eginhard/fix-list-models
Fix TTS().list_models()
2024-03-07 08:01:29 +01:00
Greer 02d88b5dec Fix TTS().list_models() 2024-03-06 23:24:02 +01:00
Enno Hermann 017c84d005 style: make style && make lint 2024-03-06 22:45:35 +01:00
Enno Hermann 4e183c61df fix(api): handle missing attribute in is_multilingual 2024-03-06 22:41:32 +01:00
Enno Hermann e05243c4c8 refactor: read/write csv files with standard library 2024-03-06 16:18:09 +01:00
Enno Hermann 24298da5fc
Merge pull request #1 from eginhard/lint-overhaul
Lint overhaul (pylint to ruff)
2024-03-06 16:10:26 +01:00
Enno Hermann 04d8d4b09a chore: remove unused imports 2024-03-06 13:27:43 +01:00
Nick Potafiy dbf1a08a0d
Update generic_utils.py (#3561)
Handles cases where `git branch` produces no output or invalid output. Right now, it just crashes with `StopIteration`.
2024-02-10 11:20:58 -03:00
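The `StopIteration` crash mentioned above typically comes from calling `next()` on an iterator that yields nothing. A minimal sketch of the failure and one possible guard follows; `current_branch` and its `"unknown"` fallback are hypothetical, and the real function in generic_utils.py may differ.

```python
def current_branch(git_branch_output):
    """Return the branch marked with '*' in `git branch` output, or a fallback.

    Hypothetical helper for illustration only.
    """
    marked = (line for line in git_branch_output.splitlines() if line.startswith("*"))
    # next() with a default avoids StopIteration when no line matches.
    line = next(marked, None)
    if line is None:
        return "unknown"
    return line.replace("* ", "", 1)


# Without a default, next() on an empty iterator raises StopIteration.
try:
    next(iter([]))
    crashed = False
except StopIteration:
    crashed = True

branch = current_branch("  dev\n* main\n")
fallback = current_branch("")
```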
wangjie b184e9f0fe fix chinese pinyin phonemes 2024-01-12 09:11:56 +08:00
Ivan Peevski 08e00e4b49
Fix bark model 2024-01-08 14:45:04 +10:30
Edresson Casanova 5dcc16d193
Bug fix in MP3 and FLAC compute length on TTSDataset (#3092)
* Bug Fix on XTTS load

* Bug fix in MP3 length on TTSDataset

* Update TTS/tts/datasets/dataset.py

Co-authored-by: Aarni Koskela <akx@iki.fi>

* Uses mutagen for all audio formats

* Add dataloader test with all supported audio formats

* Use mutagen.File

* Update

* Fix aux unit tests

* Bug fix on unit tests

---------

Co-authored-by: Aarni Koskela <akx@iki.fi>
2023-12-27 13:23:43 -03:00
Eren Gölge 55c7063724
Merge pull request #3423 from idiap/fix-aux-tests
Fix CI (save best model after 0 steps in tests)
2023-12-14 18:00:30 +01:00
Aarni Koskela d6ea806469 Run `make style` 2023-12-13 14:56:41 +02:00
Aarni Koskela bd172dabbf xtts/stream_generator: remove duplicate import + code 2023-12-13 14:56:41 +02:00
Aarni Koskela 32abb1a7c4 xtts/perceiver_encoder: Delete duplicate exists() 2023-12-13 14:56:41 +02:00
Aarni Koskela aa549e9028 Fix trailing whitespace 2023-12-13 14:56:41 +02:00
Aarni Koskela 4584ef6580 Simplify branch in TTS/bin/synthesize.py 2023-12-13 14:56:41 +02:00
Aarni Koskela 08fa5d4098 Fix implicitly concatenated docstring 2023-12-13 14:56:41 +02:00
Aarni Koskela 33b69c6c09 Add some noqa directives (for now) 2023-12-13 14:56:41 +02:00
Aarni Koskela 00f8f4892a Ruff autofix unnecessary passes 2023-12-13 14:56:41 +02:00
Aarni Koskela bc2cf296a3 Ruff autofix PLW3301 2023-12-13 14:56:41 +02:00
Aarni Koskela 64bb41f4fa Ruff autofix C41 2023-12-13 14:56:41 +02:00
Aarni Koskela 449820ec7d Ruff autofix E71* 2023-12-13 14:56:41 +02:00
Aarni Koskela 90991e89b4 Ruff autofix unused imports and import order 2023-12-13 14:56:41 +02:00
Aarni Koskela 72ac2bfa09 Get rid of some star imports 2023-12-13 14:56:41 +02:00
Eren Gölge fa28f99f15
Update to v0.22.0 2023-12-12 16:10:46 +01:00
Eren Gölge 8c1a8b522b
Merge pull request #3405 from coqui-ai/studio_speakers
Add studio speakers to open source XTTS!
2023-12-12 16:10:09 +01:00
Enno Hermann 9f325b1f6c fixup! Fix aux unit tests 2023-12-12 16:07:16 +01:00
Edresson Casanova fc099218df Fix aux unit tests 2023-12-12 16:07:16 +01:00
Eren Gölge 934b87bbd1
Merge pull request #3391 from aaron-lii/multi-gpu
support multiple GPU training for XTTS
2023-12-12 13:51:26 +01:00
Eren Gölge 8e6a7cbfbf
Update .models.json 2023-12-12 13:50:01 +01:00
Eren Gölge 4dc0722bbc
Update .models.json 2023-12-12 13:28:16 +01:00
WeberJulian 61b67ef16f Fix read_json_with_comments 2023-12-11 23:58:52 +01:00
WeberJulian d47b6df4e5 Make comments in .model.json valid 2023-12-11 23:35:27 +01:00
WeberJulian b40750d1f5 Remove models that require app.coqui.ai 2023-12-11 23:17:54 +01:00
WeberJulian 5ab228dff2 Fix CI 2023-12-11 22:31:53 +01:00
WeberJulian 8c20a599d8 Remove coqui studio integration from TTS 2023-12-11 22:11:46 +01:00
WeberJulian 5cd750ac7e Fix API and CI 2023-12-11 20:21:53 +01:00
WeberJulian e3c9dab7a3 Make CLI work 2023-12-11 18:49:18 +01:00
WeberJulian 0a90359a42 rename speaker file 2023-12-11 18:48:49 +01:00
WeberJulian a5c0d9780f rename manager 2023-12-11 18:48:31 +01:00
WeberJulian 36143fee26 Add basic speaker manager 2023-12-11 15:25:46 +01:00
Frederico S. Oliveira f9117918fe
Update .models.json 2023-12-11 10:47:31 -03:00
Frederico S. Oliveira 163f9a3fdf
Merge branch 'coqui-ai:dev' into dev 2023-12-11 10:04:07 -03:00
WeberJulian 0a136a8535 Download speaker file 2023-12-11 11:29:36 +01:00
Aaron-Li b6e929696a support multiple GPU training 2023-12-08 16:55:32 +08:00
Josh Meyer 759d9ab3ae
Print message for either commercial license or CPML 2023-12-07 13:54:48 +01:00
Eren Gölge e49c512d99
Merge pull request #3351 from aaron-lii/chinese-puncs
fix pause problem of Chinese speech
2023-12-04 15:57:42 +01:00
Eren Gölge 2d02015978
Update to v0.21.3 2023-12-01 23:52:57 +01:00
Edresson Casanova 5f900f156a
Add XTTS Fine tuning gradio demo (#3296)
* Add XTTS FT demo data processing pipeline

* Add training and inference columns

* Uses tabs instead of columns

* Fix demo freezing issue

* Update demo

* Convert stereo to mono

* Bug fix on XTTS inference

* Update gradio demo

* Update gradio demo

* Update gradio demo

* Update gradio demo

* Add parameters to be able to set them on colab demo

* Add error messages

* Add intuitive error messages

* Update

* Add max_audio_length parameter

* Add XTTS fine-tuner docs

* Update XTTS finetuner docs

* Delete trainer to free memory

* Delete unused variables

* Add gc.collect()

* Update xtts.md

---------

Co-authored-by: Eren Gölge <erogol@hotmail.com>
2023-12-01 23:52:23 +01:00
Aaron-Li 7b8808186a fix pause problem of Chinese speech 2023-12-01 23:30:03 +08:00
Frederico S. Oliveira bcd500fa7b Fixing bug
Fixes training of the Fastspeech/Fastspeech2/FastPitch/SpeedySpeech models when using external speaker embeddings.
2023-11-30 17:27:05 -03:00
Frederico S. Oliveira a26e51b0b4
Merge branch 'coqui-ai:dev' into dev 2023-11-30 14:19:05 -03:00
Eren Gölge 6d1905c2b7
Update to v0.21.2 2023-11-30 13:05:10 +01:00
Enno Hermann 39321d02be
fix: correctly strip/restore initial punctuation (#3336)
* refactor(punctuation): remove orphan code for handling lone punctuation

The case of lone punctuation is already handled at the top of restore(). The
removed if statement would never be called and would in fact raise an
AttributeError because the _punc_index named tuple doesn't have the attribute
`mark`.

* refactor(punctuation): remove unused argument

* fix(punctuation): correctly handle initial punctuation

Stripping and restoring initial punctuation didn't work correctly because the
string-splitting caused an additional empty string to be inserted in the text
list (because `".A".split(".")` => `["", "A"]`). Now, an initial empty string is
skipped and relevant test cases are added.

Fixes #3333
2023-11-30 13:03:16 +01:00
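The string-splitting behaviour behind the punctuation fix above can be reproduced in isolation. The sketch below only demonstrates the empty leading element and one way to skip it; `drop_initial_empty` is a hypothetical helper, not the project's actual punctuation code.

```python
text = ".A"
parts = text.split(".")  # a leading separator yields an empty first element


def drop_initial_empty(pieces):
    """Skip the empty string produced by a leading separator (illustrative)."""
    return pieces[1:] if pieces and pieces[0] == "" else pieces


cleaned = drop_initial_empty(parts)
```

Here `parts` is `["", "A"]` and `cleaned` is `["A"]`. Input without leading punctuation passes through unchanged.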