coqui-tts

Commit Graph

Author	SHA1	Message	Date
Enno Hermann	49fcbd908b	fix(espeak_wrapper): avoid stuck process on windows Fixes #24	2024-05-29 07:39:03 +02:00
Enno Hermann	203f60f1e1	refactor(espeak_wrapper): remove sync argument _espeak_exe is always called with sync=True, so remove code for sync==False	2024-05-28 21:30:55 +02:00
Enno Hermann	7df4c2fa47	fix: restore TTS.__version__ attribute This is used by the TTS/bin/collect_env_info.py script with which users print version information for bug reports. We restore the TTS.__version__ attribute so that old versions of the script still work.	2024-05-28 09:35:55 +02:00
Enno Hermann	df088e99df	Merge pull request #19 from idiap/toml Move from setup.py to pyproject.toml, simplify requirements	2024-05-27 08:59:09 +01:00
Enno Hermann	642cbd472f	Merge pull request #26 from idiap/server-output fix(server): ensure logging output gets actually shown	2024-05-26 09:08:27 +01:00
Enno Hermann	ab7d84bf05	refactor(server): address linter issues	2024-05-23 08:42:21 +02:00
Enno Hermann	8503500d9d	chore(server): remove duplicate code	2024-05-20 12:45:47 +02:00
Enno Hermann	70bd84894d	fix(server): ensure logging output gets actually shown	2024-05-20 12:45:41 +02:00
Enno Hermann	018f1e6453	docs(bark): update docstrings and type hints	2024-05-15 22:56:55 +02:00
Enno Hermann	59a6c9fdf2	fix(bark): add missing argument for load_voice() Fixes https://github.com/coqui-ai/TTS/issues/2795	2024-05-15 22:56:28 +02:00
Enno Hermann	6d563af623	chore: remove obsolete code for torch<2 Minimum torch version is 2.1 now.	2024-05-08 18:08:40 +02:00
Enno Hermann	865a48156d	fix: make korean g2p deps optional	2024-05-08 18:08:40 +02:00
Enno Hermann	55ed162f2a	fix: make chinese g2p deps optional	2024-05-08 18:08:40 +02:00
Enno Hermann	ea893c3795	fix: make bangla g2p deps optional	2024-05-08 18:08:40 +02:00
Enno Hermann	ec50006855	style: run pre-commit Automatic changes from: pre-commit run --all-files	2024-05-08 12:17:47 +02:00
Enno Hermann	fb92e13ebb	build: remove unused/obsolete code	2024-05-08 12:13:41 +02:00
Enno Hermann	259d8fc40b	build: store version in pyproject.toml	2024-05-07 18:27:55 +02:00
Enno Hermann	962f9bbbcf	refactor(espeak_wrapper): fix ruff lint suggestions	2024-05-01 13:31:39 +02:00
Enno Hermann	7b2289a454	fix(espeak_wrapper): capture stderr separately Fixes https://github.com/coqui-ai/TTS/issues/2728 Previously, error messages from espeak were treated as normal output and also converted to phonemes. This captures and logs them separately.	2024-05-01 12:31:49 +02:00
Enno Hermann	06304504d2	Merge pull request #11 from idiap/py312 build: add python 3.12 support	2024-04-23 13:52:00 +02:00
Enno Hermann	2675e743b0	chore: update version to 0.23.1 [ci skip]	2024-04-23 09:57:43 +02:00
Enno Hermann	52a52b5e21	fix(LanguageManager): allow initialisation from config with language ids file Previously, running `LanguageManager.init_from_config(config)` would never use the `language_ids_file` if that field is present because it was overwritten in the next line with a new manager that manually parses languages from the datasets in the config. Now that is only used as a fallback.	2024-04-19 11:57:27 +02:00
Enno Hermann	f7d69cc1d7	chore: update version to 0.23.0	2024-04-11 17:01:09 +02:00
Enno Hermann	b3c9685aee	fix(tokenizer): add debug logging	2024-04-11 16:58:12 +02:00
Enno Hermann	2ad790d169	Merge pull request #4 from idiap/hindi feat(xtts): support Hindi for sentence-splitting and fine-tuning	2024-04-11 16:49:44 +02:00
Enno Hermann	dfbe0168e9	Merge pull request #3 from idiap/logging Use Python logging instead of print()	2024-04-11 08:34:44 +02:00
Enno Hermann	d41686502e	feat(xtts): support hindi for sentence-splitting and fine-tuning The XTTS model itself already supports Hindi, it was just in these components.	2024-04-08 15:57:56 +02:00
Enno Hermann	aa40fd277b	docs: update links	2024-04-04 18:21:57 +02:00
Enno Hermann	e689fd1d4a	fix(utils.manage): remove bare except, improve messages	2024-04-03 15:19:45 +02:00
Enno Hermann	7dc5d1eb3d	fix: logging in executables	2024-04-03 15:19:45 +02:00
Enno Hermann	ab64844aba	feat(utils.generic_utils): add custom formatter for logging to console	2024-04-03 15:19:45 +02:00
Enno Hermann	9b2d48f8a6	feat(utils.generic_utils): improve setup_logger() arguments and output	2024-04-03 15:19:45 +02:00
Enno Hermann	b711e19cb6	refactor: remove verbose arguments Can be handled by adjusting logging levels instead.	2024-04-03 15:19:45 +02:00
Enno Hermann	b6ab85a050	fix: use logging instead of print statements Fixes #1691	2024-04-03 15:19:45 +02:00
Enno Hermann	dd3768d4b1	chore: update version to v0.22.1	2024-04-03 12:31:39 +02:00
Enno Hermann	d772724125	fix: update repository links, package names, metadata	2024-04-03 12:02:44 +02:00
Enno Hermann	7630abb43f	refactor(bin.find_unique_chars): use existing function	2024-03-30 22:22:40 +01:00
Enno Hermann	adbcba06da	refactor(dataset): get audio length with torchaudio Removes a (GPL) dependency	2024-03-14 20:48:29 +01:00
Enno Hermann	e5c6da1c98	Merge pull request #20 from eginhard/return-complex fix: torch.stft will soon require return_complex=True	2024-03-13 13:50:21 +01:00
Enno Hermann	e95f8950eb	fix: torch.stft will soon require return_complex=True Refactor that removes the deprecation warning: torch.view_as_real(torch.stft(, return_complex=True)) is equal to torch.stft(, return_complex=False) https://pytorch.org/docs/stable/generated/torch.stft.html	2024-03-13 12:06:27 +01:00
Enno Hermann	89a061f1d1	docs(tts.models.vits): clarify use of discriminator/generator [ci skip]	2024-03-12 18:59:05 +01:00
Enno Hermann	a7753708fb	refactor: remove duplicate methods available in Trainer	2024-03-12 15:06:42 +01:00
Enno Hermann	7673f282be	build: make dependencies for server optional	2024-03-10 20:16:00 +01:00
Enno Hermann	d80f7f4eba	Fix fairseq (#11) * fix fairseq mode * Added line to fix fairseq model issue and made code cleaner. --------- Co-authored-by: akgupta1337 <akgupta1337@gmail.com>	2024-03-09 16:43:42 +01:00
Enno Hermann	2e8f47a33d	Merge pull request #10 from eginhard/fix-pinyin fix chinese pinyin phonemes	2024-03-09 16:23:28 +01:00
Enno Hermann	309f39a45f	fix(xtts_manager): name_to_id() should return dict This is how the other embedding managers work	2024-03-08 14:47:00 +01:00
Enno Hermann	1aef5ff091	Merge pull request #7 from eginhard/pin-black Pin black for consistent outputs	2024-03-07 17:32:02 +01:00
Enno Hermann	ed8740a39b	Merge pull request #6 from eginhard/fix-bark-url Fix bark model url	2024-03-07 11:50:46 +01:00
Enno Hermann	efdafd5a7f	style: run black	2024-03-07 11:46:51 +01:00
Enno Hermann	f6464d7682	Merge pull request #5 from eginhard/fix-list-models Fix TTS().list_models()	2024-03-07 08:01:29 +01:00
Greer	02d88b5dec	Fix TTS().list_models()	2024-03-06 23:24:02 +01:00
Enno Hermann	017c84d005	style: make style && make lint	2024-03-06 22:45:35 +01:00
Enno Hermann	4e183c61df	fix(api): handle missing attribute in is_multilingual	2024-03-06 22:41:32 +01:00
Enno Hermann	e05243c4c8	refactor: read/write csv files with standard library	2024-03-06 16:18:09 +01:00
Enno Hermann	24298da5fc	Merge pull request #1 from eginhard/lint-overhaul Lint overhaul (pylint to ruff)	2024-03-06 16:10:26 +01:00
Enno Hermann	04d8d4b09a	chore: remove unused imports	2024-03-06 13:27:43 +01:00
Nick Potafiy	dbf1a08a0d	Update generic_utils.py (#3561) Handles cases when git branch produces no output or invalid output. Right now, it just crashes with `StopIteration`	2024-02-10 11:20:58 -03:00
wangjie	b184e9f0fe	fix chinese pinyin phonemes	2024-01-12 09:11:56 +08:00
Ivan Peevski	08e00e4b49	Fix bark model	2024-01-08 14:45:04 +10:30
Edresson Casanova	5dcc16d193	Bug fix in MP3 and FLAC compute length on TTSDataset (#3092) * Bug Fix on XTTS load * Bug fix in MP3 length on TTSDataset * Update TTS/tts/datasets/dataset.py Co-authored-by: Aarni Koskela <akx@iki.fi> * Uses mutagen for all audio formats * Add dataloader test with all supported audio formats * Use mutagen.File * Update * Fix aux unit tests * Bug fix on unit tests --------- Co-authored-by: Aarni Koskela <akx@iki.fi>	2023-12-27 13:23:43 -03:00
Eren Gölge	55c7063724	Merge pull request #3423 from idiap/fix-aux-tests Fix CI (save best model after 0 steps in tests)	2023-12-14 18:00:30 +01:00
Aarni Koskela	d6ea806469	Run `make style`	2023-12-13 14:56:41 +02:00
Aarni Koskela	bd172dabbf	xtts/stream_generator: remove duplicate import + code	2023-12-13 14:56:41 +02:00
Aarni Koskela	32abb1a7c4	xtts/perceiver_encoder: Delete duplicate exists()	2023-12-13 14:56:41 +02:00
Aarni Koskela	aa549e9028	Fix trailing whitespace	2023-12-13 14:56:41 +02:00
Aarni Koskela	4584ef6580	Simplify branch in TTS/bin/synthesize.py	2023-12-13 14:56:41 +02:00
Aarni Koskela	08fa5d4098	Fix implicitly concatenated docstring	2023-12-13 14:56:41 +02:00
Aarni Koskela	33b69c6c09	Add some noqa directives (for now)	2023-12-13 14:56:41 +02:00
Aarni Koskela	00f8f4892a	Ruff autofix unnecessary passes	2023-12-13 14:56:41 +02:00
Aarni Koskela	bc2cf296a3	Ruff autofix PLW3301	2023-12-13 14:56:41 +02:00
Aarni Koskela	64bb41f4fa	Ruff autofix C41	2023-12-13 14:56:41 +02:00
Aarni Koskela	449820ec7d	Ruff autofix E71*	2023-12-13 14:56:41 +02:00
Aarni Koskela	90991e89b4	Ruff autofix unused imports and import order	2023-12-13 14:56:41 +02:00
Aarni Koskela	72ac2bfa09	Get rid of some star imports	2023-12-13 14:56:41 +02:00
Eren Gölge	fa28f99f15	Update to v0.22.0	2023-12-12 16:10:46 +01:00
Eren Gölge	8c1a8b522b	Merge pull request #3405 from coqui-ai/studio_speakers Add studio speakers to open source XTTS!	2023-12-12 16:10:09 +01:00
Enno Hermann	9f325b1f6c	fixup! Fix aux unit tests	2023-12-12 16:07:16 +01:00
Edresson Casanova	fc099218df	Fix aux unit tests	2023-12-12 16:07:16 +01:00
Eren Gölge	934b87bbd1	Merge pull request #3391 from aaron-lii/multi-gpu support multiple GPU training for XTTS	2023-12-12 13:51:26 +01:00
Eren Gölge	8e6a7cbfbf	Update .models.json	2023-12-12 13:50:01 +01:00
Eren Gölge	4dc0722bbc	Update .models.json	2023-12-12 13:28:16 +01:00
WeberJulian	61b67ef16f	Fix read_json_with_comments	2023-12-11 23:58:52 +01:00
WeberJulian	d47b6df4e5	Make comments in .model.json valid	2023-12-11 23:35:27 +01:00
WeberJulian	b40750d1f5	Remove models that require app.coqui.ai	2023-12-11 23:17:54 +01:00
WeberJulian	5ab228dff2	Fix CI	2023-12-11 22:31:53 +01:00
WeberJulian	8c20a599d8	Remove coqui studio integration from TTS	2023-12-11 22:11:46 +01:00
WeberJulian	5cd750ac7e	Fix API and CI	2023-12-11 20:21:53 +01:00
WeberJulian	e3c9dab7a3	Make CLI work	2023-12-11 18:49:18 +01:00
WeberJulian	0a90359a42	rename speaker file	2023-12-11 18:48:49 +01:00
WeberJulian	a5c0d9780f	rename manager	2023-12-11 18:48:31 +01:00
WeberJulian	36143fee26	Add basic speaker manager	2023-12-11 15:25:46 +01:00
Frederico S. Oliveira	f9117918fe	Update .models.json	2023-12-11 10:47:31 -03:00
Frederico S. Oliveira	163f9a3fdf	Merge branch 'coqui-ai:dev' into dev	2023-12-11 10:04:07 -03:00
WeberJulian	0a136a8535	Download speaker file	2023-12-11 11:29:36 +01:00
Aaron-Li	b6e929696a	support multiple GPU training	2023-12-08 16:55:32 +08:00
Josh Meyer	759d9ab3ae	Print message for either commercial license or CPML	2023-12-07 13:54:48 +01:00
Eren Gölge	e49c512d99	Merge pull request #3351 from aaron-lii/chinese-puncs fix pause problem of Chinese speech	2023-12-04 15:57:42 +01:00
Eren Gölge	2d02015978	Update to v0.21.3	2023-12-01 23:52:57 +01:00
Edresson Casanova	5f900f156a	Add XTTS Fine tuning gradio demo (#3296) * Add XTTS FT demo data processing pipeline * Add training and inference columns * Uses tabs instead of columns * Fix demo freezing issue * Update demo * Convert stereo to mono * Bug fix on XTTS inference * Update gradio demo * Update gradio demo * Update gradio demo * Update gradio demo * Add parameters to be able to set them on colab demo * Add error messages * Add intuitive error messages * Update * Add max_audio_length parameter * Add XTTS fine-tuner docs * Update XTTS finetuner docs * Delete trainer to freeze memory * Delete unused variables * Add gc.collect() * Update xtts.md --------- Co-authored-by: Eren Gölge <erogol@hotmail.com>	2023-12-01 23:52:23 +01:00
Aaron-Li	7b8808186a	fix pause problem of Chinese speech	2023-12-01 23:30:03 +08:00

2096 Commits