coqui-tts

Commit Graph

Author	SHA1	Message	Date
Jindrich Matousek	c312343585	Language of each item (sample/utterance) is set to dataset language only when not defined at the sample/utterance level Speaker name is prepended by dataset name in case of multispeaker datasets Refactor "artic" formatter	2023-09-06 17:05:47 +02:00
Jindrich Matousek	a0db2eeee8	Fix: add `is_eval` when calling `get_sampler` to respect training/validation	2023-09-06 13:59:24 +02:00
Jindrich Matousek	0938f1cfa1	Merge branch 'coqui-ai:main' into main	2023-09-04 14:27:19 +02:00
Eren Gölge	9533f8656c	Make style	2023-09-04 13:58:37 +02:00
Eren Gölge	562a9509f2	Add BE model	2023-09-04 13:57:03 +02:00
Eren Gölge	b4c82685a7	Add model entries	2023-09-04 13:04:58 +02:00
T145	cdc971ff74	Fixed spectrogram checking on librosa 0.10.x (#2899 )	2023-09-04 12:58:27 +02:00
Cohee	b3b1555d82	Fix exception handling in manage.py (#2912 )	2023-09-04 12:54:30 +02:00
Eren G??lge	33b5e87b56	Merge branch 'dev' into main	2023-09-04 12:52:38 +02:00
Eren G??lge	40b527345f	Bump up to v0.16.6	2023-09-04 12:51:53 +02:00
Eren Gölge	d1d95707bd	Update docs (#2919 )	2023-09-04 12:28:36 +02:00
Unik	32b8ebb633	Updated scipy version (#2914 )	2023-09-04 11:39:19 +02:00
Aleś Bułojčyk	fead04f779	Add phonemizer for Belarusian language (#2856 )	2023-08-28 11:20:45 +02:00
Jake Tae	b79b6f0762	feature: add device flag to tts cli (#2875 )	2023-08-28 11:20:12 +02:00
Jake Tae	fa0cbd78fe	Update README with new device API (#2876 ) * docs: update readme w/ .to(device) api * docs: add .to(device) in python quickstart * docs: move section header out of comment * chore: use device instead of hard-coded string * docs: update inference.md	2023-08-28 11:19:00 +02:00
Jindrich Matousek	5504e13570	Merge branch 'coqui-ai:main' into main	2023-08-26 16:34:47 +02:00
Eren Gölge	530a8939fe	Merge pull request #2894 from coqui-ai/dev v0.16.5	2023-08-26 12:00:48 +02:00
Eren Gölge	c0b5e61749	Bump up to v0.16.5	2023-08-26 12:00:25 +02:00
Eren Gölge	a7a96d08dd	Fix loading Bark (#2893 ) * Fixup hubert path * Make style	2023-08-26 11:59:00 +02:00
Eren Gölge	04a36a727b	Bump up to v0.16.4	2023-08-26 10:39:48 +02:00
Eren Gölge	a96562a750	Update .models.json	2023-08-26 10:36:40 +02:00
Jindrich Matousek	37807fef8b	Add vctk_wav formatter: it is the same as vctk but uses wav extension instead of flac	2023-08-23 11:52:14 +01:00
Jindrich Matousek	4085a229fe	Merge remote-tracking branch 'upstream/main'	2023-08-21 17:22:20 +01:00
Jake Tae	409db505d2	Add device support in TTS and Synthesizer (#2855 ) * fix: resolve merge conflicts * fix: retain backwards compatability in functions * feature: utilize device for voice transfer * feature: use device for vocoder * chore: cleanup vocoder cpu logic * fix: add necessary vocoder output device check * fix: add necessary vocoder output device check * fix: indentation * fix: check if waveform is pt tensor before cpu conversion --------- Co-authored-by: Jake Tae <jaketae@Jakes-MacBook-Pro-2.local>	2023-08-14 21:04:44 +02:00
Julian Weber	febcaf710a	Add customizable data home path (#2871 ) * Add customizable data home path * Add TTS_HOME as an option	2023-08-14 21:02:48 +02:00
Eren Gölge	c4e5effab9	Bump up to v0.16.3	2023-08-13 12:22:04 +02:00
Michael New	1f9d600b83	Denote human voices in README.md (#2851 )	2023-08-13 12:15:17 +02:00
Eren Gölge	3a104d5c49	Update Studio API for XTTS (#2861 ) * Update Studio API for XTTS * Update the docs * Update README.md * Update README.md Update README	2023-08-13 12:04:12 +02:00
Eren G??lge	37b558ccb9	Make style	2023-08-11 12:55:23 +02:00
Eren G??lge	9a8352b8da	Fix import error with Bark	2023-08-11 03:33:59 +02:00
Eren Gölge	c87377b713	Bump up to v0.16.2	2023-08-07 13:21:14 +02:00
Eren Gölge	4186f42b21	Handle missing JA phonemizer (#2843 ) * Handle missing JA phonemizer * Make style	2023-08-07 13:19:38 +02:00
Eren Gölge	48f8133eae	Fix imports (#2845 )	2023-08-07 13:19:26 +02:00
Jindrich Matousek	874143bf04	Add support for phone (char) based length scale Remove length_scale from default aux_input	2023-08-06 13:17:53 +02:00
Jindrich Matousek	d3661d7d26	Fix artic_multispeaker formatter	2023-08-05 10:30:53 +02:00
Javier	4e7f8cd021	Add fairseq onnx support and strict configuration, fixes some onnx errors (#2831 )	2023-08-04 11:02:59 +02:00
ChaseC	52a528cfcf	add post functionality to /api/tts (#2836 )	2023-08-04 10:54:20 +02:00
Eren Gölge	dc04baa1ee	Bump up to v0.16.1	2023-07-31 15:54:45 +02:00
Eren Gölge	17ddd65741	Please p3.11	2023-07-31 15:53:19 +02:00
Eren Gölge	69f080eb47	Fix DelightfulTTS (#2823 ) * Fix tests * Make style	2023-07-31 13:52:45 +02:00
Eren Gölge	483888b9d8	Add kwargs to ignore extra arguments w/o error (#2822 )	2023-07-31 11:37:35 +02:00
AWAS666	9e74b51aa6	Delightful TTS VCTK recipe fixes (#2808 ) * fix: wrong import class * fix: formatter name missing * feat: get rid of clearml	2023-07-31 10:27:42 +02:00
Aleś Bułojčyk	d124f78430	Recipe for Belarusian TTS (#2756 ) * Changes from jhlfrfufyfn <jhlfrfufyfn@gmail.com> * Recipe for Belarusian TTS --------- Co-authored-by: jhlfrfufyfn <jhlfrfufyfn@gmail.com>	2023-07-31 10:26:21 +02:00
Javier	c140df5a58	Adds multi-language support for VITS onnx, fixes onnx inference error when speaker_id is None or not passed, fixes onnx exporting for models with init_discriminator=false (#2816 )	2023-07-31 10:19:49 +02:00
Eren Gölge	b739326503	Bump up to v0.16.0	2023-07-24 16:04:10 +02:00
Eren Gölge	8aacb81849	Fix Tortoise load (#2791 ) * Remove key prunning in tortoise * Make lint	2023-07-24 13:42:47 +02:00
Eren Gölge	b3472a739e	Update README.md	2023-07-24 13:42:20 +02:00
logan hart	6fdb88f8e2	Add Delightful-TTS implementation (#2095 ) * add configs * Update config file * Add model configs * Add model layers * Add layer files * Add layer modules * change config names * Add emotion manager * fIX missing ap bug * Fix missing ap bug * Add base TTS e2e class * Fix wrong variable name in load_tts_samples * Add training script * Remove range predictor and gaussian upsampling * Add helper function * Add vctk recipe * Add conformer docs * Fix linting in conformer.py * Add Docs * remove duplicate import * refactor args * Fix bugs * Removew emotion embedding * remove unused arg * Remove emotion embedding arg * Remove emotion embedding arg * fix style issues * Fix bugs * Fix bugs * Add unittests * make style * fix formatter bug * fix test * Add pyworld compute pitch func * Update requirments.txt * Fix dataset Bug * Chnge layer norm to instance norm * Add missing import * Remove emotions.py * remove ssim loss * Add init layers func to aligner * refactor model layers * remove audio_config arg * Rename loss func * Rename to delightful-tts * Rename loss func * Remove unused modules * refactor imports * replace audio config with audio processor * Add change sample rate option * remove broken resample func * update recipe * fix style, add config docs * fix tests and multispeaker embd dim * remove pyworld * Make style and fix inference * Split tts tests * Fixup * Fixup * Fixup * Add argument names * Set "random" speaker in the model Tortoise/Bark * Use a diff f0_cache path for delightfull tts * Fix delightful speaker handling * Fix lint * Make style --------- Co-authored-by: loganhart420 <loganartpersonal@gmail.com> Co-authored-by: Eren Gölge <erogol@hotmail.com>	2023-07-24 13:41:26 +02:00
Eren Gölge	f24c5e0276	Update README	2023-07-24 13:30:19 +02:00
Eren Gölge	1652598a33	Test synthesize api separately	2023-07-24 12:38:20 +02:00

1 2 3 4 5 ...

4513 Commits All Branches Search

4513 Commits

All Branches