Issues with VITS: Mixed Voices and Missing Number Synthesis #212

iliuha93 · 2024-05-16T07:39:56Z

Hello,

I am using the VITS model for text-to-speech synthesis with a configuration that specifies using a single voice. However, I am encountering two issues:

1. Sometimes the output speech is partially voiced by a male and partially by a female voice, even though the configuration is set to use a single voice.
2. The model does not synthesize numbers correctly.

Here is my current configuration:

{
  "_name_or_path": "facebook/mms-tts-deu",
  "activation_dropout": 0.1,
  "architectures": ["VitsModel"]
}
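
For the number issue, one possible workaround is to spell out digits as German words before passing the text to the tokenizer. This rests on an assumption that is not confirmed in this thread: that the tokenizer for this checkpoint has no digit tokens and therefore drops them. The sketch below is plain Python with no TTS dependency. The helper names `german_number` and `normalize_numbers` are hypothetical, and it only handles non-negative integers below one million (no decimals, ordinals, dates, or years read as "neunzehnhundert...").

```python
import re

# Stem forms: "ein" (not "eins") so compounds like "einundzwanzig" come out right.
ONES = ["null", "ein", "zwei", "drei", "vier", "fünf", "sechs", "sieben",
        "acht", "neun", "zehn", "elf", "zwölf", "dreizehn", "vierzehn",
        "fünfzehn", "sechzehn", "siebzehn", "achtzehn", "neunzehn"]
TENS = ["", "", "zwanzig", "dreißig", "vierzig", "fünfzig",
        "sechzig", "siebzig", "achtzig", "neunzig"]


def _below_1000(n: int) -> str:
    """Spell 1..999 in stem form; returns '' for 0."""
    word = ""
    hundreds, rest = divmod(n, 100)
    if hundreds:
        word += ONES[hundreds] + "hundert"
    if rest == 0:
        return word
    if rest < 20:
        word += ONES[rest]
    else:
        tens, ones = divmod(rest, 10)
        if ones:
            word += ONES[ones] + "und"  # German puts the unit before the ten
        word += TENS[tens]
    return word


def german_number(n: int) -> str:
    """Spell a cardinal number in the range 0..999999 as one German word."""
    if n == 0:
        return "null"
    if not 0 < n < 1_000_000:
        raise ValueError("only 0..999999 is supported in this sketch")
    thousands, rest = divmod(n, 1000)
    word = ""
    if thousands:
        word += _below_1000(thousands) + "tausend"
    word += _below_1000(rest)
    if n % 100 == 1:
        word += "s"  # a trailing "ein" becomes "eins" (1, 101, 1001, ...)
    return word


def normalize_numbers(text: str) -> str:
    """Replace each run of digits with its German spelling.

    Numbers outside the supported range are left unchanged.
    """
    def _replace(match: re.Match) -> str:
        value = int(match.group())
        return german_number(value) if value < 1_000_000 else match.group()

    return re.sub(r"\d+", _replace, text)
```

Calling `normalize_numbers(text)` on the input string before tokenization would then hand the model only letters. A maintained text-normalization library is likely a better choice for anything beyond simple integers.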
