Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Tortoise Voice Cloning Error- No Checkpoints Downloading #182

Open
Benzene82 opened this issue Sep 19, 2023 · 3 comments
Open

Tortoise Voice Cloning Error- No Checkpoints Downloading #182

Benzene82 opened this issue Sep 19, 2023 · 3 comments
Labels
help wanted Extra attention is needed

Comments

@Benzene82
Copy link

I was hoping to get Tortoise TTS- Voice Cloning working. Text to Speech works fine in Bark and Tortoise, but I get the Error below trying to 'Apply Model Settings'. Sending generated Audio to RVC also opens the /checkpoints folder. I was sifting through GitHub looking for a source, or checkpoint. I see TTS doesn't really work like Stable Diffusion. I generated a few sentences, as you suggested in other issue posts, those models download but the TortoiseTTS model folder only has the ,gitkeep file. I watched all the videos in the (code) but those relate to RVC. Thank you for your time.
Here is the ERROR log...

Running on local URL: http://0.0.0.0:7860/
Traceback (most recent call last):
File "V:\ZTorTTSWebUI\installer_files\env\lib\site-packages\gradio\routes.py", line 437, in run_predict
output = await app.get_blocks().process_api(
File "V:\ZTorTTSWebUI\installer_files\env\lib\site-packages\gradio\blocks.py", line 1352, in process_api
result = await self.call_function(
File "V:\ZTorTTSWebUI\installer_files\env\lib\site-packages\gradio\blocks.py", line 1077, in call_function
prediction = await anyio.to_thread.run_sync(
File "V:\ZTorTTSWebUI\installer_files\env\lib\site-packages\anyio\to_thread.py", line 33, in run_sync
return await get_asynclib().run_sync_in_worker_thread(
File "V:\ZTorTTSWebUI\installer_files\env\lib\site-packages\anyio_backends_asyncio.py", line 877, in run_sync_in_worker_thread
return await future
File "V:\ZTorTTSWebUI\installer_files\env\lib\site-packages\anyio_backends_asyncio.py", line 807, in run
result = context.run(func, *args)
File "V:\ZTorTTSWebUI\tts-generation-webui\src\tortoise\gen_tortoise.py", line 49, in switch_model
get_tts(
File "V:\ZTorTTSWebUI\tts-generation-webui\src\tortoise\gen_tortoise.py", line 84, in get_tts
MODEL = TextToSpeech(
File "V:\ZTorTTSWebUI\installer_files\env\lib\site-packages\tortoise\api.py", line 233, in init
self.tokenizer = VoiceBpeTokenizer(
File "V:\ZTorTTSWebUI\installer_files\env\lib\site-packages\tortoise\utils\tokenizer.py", line 174, in init
self.tokenizer = Tokenizer.from_file(
Exception: stream did not contain valid UTF-8

@rsxdalv
Copy link
Owner

rsxdalv commented Sep 21, 2023

Unlike StableDiffusion where the "main" model is separated and single, Tortoise and others have multiple models working together. That's why it's an entire folder where you copy all of the models:

For me the place where they are downloaded is here:
C:\Users\<user>\.cache\tortoise\models

and the files are:

Mode                 LastWriteTime         Length Name
----                 -------------         ------ ----
-a----         30-Apr-23   7:15 PM     1716988501 autoregressive.pth
-a----         30-Apr-23   7:17 PM      975620731 clvp2.pth
-a----         30-Apr-23   7:16 PM     1169472627 diffusion_decoder.pth
-a----         30-Apr-23   8:50 PM       25193729 rlg_auto.pth
-a----         30-Apr-23   8:50 PM      100715777 rlg_diffuser.pth
-a----         30-Apr-23   7:17 PM      391384715 vocoder.pth

I tested it and if I leave the tokenizer empty, it works. Could you share the tokenizer file if that could be the fault?

@rsxdalv rsxdalv added the help wanted Extra attention is needed label Sep 21, 2023
@Benzene82
Copy link
Author

Benzene82 commented Sep 22, 2023 via email

@rsxdalv
Copy link
Owner

rsxdalv commented Sep 22, 2023

Ok, if you do English voice clones, you maybe fine with the default tokenizer and can ignore it for now.

Then, when you train a voice clone you might get an autoregressive.pth file. If you copy this file in a new folder under the tortoise models, you will be able to use it. Also, you will need to copy the other pth files as I mentioned above.

It seems like the video you watched was very specific to MRQ. But the actual model can be used without MRQ.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

2 participants