Degraded quality with timestamps disabled #2186

Open
jpospical opened this issue May 27, 2024 · 1 comment

Comments

@jpospical

After commit f7908f9, some audio is often ignored when running with -nt. This seems to happen regardless of audio file, model or CPU (I've tested with Intel and AMD at least). I can reliably reproduce with this clip and large-v1. Specifically, all but the last word "damage" of the section from 00:00:27-00:00:30 is consistently ignored with timestamps=0, but it's transcribed as expected otherwise.

I was going to comment on #1755 rather than creating a new issue, but that PR was merged a long time ago so I wasn't sure if it was appropriate. Apologies if that's what I should've done.
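To make the comparison above easier to repeat on other clips, here is a minimal sketch that diffs two transcripts of the same audio (one produced with timestamps, one with -nt) and lists runs of words that only the first one contains. The function names `words` and `dropped_runs`, the `min_len` threshold, and the sample sentences are all hypothetical and chosen for illustration; this is not part of whisper.cpp, and it assumes the two transcripts have already been saved or pasted as plain text.

```python
import difflib
import re


def words(text):
    """Lowercase the text and split it into words, dropping punctuation."""
    return re.findall(r"[a-z0-9']+", text.lower())


def dropped_runs(reference, candidate, min_len=3):
    """Return runs of at least min_len consecutive reference words that are
    deleted or replaced in the candidate transcript."""
    ref, cand = words(reference), words(candidate)
    matcher = difflib.SequenceMatcher(a=ref, b=cand, autojunk=False)
    runs = []
    for tag, i1, i2, _j1, _j2 in matcher.get_opcodes():
        # "delete" = words only in the reference; "replace" = words that the
        # candidate swapped for something else. Both count as lost here.
        if tag in ("delete", "replace") and i2 - i1 >= min_len:
            runs.append(" ".join(ref[i1:i2]))
    return runs


if __name__ == "__main__":
    # Hypothetical transcripts, NOT taken from the clip in this issue.
    with_ts = "the storm hit the coast and caused severe flooding and damage to homes"
    no_ts = "the storm hit the coast damage to homes"
    for run in dropped_runs(with_ts, no_ts):
        print("missing with -nt:", run)
```

A word-level diff is used rather than a line-level one because the two runs may break lines differently, so only the word sequence is comparable. Small wording differences between runs are filtered out by `min_len`, which may need tuning.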

@au-voltzzz

au-voltzzz commented Jun 13, 2024

I commented on a different thread (#2191), which I first thought described my issue, but it only inspired me to try an earlier version of whisper.cpp. After trying different things throughout the day, I have since discovered that using -nt was the culprit. It happened with the v2, v3, and medium.en models: large portions of audio were not transcribed. For now I am using whisper.cpp-v1.5.4, which fixes the problem.

This is happening for me on an Apple M1 (sorry, I am not sure what info to share to help describe my issue better).

Edit: I removed the comment on the other thread
