generate duplicated phrases #94
Comments
I have seen this happen outside of whisper-timestamped, with other Whisper implementations as well. I am curious: is it caused by hallucination, or by not using VAD?
Also seeing this, mostly during quiet parts, if that helps at all. Otherwise the transcription is spot on, even with the hardest content.
For this particular sample, --accurate will get rid of the duplicates.
Yes, exactly @misutoneko
When using the small or tiny model, the duplicated phrases decrease. WhisperX also has this issue.
Some people reported that using a higher value for compression_ratio_threshold than the default improves this issue.
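For context, here is a minimal sketch of what that threshold is compared against. It assumes the metric follows OpenAI Whisper's definition (UTF-8 byte length divided by zlib-compressed length) and that the default threshold is 2.4. The two sample strings are made up for illustration and are not taken from the audio in this issue.

```python
import zlib


def compression_ratio(text: str) -> float:
    """Raw UTF-8 size divided by zlib-compressed size; repetitive text scores high."""
    raw = text.encode("utf-8")
    return len(raw) / len(zlib.compress(raw))


# Hypothetical sample outputs, for illustration only.
normal = "The quick brown fox jumps over the lazy dog near the quiet river bank at dawn."
looped = "Thank you for watching. " * 20

THRESHOLD = 2.4  # assumed default for compression_ratio_threshold

for label, text in [("normal", normal), ("looped", looped)]:
    ratio = compression_ratio(text)
    print(f"{label}: ratio={ratio:.2f}, above threshold={ratio > THRESHOLD}")
```

A segment full of repeated phrases compresses very well, so its ratio rises far above that of ordinary speech. Changing the threshold changes which segments get treated as suspiciously repetitive, so it may be worth trying a few values on the affected files.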
Had the same problem, with >10 repetitions for several .mp3 files.
Whisper-timestamped will generate duplicated phrases for some audio, such as https://flex2.acast.com/s/pbs-newshour-segments/u/d3i6fh83elv35t.cloudfront.net/static/2023/05/newswrap-15.mp3
I use the small and medium models.