Default config of without_timestamps=True affects whisper transcript quality. #932

Open
Artaches opened this issue Nov 25, 2024 · 6 comments

Comments

@Artaches

WhisperX defaults to without_timestamps=True, while faster-whisper defaults to without_timestamps=False. This affects transcript quality: WhisperX output can have long (5-15s) continuous stretches where the transcript is dropped. I've attached an example of a small audio clip (~10s, so the VADs in WhisperX and faster-whisper are off) that produces a worse transcript when without_timestamps=True.
whisperxWithoutTimestepsExample.zip
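
One way to check this on your own clip is to transcribe it twice, once with each setting, and list the words that only appear in the without_timestamps=False run. The following is a minimal sketch, not the method used to produce the attached example. The helper names `transcribe_text` and `missing_words` are made up for illustration, and the sketch assumes that `whisperx.load_audio` and `model.transcribe` return segments with a "text" field, as in the WhisperX README.

```python
import difflib


def transcribe_text(audio_path, without_timestamps, model_name="large-v3", device="cuda"):
    """Transcribe one file with WhisperX and return the joined segment text.

    Hypothetical helper: the name and defaults are illustrative only.
    """
    import whisperx  # imported lazily so missing_words() works without WhisperX installed

    model = whisperx.load_model(
        model_name,
        device=device,
        asr_options={"without_timestamps": without_timestamps},
    )
    audio = whisperx.load_audio(audio_path)
    result = model.transcribe(audio)
    return " ".join(seg["text"].strip() for seg in result["segments"])


def missing_words(reference, candidate):
    """Return runs of words that are in `reference` but were dropped from `candidate`."""
    ref_words = reference.split()
    cand_words = candidate.split()
    matcher = difflib.SequenceMatcher(a=ref_words, b=cand_words, autojunk=False)
    dropped = []
    for tag, i1, i2, _j1, _j2 in matcher.get_opcodes():
        if tag == "delete":  # words present only in the reference transcript
            dropped.append(" ".join(ref_words[i1:i2]))
    return dropped


# Example usage (needs WhisperX, a GPU, and a real audio file; "clip.wav" is a placeholder):
#   with_ts = transcribe_text("clip.wav", without_timestamps=False)
#   without_ts = transcribe_text("clip.wav", without_timestamps=True)
#   print(missing_words(with_ts, without_ts))
```

The diff is word-based and case-sensitive, so small wording differences between the two runs show up as replacements rather than drops and are ignored here. Only stretches that are entirely missing from the second transcript are reported.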

@pramadikaegamo

Yes, that's right. After changing the setting to without_timestamps=False, some of the parts that were previously left untranscribed are transcribed.

@heartInsert

Where should I change the without_timestamps parameter to False in WhisperX?

@pramadikaegamo

You can pass the asr_options parameter to the load_model function, as in the following example:

import whisperx

# Override the WhisperX default (without_timestamps=True).
asr_options = {
    "without_timestamps": False,
}

model = whisperx.load_model("large-v3", device="cuda", asr_options=asr_options)

@heartInsert

@pramadikaegamo Thanks a lot.

@sulutian

Can this parameter only be set in Python, and not on the command line?

@pramadikaegamo

Yes, the example I gave sets the parameter in Python, not on the command line.
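
This thread does not say whether the whisperx command-line tool exposes the option. If you need to control it from a shell, one workaround is a small wrapper script around the Python API. The following is a minimal sketch under that assumption. The script and its flags (`--model`, `--device`, `--without-timestamps`) are hypothetical and belong to the wrapper, not to the whisperx CLI, and the segment fields "start", "end" and "text" are assumed to match the WhisperX README.

```python
import argparse


def parse_args(argv=None):
    """Parse the wrapper's own (hypothetical) command-line flags."""
    parser = argparse.ArgumentParser(
        description="Transcribe a file with WhisperX, keeping timestamp tokens by default."
    )
    parser.add_argument("audio", help="path to the audio file")
    parser.add_argument("--model", default="large-v3", help="Whisper model name")
    parser.add_argument("--device", default="cuda", help="device passed to load_model")
    parser.add_argument(
        "--without-timestamps",
        action="store_true",
        help="restore the WhisperX default (without_timestamps=True)",
    )
    return parser.parse_args(argv)


def main(argv=None):
    args = parse_args(argv)

    import whisperx  # imported here so --help works without loading the library

    model = whisperx.load_model(
        args.model,
        device=args.device,
        asr_options={"without_timestamps": args.without_timestamps},
    )
    audio = whisperx.load_audio(args.audio)
    result = model.transcribe(audio)
    for seg in result["segments"]:
        print(f'[{seg["start"]:.2f} - {seg["end"]:.2f}] {seg["text"].strip()}')


# To run this as a script, call main() under the usual
# `if __name__ == "__main__":` guard at the bottom of the file.
```

With the guard added and the file saved under a name of your choice, running it with just an audio path uses without_timestamps=False, and passing `--without-timestamps` switches back to the WhisperX default.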


5 participants
@Artaches @heartInsert @pramadikaegamo @sulutian and others