Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

为啥我生成的音频文件和视频的时间对不上 #13

Open
Cqy1995 opened this issue May 20, 2024 · 1 comment
Open

为啥我生成的音频文件和视频的时间对不上 #13

Cqy1995 opened this issue May 20, 2024 · 1 comment
Labels
bug Something isn't working

Comments

@Cqy1995
Copy link

Cqy1995 commented May 20, 2024

image demo的视频时间是30s,生成后的字幕文件是57秒 image
@buxuku buxuku added the bug Something isn't working label May 20, 2024
@buxuku
Copy link
Owner

buxuku commented May 20, 2024

whisper 的文本生成比较依赖于上下文,这个视频我是通过 ffmpeg 截取了 30s, 这可能是视频导致出来的问题。当然, whisper 也确实会存在时间轴对不上的情况,我将尝试寻找更精确的输出方案。

# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants