A suggestion: could duplicate images be removed during image-to-PDF conversion? #8

Open
LaBaZh opened this issue May 27, 2024 · 1 comment

Comments

@LaBaZh

LaBaZh commented May 27, 2024

No description provided.

@PeiPei233
Owner

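I previously experimented with a few simple similarity algorithms. The problem is that a similarity score makes it hard to define what counts as a "duplicate": for example, a slide where only some of the text has changed and the rest is largely the same would also be judged a duplicate.
So far I haven't found a good way to balance this while keeping the software from becoming bloated. If that behavior is acceptable, I could consider adding a toggle later that enables this as an experimental feature 🧐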
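
To make the trade-off above concrete, here is a minimal sketch of a similarity-based filter behind an opt-in switch. It is not the project's code: the thread does not say which algorithms were tried, so an average hash stands in as an assumption, and `average_hash`, `hamming`, `dedupe`, `make_slide`, and the `experimental` / `max_distance` parameters are all hypothetical names. Pages are plain grids of grayscale values so the example needs no image library.

```python
from typing import List, Sequence

Gray = Sequence[Sequence[int]]  # rows of grayscale values in 0..255


def average_hash(pixels: Gray, size: int = 8) -> int:
    """Block-average down to size x size cells, then set one bit per cell
    that is brighter than the mean. Assumes the image is at least size x size."""
    height, width = len(pixels), len(pixels[0])
    cells = []
    for by in range(size):
        for bx in range(size):
            ys = range(by * height // size, (by + 1) * height // size)
            xs = range(bx * width // size, (bx + 1) * width // size)
            block = [pixels[y][x] for y in ys for x in xs]
            cells.append(sum(block) / len(block))
    mean = sum(cells) / len(cells)
    bits = 0
    for cell in cells:
        bits = (bits << 1) | (1 if cell > mean else 0)
    return bits


def hamming(a: int, b: int) -> int:
    """Number of differing bits between two hashes."""
    return bin(a ^ b).count("1")


def dedupe(pages: List[Gray], max_distance: int = 0,
           experimental: bool = False) -> List[Gray]:
    """Drop pages whose hash is within max_distance of an earlier kept page.
    Hypothetical switch: with experimental=False nothing is removed."""
    if not experimental:
        return list(pages)
    kept, hashes = [], []
    for page in pages:
        h = average_hash(page)
        if any(hamming(h, seen) <= max_distance for seen in hashes):
            continue
        kept.append(page)
        hashes.append(h)
    return kept


def make_slide(dark_boxes, side: int = 64):
    """Synthetic white page with dark rectangles given as (y0, y1, x0, x1)."""
    page = [[255] * side for _ in range(side)]
    for y0, y1, x0, x1 in dark_boxes:
        for y in range(y0, y1):
            for x in range(x0, x1):
                page[y][x] = 0
    return page


title_bar = (0, 16, 0, 64)
slide_a = make_slide([title_bar])
slide_b = make_slide([title_bar, (40, 42, 10, 21)])  # same slide, one small edit
slide_c = make_slide([(0, 64, 0, 32)])               # a different layout
```

With these synthetic pages, the small edit in `slide_b` does not change the hash at all, so `dedupe([slide_a, slide_b, slide_c], experimental=True)` drops it even at `max_distance=0` while keeping `slide_c`. That is the false positive described above, and it is why the sketch leaves the filter off by default.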
