-
Notifications
You must be signed in to change notification settings - Fork 1.8k
--space-to-offset 1 drops characters #445
Comments
The text drawing code in PDF is According to the comment in pdf2htmlEX code, this is a known limitation. @coolwanglu can we make |
I tried to fix this in #446, @davidhedley can you test the patch with your PDFs? |
As an update to this, |
And also |
--space-to-offset 1 is incorrectly dropping some characters.
Test case:http://download.vistair.com/pdf2htmlEX/Page-2fromBAW-ALL-LHRSB.pdf
If you process with The "v" of "Effective" is dropped (converted to a space).
The font has a custom encoding, however the text is extractable.
The text was updated successfully, but these errors were encountered: