Improved memory usage #1032

Masterwow3 · 2021-10-14T15:58:09Z

We have been using pdf-lib for the last 4 months to generate our PDFs on the WEB and on the server. I noticed that for a PDF with 180 pages and high resolution PNGs, the RAM usage increases a lot (~6GB).

My code change ensures that after a page.drawImage you can directly call an image.embed().

This decodes and encodes the image immediately and remove the decoded image from RAM immediately.
With this change I have the same performance but a maximum RAM usage of 220MB.

Usage without these changes:

Max RAM usage:

Usage with these changes:

Max RAM usage:

Hopding

Hello @Masterwow3! Thanks for investigating this issue and opening a PR! I suspect this is a problem lots of people have run into, so it'll be great to release a fix 😃.

I like the general approach to took to eliminate the memory leak. However, I would like to change a few of the implementation details before merging:

Please use undefined instead of null. The null value isn't used in the pdf-lib codebase.
Instead of clearing the PngEmbedder.image property, we should clear the PDFImage.embedder property after the image has been embedded.
Please add unit tests to ensure that (1) the property is cleared after the first embed, and (2) a single PDFDocument object with an embedded image can still be updated and saved multiple times (unless there's already a test for this).

Let me know if you have any questions!

ritterzk · 2021-11-02T11:55:30Z

Hello @Hopding,

i am pleased that the amendment is accepted. I have adapted the commit as you requested.

The unneeded field "alreadyEmbedded" has been removed.
(https://github.com/Hopding/pdf-lib/pull/1032/files#diff-2e6b05e8490d2c6cc7ca78f78e8fa58bcaef509fbc97754da0808bca0f9d47ffL38)

Additionally, I made the embed() method parellel safe.
(https://github.com/Hopding/pdf-lib/pull/1032/files#diff-2e6b05e8490d2c6cc7ca78f78e8fa58bcaef509fbc97754da0808bca0f9d47ffR131)

Hopding

Looks great. Thanks @Masterwow3!

github-actions bot added the needs-triage label Oct 14, 2021

Hopding requested changes Oct 31, 2021

View reviewed changes

Hopding removed the needs-triage label Oct 31, 2021

github-actions bot added the needs-triage label Nov 2, 2021

Improved memory usage

52d86d7

Hopding approved these changes Nov 6, 2021

View reviewed changes

Hopding merged commit a1abda4 into Hopding:master Nov 6, 2021

Hopding added a commit that referenced this pull request Nov 6, 2021

#1032 cleanup

fbdfbd9

This was referenced Nov 6, 2021

chore(deps): update dependency pdf-lib to v1.17.1 sachinraja/shiki-renderer-pdf#18

Merged

deps(deps): update dependencies-non-major technologiestiftung/conversational-interface#84

Open

renovate bot mentioned this pull request Nov 16, 2021

fix(dependencies): update dependency pdf-lib to ^1.17.1 stencila/encoda#1000

Merged

1 task

renovate bot mentioned this pull request Dec 10, 2021

fix(deps): update dependency pdf-lib to v1.17.1 konnectors/attestation#18

Merged

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improved memory usage #1032

Improved memory usage #1032

Masterwow3 commented Oct 14, 2021

Hopding left a comment

ritterzk commented Nov 2, 2021

Hopding left a comment

Improved memory usage #1032

Improved memory usage #1032

Conversation

Masterwow3 commented Oct 14, 2021

Hopding left a comment

Choose a reason for hiding this comment

ritterzk commented Nov 2, 2021

Hopding left a comment

Choose a reason for hiding this comment