This repository has been archived by the owner on Feb 19, 2021. It is now read-only.
1.2.0
- New Docker image, now based on Alpine, thanks to the efforts of @addadi and @pit. This new image is dramatically smaller than the Debian-based one, and it also has a new home on Docker Hub. A proper thank-you to @Pit_ for hosting the image on his Docker account all this time, but after some discussion, we decided the image needed a more official-looking home.
- @BastianPoe has added the long-awaited feature to automatically skip the OCR step when the PDF already contains text. This can be overridden by setting
PAPERLESS_OCR_ALWAYS=YES
either in yourpaperless.conf
or in the environment. Note that this also means that Paperless now requireslibpoppler-cpp-dev
to be installed. Important: You'll need to runpip install -r requirements.txt
after the usualgit pull
to properly update. - @BastianPoe has also contributed a monumental amount of work (#291) to solving #158: setting the document creation date based on finding a date in the document text.