imagegrep-bash

Unix-style 'grep' for a word inside a PDF or image, based on OCR.

Usage

./imagegrep foo.pdf invoice eng && echo "grab your wallet!"

no repo is complete without a catgif!

Install

wget https://raw.githubusercontent.com/coderofsalvation/imagegrep-bash/master/imagegrep 
chmod 755 imagegrep
./imagegrep foo.pdf invoice eng

Requirements

tesseract-ocr
imagemagick

These packages can be installed using apt-get or yum.
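
A quick way to check whether the requirements are present is sketched below. The helper `missing_commands` is hypothetical (it is not part of imagegrep), and the command names are assumptions: `tesseract` is the binary shipped by tesseract-ocr, and `convert` is the classic ImageMagick binary (newer ImageMagick releases may ship `magick` instead). The yum package names in the comment may also differ per distribution.

```shell
#!/usr/bin/env bash
# Hypothetical helper, not part of imagegrep:
# prints the name of every given command that is not found on PATH.
missing_commands() {
  local cmd
  for cmd in "$@"; do
    command -v "$cmd" >/dev/null 2>&1 || echo "$cmd"
  done
}

# Assumed binaries: tesseract (tesseract-ocr) and convert (imagemagick).
missing=$(missing_commands tesseract convert)
if [ -n "$missing" ]; then
  echo "missing commands:" $missing >&2
  echo "Debian/Ubuntu: sudo apt-get install tesseract-ocr imagemagick" >&2
  echo "yum-based (package names may vary): sudo yum install tesseract ImageMagick" >&2
fi
```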

Why

To automate categorizing files and choosing their destination folder. OCR fails in many cases, but sometimes recognizing one word (and its length) is enough. For example, imagegrep can be used to scrape Gmail and copy invoice attachments to a preferred folder on my hard drive.

# not covered here: gmail to local maildir using 'offlineimap'
# not covered here: use mu ('maildir-utils' package) to extract pdf attachments

find mailbox/latest/*.pdf | while read -r file; do
  ./imagegrep "$file" invoice eng    &&\
     echo "grab your wallet!"        &&\
     mv "$file" ~/admin/invoices/.
done
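
For readers curious how an OCR-based grep can be built from the two requirements, here is a minimal sketch. It is not the actual imagegrep source: the function names `text_contains_word` and `ocr_grep`, the 300 dpi setting, and the choice to scan only the first page are all assumptions made for illustration. Rasterizing a PDF with ImageMagick typically also requires Ghostscript.

```shell
#!/usr/bin/env bash
# Sketch only: NOT the actual imagegrep source. Function names are hypothetical.

# Succeeds (exit 0) if the text on stdin contains the word, ignoring case.
text_contains_word() {
  grep -qiF -- "$1"
}

# Rasterize a PDF or image, OCR it, and search the recognized text.
# Arguments: file, word, OCR language (defaults to eng).
ocr_grep() {
  local file=$1 word=$2 lang=${3:-eng} tmp status
  tmp=$(mktemp -d) || return 2
  # 300 dpi is an assumed, commonly used OCR resolution;
  # "[0]" selects only the first page of a multi-page file.
  convert -density 300 "${file}[0]" "$tmp/page.png" &&
    tesseract "$tmp/page.png" stdout -l "$lang" 2>/dev/null |
    text_contains_word "$word"
  status=$?
  rm -rf "$tmp"
  return "$status"
}
```

With these definitions, `ocr_grep foo.pdf invoice eng && echo "grab your wallet!"` behaves like the usage example: the message is printed only when the word is found in the recognized text.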
