This repository contains code and data created for use in the Office of the Collector of Revenue in Benton County Missouri.
This is a Python script created to generate an inventory of microfilm from photos of multiple containers. It utilitizes OpenCV to detect and isolate the label from each container, then performs OCR with Tesseract to generate a CSV listing of record series on each roll.
Run it with one of the sample images:
python3 scripts/microfilm2csv.py testdata/microfilm/1.jpg
You can also pass the -d
flag to get images of each preprocessing step and
the raw OCR results from Tesseract.
A Python script that uses MuPDF to read our PDF tax books and convert them to CSV format.
Run it with one of the sample PDF files:
python3 scripts/book2csv.py testdata/books/real_current.pdf
This is a Python script that preprocesses tax statements generated by our tax collection system using GhostScript and MuPDF. It currently:
- Converts the PostScript output to PDF
- Crops pages from A4 to US Letter
- Reverses the print order of the pages so that they are in the correct order after going through the automatic folding machine
- Splits pages into batches
- Adds sequencing information to the corner of each page
- Detects statements with prior years due and adds an above-the-fold indicator that tells employees to insert a letter with the statement
Run it with one of the sample files:
python3 scripts/statements2pdf.py testdata/statements/current.prn
This is a sideloaded Roku channel that displays notices and events on the TV in the hallway outside of our office.
This is a C#/WinForms application for printing address labels. It supports address validation via the USPS WebTools API. It can also read addresses from our tax collection system. The version of Qt used by the system does not support the Windows UI Automation API, so we use the WinRT OCR API to extract an address from a screenshot of the active window.
This program is designed specifically for the Seiko Smart Label Printer 620 and SLP-2RLH thermal address labels.
This is a C++/Win32 application that exposes any barcode scanner supporting the
WinRT BarcodeScanner
API to multiple applications. By default, scanned
barcodes are injected as keyboard events to the active window. Applications
can also opt-in to receive WM_COPYDATA
messages with the barcode type and the
raw barcode contents. This greatly simplifies the implementation of scanner
clients, especially in cases where one can't easily use the WinRT APIs, such as in a Java/Swing application.
This software is the work of a government agency in the United States. To the extent possible under law, the author(s) have dedicated all copyright and related and neighboring rights to this software to the public domain worldwide. This software is distributed without any warranty.
You should have received a copy of the CC0 Public Domain Dedication along with this software. If not, see http://creativecommons.org/publicdomain/zero/1.0/.
A subset of local property tax data, including names and addresses of taxpayers, is provided for testing purposes. This data is public information and available from a variety of other sources.