Unknown-box Approximation to Improve Optical Character Recognition Performance

This code repository contains the scripts needs to recreate the results mentioned in the manuscript "Unknown-box Approximation to Improve Optical Character Recognition Performance" (Springer publication of ICDAR 2021 | arxiv)

Contents of the repo

Scripts

train_crnn.py - Script to train the CRNN model to avoid cold start problem
train_nn_area.py - Script to train the NN-based preprocessor with VGG dataset
train_nn_patch.py - Script to train the NN-based preprocessor with POS patch dataset
train_sfe_area.py - Script to train the SFE-based preprocessor with VGG dataset
train_sfe_patch.py - Script to train the SFE-based preprocessor with POS patch dataset
eval_prep.py - Evaluate the preprocessor with two datasets and the two OCR engines
properties.py - Contains global properties used by the scripts

Directories

trained_models - Pretrained preprocessor models and CRNN models
datasets - Contains data loader scripts
ocr_helper - Contains codes to connect with OCR engines
models - Contains the two models

Steps to run

We have used Anaconda package manager in the Linux environment (Ubuntu 18.04.5 LTS) and recommends to use the same. Use the following command to create a conda environment named ocr-test with all the dependencies.

conda env create -f environment.yml

Run init_workspace.py to create necessary directories. Download the three dataset zip files from "http://bit.ly/approx-ocr-grad" and put them in a directory named data and unzip them.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Unknown-box Approximation to Improve Optical Character Recognition Performance

Contents of the repo

Scripts

Directories

Steps to run

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
datasets		datasets
models		models
ocr_helper		ocr_helper
trained_models		trained_models
.gitignore		.gitignore
README.md		README.md
environment.yml		environment.yml
eval_prep.py		eval_prep.py
init_workspace.py		init_workspace.py
properties.py		properties.py
train_crnn.py		train_crnn.py
train_nn_area.py		train_nn_area.py
train_nn_patch.py		train_nn_patch.py
train_sfe_area.py		train_sfe_area.py
train_sfe_patch.py		train_sfe_patch.py
transform_helper.py		transform_helper.py
utils.py		utils.py

paarandika/Gradient-Approx-to-improve-OCR

Folders and files

Latest commit

History

Repository files navigation

Unknown-box Approximation to Improve Optical Character Recognition Performance

Contents of the repo

Scripts

Directories

Steps to run

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages