Kay-Michael Würzner edited this page Jan 6, 2020 · 8 revisions

Welcome to the tesstrain wiki!

tesstrain (formerly ocrd-train) is a collection of scripts and documentation for training Tesseract's LSTM-based recognition engine (supported by Tesseract 4 and newer releases).

It currently includes a Makefile that supports training from real line images paired with ground truth (text transcriptions).
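
Training from line images only works if every image has a matching transcription, so it can help to check the pairing before starting a run. The following is a minimal sketch of such a check, not part of tesstrain itself. It assumes that each line image (`.tif` or `.png`) sits next to a transcription file with the same base name and a `.gt.txt` suffix; the function name `pair_lines` and the constants are hypothetical and chosen only for illustration.

```python
from pathlib import Path

# Assumptions for this sketch: image extensions and the transcription suffix.
IMAGE_SUFFIXES = (".tif", ".png")
GT_SUFFIX = ".gt.txt"


def pair_lines(ground_truth_dir):
    """Pair line images with their transcriptions.

    Returns (pairs, orphans): pairs is a list of (image name, text) tuples,
    orphans is a list of image names that have no transcription file.
    """
    root = Path(ground_truth_dir)
    pairs, orphans = [], []
    for image in sorted(root.iterdir()):
        # Skip everything that is not a line image (including the .gt.txt files).
        if image.suffix.lower() not in IMAGE_SUFFIXES:
            continue
        # Hypothetical naming rule: line1.tif -> line1.gt.txt
        transcript = image.with_name(image.stem + GT_SUFFIX)
        if transcript.is_file():
            text = transcript.read_text(encoding="utf-8").strip()
            pairs.append((image.name, text))
        else:
            orphans.append(image.name)
    return pairs, orphans
```

Calling `pair_lines("path/to/ground-truth")` on a directory of line images returns the usable pairs along with any images that still lack a transcription, which can then be fixed or removed before training.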

Training from synthetic images is supported by training scripts (Shell, Python) which are still part of the Tesseract code base.

Examples

