-
Notifications
You must be signed in to change notification settings - Fork 199
Home
Kay-Michael Würzner edited this page Jan 6, 2020
·
8 revisions
tesstrain (formerly ocrd-train) is a collection of scripts and documentation for training of Tesseract with LSTM (supported by Tesseract 4 and newer releases).
Currently it includes a Makefile
which allows training from real line images with ground truth (text transcriptions).
Training from synthetic images is supported by training scripts (Shell, Python) which are still part of the Tesseract code base.
- Training Fraktur with Austrian Newspapers
- Training Fraktur with GT4HistOCR
- Training Handwritten Text with German Konzilsprotokolle