An Overview of the Tesseract OCR Engine

R Smith

doi:10.1109/icdar.2007.4376991

An Overview of the Tesseract OCR Engine

R Smith

Open Access

https://doi.org/10.1109/icdar.2007.4376991

Copy DOI

Publication Date: Sep 1, 2007

Citations: 1378

Affiliation: Google (United States)

#Tesseract OCR Engine #OCR Engine + Show 6 more

Abstract
Full-Text PDF
Similar Papers

Abstract

The Tesseract OCR engine, as was the HP Research Prototype in the UNLV Fourth Annual Test of OCR Accuracy, is described in a comprehensive overview. Emphasis is placed on aspects that are novel or at least unusual in an OCR engine, including in particular the line finding, features/classification methods, and the adaptive classifier.

Full Text