Finding words in alphabet soup: Inference on freeform character recognition for historical scripts

Nicholas R Howe,Shaolei Feng,R Manmatha

doi:10.1016/j.patcog.2009.01.012

Finding words in alphabet soup: Inference on freeform character recognition for historical scripts

Nicholas R Howe, Shaolei Feng + Show 1 more

Open Access

https://doi.org/10.1016/j.patcog.2009.01.012

Copy DOI

Journal: Pattern Recognition	Publication Date: Jan 20, 2009
Citations: 52	License type: cc-by

Affiliation: Smith College, University of Massachusetts Amherst

#Ensemble Of Hidden Markov Models #Histograms Of Gradients + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

This paper develops word recognition methods for historical handwritten cursive and printed documents. It employs a powerful segmentation-free letter detection method based upon joint boosting with histograms of gradients as features. Efficient inference on an ensemble of hidden Markov models can select the most probable sequence of candidate character detections to recognize complete words in ambiguous handwritten text, drawing on character n -gram and physical separation models. Experiments with two corpora of handwritten historic documents show that this approach recognizes known words more accurately than previous efforts, and can also recognize out-of-vocabulary words.

Full Text