Text line segmentation and word recognition in a system for general writer independent handwriting recognition

U.-V Marti,H Bunke

doi:10.1109/icdar.2001.953775

Abstract

We present a system for recognizing unconstrained English handwritten text based on a large vocabulary. We describe the three main components of the system, which are preprocessing, feature extraction and recognition. In the preprocessing phase the handwritten texts are first segmented into lines. Then each line of text is normalized with respect to of skew, slant, vertical position and width. After these steps, text lines are segmented into single words. For this purpose distances between connected components are measured. Using a threshold, the distances are divided into distances within a word and distances between different words. A line of text is segmented at positions where the distances are larger than the chosen threshold. From each image representing a single word, a sequence of features is extracted. These features are input to a recognition procedure which is based on hidden Markov models. To investigate the stability of the segmentation algorithm the threshold that separates intra- and inter-word distances from each other is varied. If the threshold is small many errors are caused by over-segmentation, while for large thresholds under-segmentation errors occur. The best segmentation performance is 95.56% correctly segmented words, tested on 541 text lines containing 3899 words. Given a correct segmentation rate of 95.56%, a recognition rate of 73.45% on the word level is achieved.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Text line segmentation and word recognition in a system for general writer independent handwriting recognition

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Text line segmentation from struck-out handwritten document images
Palaiahnakote Shivakumara ... Tong Lu
Expert Systems With Applications | VOL. 210
Palaiahnakote Shivakumara, et. al.Palaiahnakote Shivakumara ... Tong Lu
25 Jul 2022
Expert Systems With Applications | VOL. 210

Text Line Extraction Based on Distance Map Features and Dynamic Programming
Vicente Bosch Campos ... Alejandro Hector Toselli Rossi
-
Vicente Bosch Campos, et. al.Vicente Bosch Campos ... Alejandro Hector Toselli Rossi
01 Aug 2018
01 Aug 2018

A robust method for line and word segmentation in handwritten text
Abdelaali Hassaine
-
Abdelaali HassaineAbdelaali Hassaine
01 Jan 2013
01 Jan 2013

Historical Text Line Segmentation Using Deep Learning Algorithms: Mask-RCNN against U-Net Networks.
Florian Côme Fizaine ... Michel Paindavoine
Journal of imaging | VOL. 10
Florian Côme Fizaine, et. al.Florian Côme Fizaine ... Michel Paindavoine
05 Mar 2024
Journal of imaging | VOL. 10

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Text line segmentation and word recognition in a system for general writer independent handwriting recognition

Abstract

Talk to us

Similar Papers