Abstract

A novel segment confidence-based binary segmentation (SCBS) for cursive handwritten words is presented in this paper. SCBS is a character segmentation strategy for off-line cursive handwriting recognition. Unlike the approaches in the literature, SCBS is an unordered segmentation approach. SCBS is repetition of binary segmentation and fusion of segment confidence. Each repetition generates only one final segmentation point. The binary segmentation module is a contour tracing algorithm to find a segmentation path to divide a segment into two segments. A set of segments before binary segmentation is called pre-segments, and a set of segments after binary segmentation is called post-segments. SCBS uses over-segmentation technique to generate suspicious segmentation points on pre-segments. On each suspicious segmentation point, binary segmentation is performed and the highest fusion value is recorded. If the highest fusion value is greater than the one of pre-segments, the suspicious segmentation point becomes the final segmentation point for the iteration. If not, no more segmentation is required. Segment confidence is obtained by fusing mean character, lexical and shape confidences. The proposed approach has been evaluated on local and benchmark (CEDAR) databases.

Highlights

  • Off-line Cursive Handwriting Recognition (OffCHR) is an automatic process to convert an input handwritten document image into computer-recognizable character representations

  • 4.2 Database preparation Two sets of experiments were conducted on a local database and CEDAR benchmark database to check the effectiveness of the proposed approach

  • The local database was created by our group, which has been obtained from multiple writers

Read more

Summary

Introduction

Off-line Cursive Handwriting Recognition (OffCHR) is an automatic process to convert an input handwritten document image into computer-recognizable character representations. Despite sleepless research in OffCHR for decades, the performance of the state-of-the-art OffCHR is below the industrial standard to accommodate the real world problems [2,3,4,5,6]. The researchers in this field agree that the main contributor of the low OffCHR performance is the segmentation [7,8,9,10,11,12,13,14,15]. OffCHR involves a set of processes such as pre-processing, normalization, segmentation, recognition. The segmentation is a very difficult process because of the nature of cursive handwriting and it has become a major error contributor in OffCHR [22,23,24,25,26,27]

Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call