Abstract

It is key technology for handwritten historical document image analysis to segment text lines. There is no doubt that it is very difficult for text line segmentation owning to varied forms of handwritten Manchu historical document images. A novel method for handwritten Manchu historical document segmentation is presented. Initially, this method extracts the blob lines of the text lines by anisotropic Gaussian kernel and obtains connected components by contours. Then, it breaks the adhesion region between lines which is obtained by blob lines merging. Finally the detected text line and connection components combine to form the initial text line region, and text lines segmentation is achieved by boundary area growing. Results of segmentation for heterogeneous documents are provided, and the proposed method is compared with projection profit and seam craving methods in the same settings. The method is good at handling the skew and adhesion Manchu text lines. Tested on dataset of handwritten Manchu historical documents, the recall of 97.57% correct lines is achieved by the proposed method.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.