Abstract

We employ an edge co-occurrence matrix (ECM) to distinguish handwritten and machine printed text without resorting to line or word information. The ECM is a modified co-occurrence matrix (CM) on edge images. First, the whole image is divided into overlapping range blocks with fixed size. Then, the ECMs are abstracted from these blocks. The ECM only counts the co-occurring edges connected with each other and its up direction part is the part with most distribution. The liner Support Vector Machine (SVM) is used to classify the features. Because of the similarities of neighboring blocks, the Discriminative Random Fields (DRF) is used to further improve the classification accuracy. The experiments on document images taken from HIT and IMA databases show the effectiveness of our proposed method.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.