Abstract

In the paper, we compare the methods based on Markov Random fields (MRF) and Conditional Random fields (CRF) for separating text and non-text ink strokes in online handwritten Japanese documents. This paper validates the effect of context information in neighbor strokes based on graphical models of MRF and CRF. The task of separating text and non-text ink strokes in ink documents denotes classifying ink strokes into two classes (text and non-text). For classification, Support Vector Machine (SVM) classifiers are trained on the set of ink strokes. After converting the SVM's outputs to likelihood probabilities, they are assigned to the likelihood clique potentials of MRF and the feature functions of CRF. The classification based on MRF or CRF is considered as a labeling problem, which can be solved using a labeling algorithm. The experiments on Japanese ink documents in the Kondate database shows that the proposed method based on CRF achieves a classification rate of 98.02% while the method based on MRF produces the classification rate of 97.86%.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.