Abstract

The pre-processing activities for handwritten Devanagari text recognition includes an significant step called Segmentation. The segmentation accuracy of Devanagari text characters depends entirely on the accurately segmented lines and words in the handwritten documents. The process of segmenting lines and words correctly leads to many issues. More detailed information is lagging on the segmentation of lines and words from Devanagari text documents, whereas it is available more for other script documents in the literature. Here, we accomplished the task of segmenting the lines and words using Connected Components with Statistics Method on PHDIndic_11 dataset. Experimentation using above mentioned method resulted in line segmentation accuracy of 91.91% and word segmentation accuracy of 72.89% which outperforms over Global threshold and Otsu’s optimum threshold methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.