Abstract

To address the low recognition accuracy for mixed Chinese/English text, this paper studies an optimized character segmentation algorithm for OCR systems. Text images are first coarsely segmented using the vertical projection method; then, following character segmentation theory, connected regions are extracted and classified as Chinese characters, Chinese character components, or English letters and digits. For connected regions of Chinese character components, traditional component merging algorithms leave some Chinese characters incompletely merged; therefore, a unit merging algorithm based on recognition feedback is proposed to merge the components. For connected regions of Chinese characters, English letters, and digits, touching (adhesion) characters can cause segmentation errors, so touching characters are detected and re-segmented using the geometric features of the characters. Tests on mixed-character segmentation show that, when recognizing mixed Chinese/English text, the optimized segmentation algorithm has a clear advantage in recognition accuracy over traditional algorithms, especially for Chinese characters composed of left and right components.
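
The coarse segmentation step can be illustrated with a minimal sketch of vertical projection on a binarized text line. This is not the paper's implementation: the function names `vertical_projection` and `coarse_segments`, the `min_ink` threshold, and the toy image are all assumptions made for illustration. The idea is to count foreground pixels in each column and cut wherever the count falls below the threshold.

```python
from typing import List, Tuple


def vertical_projection(image: List[List[int]]) -> List[int]:
    """Count foreground pixels (value 1) in each column of a binary image."""
    if not image:
        return []
    width = len(image[0])
    return [sum(row[x] for row in image) for x in range(width)]


def coarse_segments(image: List[List[int]], min_ink: int = 1) -> List[Tuple[int, int]]:
    """Return half-open column spans [start, end) whose projection is >= min_ink.

    min_ink is a hypothetical threshold: columns with fewer foreground
    pixels are treated as gaps between candidate regions.
    """
    profile = vertical_projection(image)
    spans: List[Tuple[int, int]] = []
    start = None
    for x, ink in enumerate(profile):
        if ink >= min_ink and start is None:
            start = x                      # entering an inked run
        elif ink < min_ink and start is not None:
            spans.append((start, x))       # leaving an inked run
            start = None
    if start is not None:                  # run reaches the right edge
        spans.append((start, len(profile)))
    return spans


# Toy 3-row binary text line (assumed data): three inked column runs.
image = [
    [1, 1, 0, 0, 1, 0, 1, 1, 1],
    [1, 1, 0, 0, 1, 0, 1, 0, 1],
    [1, 1, 0, 0, 1, 0, 1, 1, 1],
]
spans = coarse_segments(image)
```

A cut like this splits a left-right Chinese character into separate components whenever a blank column separates its halves, which is why a later merging step is needed; conversely, touching characters produce a single overly wide span that has to be detected and re-segmented.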
