Abstract
Automatic classification of packaging cartons according to their contents is an industrial need. In this paper we present an Optical Character Recognition (OCR) system that segments and recognizes the sparse dot matrix text printed on cartons in order to classify them by their contents. The proposed solution is robust to conditions such as non-uniform background illumination, shadow artifacts, inclined text, and text degraded by missing dots. We propose an efficient segmentation technique based on simple morphological operations, which exploits the discrete nature of dot matrix text to distinguish it from other printed information. Dot matrix characters can be uniquely characterized by analyzing their pattern of dots. We retrieve this pattern and feed it as a feature vector to a trained Support Vector Machine (SVM) classifier. The combination of these unique patterns and the SVM classifier results in high character recognition accuracy, which in turn leads to efficient carton classification. Finally, we discuss the result statistics of character recognition and carton classification.
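To make the idea of a dot-pattern feature vector concrete, the following is a minimal sketch and not the implementation used in this work. It assumes that a character has already been segmented and binarized, and that the font uses a 7x5 dot grid; the grid size, the function names `dot_pattern` and `render`, and the sample pattern for the letter "L" are all illustrative assumptions.

```python
from typing import List


def dot_pattern(char_img: List[List[int]], rows: int = 7, cols: int = 5) -> List[int]:
    """Reduce a binary character image to a rows x cols dot-presence vector.

    Assumption: the image is cropped to the character's bounding box and the
    dots lie on a regular grid (7x5 here, chosen only for illustration).
    """
    h, w = len(char_img), len(char_img[0])
    if h < rows or w < cols:
        raise ValueError("image is smaller than the assumed dot grid")
    features = []
    for r in range(rows):
        # Pixel rows covered by grid row r.
        y0, y1 = r * h // rows, (r + 1) * h // rows
        for c in range(cols):
            # Pixel columns covered by grid column c.
            x0, x1 = c * w // cols, (c + 1) * w // cols
            has_dot = any(
                char_img[y][x] for y in range(y0, y1) for x in range(x0, x1)
            )
            features.append(1 if has_dot else 0)
    return features


# Hypothetical 7x5 dot layout for the letter "L" ('#' marks a printed dot).
L_PATTERN = [
    "#....",
    "#....",
    "#....",
    "#....",
    "#....",
    "#....",
    "#####",
]


def render(pattern: List[str], scale: int = 2) -> List[List[int]]:
    """Draw each dot as a single foreground pixel in a scale-times larger image."""
    h, w = len(pattern) * scale, len(pattern[0]) * scale
    img = [[0] * w for _ in range(h)]
    for r, row in enumerate(pattern):
        for c, ch in enumerate(row):
            if ch == "#":
                img[r * scale][c * scale] = 1
    return img


features = dot_pattern(render(L_PATTERN))
print(len(features), sum(features))
```

A vector of this kind could then be passed to any SVM implementation, for example scikit-learn's `SVC`, together with the character label. The kernel and training setup are not specified in this abstract, so they are left open here.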