Abstract

Detection and recognition of text superimposed in complex background has been considered as a challenging problem. Most of the existing methods first locate the text regions and then feed them into OCR package for recognition. However, these methods cannot achieve good recognition performance due to the complex background. For this purpose, this paper proposes a novel text detection and recognition method by using color clustering to divide images into multiple layers according to main color class. In the proposed method, we exploited a connected component analysis to obtain the candidate text regions from each color layer, and then a cascade Adaboost classifier is adopted to determine whether the candidate text regions is real text regions in the corresponding image layer. Because the monochrome color exists in each layer, the interference of the background can be effectively reduced, which can significantly improve the accuracy of text regions localization. Afterwards, an OCR package is used to recognize the text regions which have been located by the cascade Adaboost classifier. Since the text region has a monochrome color, it helps to greatly improve the recognition rate. Finally, the relationship between different layers is used to verify the recognition results by the text location. The experimental results show that the proposed approach significantly outperforms the existing methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.