Abstract

The main purpose of scene text recognition is to detect texts in a given image. The problem of text detection and recognition in such images has gained great attention in recent years due to rising demand of several applications like visual based applications, multimedia and content-based retrieval. Due to low accuracies of existing scene text detection methods, an improved pipeline is developed for text localizing task. First, candidate text regions are generated using Maximally Stable Extremal Region and Stroke Width Transform methods that capture true positives along with many false positives. A One Class Classifier is trained to label the candidate regions obtained, as text or non-text, which in this case is suitable as non-text class cannot be adequately represented to train a binary classifier. The one class classifier is trained with some popular feature descriptors like Histogram of Oriented Gradients, Grey Level Co-Occurrence Matrix, Discrete Cosine Transform and Gabor filter. Experimental results show high recall for text containing regions and reducing false positives.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.