Abstract

Natural scene text classification is a challenging task because of the diversity of image content, degradations such as noise and low contrast or resolution, and the unpredictable appearance of the foreground (font, style, size, and orientation) and background. In addition, the high dimensionality of the input image’s feature space is a major problem in such tasks. This work aims to tackle these problems by removing redundant and irrelevant features to improve the generalization of the classifier. In other words, selecting a high-quality, discriminative set of features reduces dimensionality and thereby helps achieve successful pattern classification. We use a biologically inspired genetic algorithm (GA) because the crossover operator employed in such algorithms significantly improves the quality of the multimodal discriminative feature set and hence the classification accuracy for diverse natural scene text images. A Support Vector Machine (SVM) is used for classification, and the average F-score is used as the fitness function and target condition. First, after the input images are preprocessed, the whole feature space (population) is built using a multimodal feature representation technique. Second, a feature-level fusion approach is used to combine the features. Third, to improve the average F-score of the classifier, we apply a meta-heuristic optimization technique, using a GA for feature selection. The proposed algorithm is tested on five publicly available datasets, and the results are compared with various state-of-the-art methods. The results show that the proposed algorithm classifies textual and non-textual regions with better accuracy than benchmark state-of-the-art algorithms.
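
The feature-selection loop described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the population size, mutation rate, tournament selection, elitism, and the toy data are all assumptions made for illustration, and a nearest-centroid classifier stands in for the SVM so that the example needs only the Python standard library. Each individual is a binary mask over the fused feature vector, and its fitness is the F-score averaged over the text and non-text classes.

```python
import random

def macro_f1(y_true, y_pred):
    """Average of the per-class F-scores for labels 0 (non-text) and 1 (text)."""
    scores = []
    for cls in (0, 1):
        tp = sum(1 for t, p in zip(y_true, y_pred) if t == cls and p == cls)
        fp = sum(1 for t, p in zip(y_true, y_pred) if t != cls and p == cls)
        fn = sum(1 for t, p in zip(y_true, y_pred) if t == cls and p != cls)
        denom = 2 * tp + fp + fn
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)

def centroid_predict(train_X, train_y, test_X, mask):
    """Stand-in classifier (NOT an SVM): nearest class centroid on the selected features."""
    idx = [i for i, bit in enumerate(mask) if bit]
    centroids = {}
    for cls in (0, 1):
        rows = [x for x, y in zip(train_X, train_y) if y == cls]
        centroids[cls] = [sum(r[i] for r in rows) / len(rows) for i in idx]
    preds = []
    for x in test_X:
        dist = {c: sum((x[i] - m) ** 2 for i, m in zip(idx, mu))
                for c, mu in centroids.items()}
        preds.append(min(dist, key=dist.get))
    return preds

def fitness(mask, data):
    """Average F-score on the validation split; an empty mask scores zero."""
    if not any(mask):
        return 0.0
    train_X, train_y, val_X, val_y = data
    return macro_f1(val_y, centroid_predict(train_X, train_y, val_X, mask))

def select_features(data, n_features, pop_size=20, generations=15, p_mut=0.05, rng=None):
    """GA over binary feature masks with tournament selection, one-point crossover,
    bit-flip mutation, and elitism (all hypothetical settings)."""
    rng = rng or random.Random(0)
    # Include the full feature set as a baseline individual.
    pop = [[1] * n_features]
    pop += [[rng.randint(0, 1) for _ in range(n_features)] for _ in range(pop_size - 1)]
    best = max(pop, key=lambda m: fitness(m, data))

    def tournament():
        a, b = rng.sample(pop, 2)
        return a if fitness(a, data) >= fitness(b, data) else b

    for _ in range(generations):
        children = [best[:]]  # elitism: the best mask always survives
        while len(children) < pop_size:
            p1, p2 = tournament(), tournament()
            cut = rng.randrange(1, n_features)        # one-point crossover
            child = p1[:cut] + p2[cut:]
            child = [1 - g if rng.random() < p_mut else g for g in child]
            children.append(child)
        pop = children
        best = max(pop, key=lambda m: fitness(m, data))
    return best, fitness(best, data)

def make_toy_data(rng, n_train=60, n_val=40, n_features=12, informative=(0, 1, 2)):
    """Synthetic data: a few informative features, the rest pure noise."""
    def sample(n):
        X, y = [], []
        for k in range(n):
            label = k % 2
            X.append([rng.gauss(2.0 * label, 0.5) if j in informative
                      else rng.gauss(0.0, 3.0) for j in range(n_features)])
            y.append(label)
        return X, y
    train_X, train_y = sample(n_train)
    val_X, val_y = sample(n_val)
    return train_X, train_y, val_X, val_y

data = make_toy_data(random.Random(42))
mask, score = select_features(data, n_features=12)
print(mask, round(score, 3))
```

Because the full-feature mask is part of the initial population and the best individual is carried over unchanged, the returned mask never scores below the unreduced feature set on the validation split. Swapping in an SVM only requires replacing `centroid_predict`.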

Highlights

  • Features are discriminative elements that help to distinguish between different types of objects in an image

  • There are multiple approaches to selecting the best subset of features, including principal component analysis (PCA) [5], [6], ant colony optimization [7], particle swarm optimization (PSO) [8]–[10], the firefly algorithm [11], and genetic algorithms (GAs) [4], [12]

  • Preprocessing and multimodal feature preparation for classification of natural scene text using the GA framework: the proposed framework is shown in Fig. 2; it builds the multimodal feature vector after the necessary preprocessing, which depends on the image condition
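
As an illustration of how a multimodal feature vector might be assembled by feature-level fusion, the sketch below normalizes each modality's descriptor and concatenates the results. The source does not specify the fusion rule, so L2 normalization followed by concatenation is an assumption, and the modality names `texture` and `edges` are hypothetical.

```python
import math

def l2_normalize(vec):
    """Scale a feature vector to unit length (left unchanged if it is all zeros)."""
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec] if norm else list(vec)

def fuse_features(*modalities):
    """Feature-level fusion by concatenating per-modality normalized vectors."""
    fused = []
    for vec in modalities:
        fused.extend(l2_normalize(vec))
    return fused

texture = [3.0, 4.0]        # hypothetical texture descriptor
edges = [0.0, 5.0, 0.0]     # hypothetical edge descriptor
fused = fuse_features(texture, edges)
print(fused)
```

Normalizing before concatenation keeps a modality with large raw values from dominating the fused vector; other scalings would serve the same purpose.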


Summary

Introduction

Features are discriminative elements that help to distinguish between different types of objects in an image. Pattern recognition classifiers have been observed to struggle to perform well when the feature space is high-dimensional [1]. To design a better classifier and achieve good accuracy, one possible strategy is to reduce the complexity of the model by reducing the number of features, discarding non-informative and redundant features [2], [3] obtained from a diverse set of images. GAs are powerful, stochastic, biologically inspired techniques that can be used in several image processing applications, including image enhancement, image segmentation, image classification, and (naturally) feature selection.
