Text In Scene Images Research Articles

Natural scene text classification is a challenging task because of the diverse image content, degradations such as noise and low contrast or resolution, and the unpredictable appearance of the foreground (font, style, size, and orientation) and background. The high dimensionality of the input image's feature space is a further major problem. This work aims to tackle these problems by removing redundant and irrelevant features to improve the generalization properties of the classifier. In other words, selecting a high-quality, discriminative set of features reduces dimensionality and thereby helps to achieve successful pattern classification. We use a biologically inspired genetic algorithm (GA) because the crossover employed in such an algorithm significantly improves the quality of the multimodal discriminative feature set and hence the classification accuracy for diverse natural scene text images. The Support Vector Machine (SVM) algorithm is used for classification, and the average F-score is used as both the fitness function and the target condition. First, after the input images are preprocessed, the whole feature space (population) is built using a multimodal feature representation technique. Second, a feature-level fusion approach is used to combine the features. Third, to improve the average F-score of the classifier, we apply a meta-heuristic optimization technique using a GA for feature selection. The proposed algorithm is tested on five publicly available datasets, and the results are compared with various state-of-the-art methods. The results show that the proposed algorithm classifies textual and non-textual regions with better accuracy than benchmark state-of-the-art algorithms.
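The feature-selection loop described in this abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract uses an SVM, whereas this sketch swaps in a simple nearest-centroid classifier so that it runs with the standard library only. The function names (`f_score`, `centroid_predict`, `select_features`), the population size, the mutation rate, and the single-point crossover are all hypothetical choices that the abstract does not specify.

```python
import random

def f_score(y_true, y_pred, positive):
    """F-score (harmonic mean of precision and recall) for one class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        return 0.0
    precision, recall = tp / (tp + fp), tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

def centroid_predict(X_train, y_train, X_test, mask):
    """Stand-in classifier (NOT an SVM): nearest class centroid on masked features."""
    cols = [i for i, keep in enumerate(mask) if keep]
    labels = sorted(set(y_train))
    centroids = {}
    for lab in labels:
        rows = [x for x, y in zip(X_train, y_train) if y == lab]
        centroids[lab] = [sum(r[c] for r in rows) / len(rows) for c in cols]
    def dist(x, lab):
        return sum((x[c] - m) ** 2 for c, m in zip(cols, centroids[lab]))
    return [min(labels, key=lambda lab: dist(x, lab)) for x in X_test]

def select_features(X_train, y_train, X_val, y_val, generations=20,
                    pop_size=12, mutation_rate=0.05, target=1.0, seed=0):
    """GA over boolean feature masks; fitness is the class-averaged F-score.

    Assumes at least two features so that a crossover point exists.
    """
    rng = random.Random(seed)
    n = len(X_train[0])
    labels = sorted(set(y_val))

    def fitness(mask):
        if not any(mask):            # an empty feature set cannot classify
            return 0.0
        preds = centroid_predict(X_train, y_train, X_val, mask)
        return sum(f_score(y_val, preds, lab) for lab in labels) / len(labels)

    population = [[rng.random() < 0.5 for _ in range(n)] for _ in range(pop_size)]
    for _ in range(generations):
        ranked = sorted(population, key=fitness, reverse=True)
        if fitness(ranked[0]) >= target:      # target condition reached
            break
        parents = ranked[: pop_size // 2]     # truncation selection
        children = [ranked[0][:]]             # elitism: keep the best mask
        while len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, n)         # single-point crossover
            child = a[:cut] + b[cut:]
            child = [(not g) if rng.random() < mutation_rate else g for g in child]
            children.append(child)
        population = children
    best = max(population, key=fitness)
    return best, fitness(best)
```

Calling `select_features(X_train, y_train, X_val, y_val)` returns a boolean mask over the fused feature vector together with its fitness. To follow the abstract more closely, the stand-in classifier would be replaced by an SVM trained on the masked features.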

Text in scene images usually carries significant information. Detecting and recognizing text in scenes is important for a variety of advanced machine vision applications, such as image and video retrieval, automotive assistance, and multilingual translation. In particular, most text recognition systems require text to be localized in the image beforehand, which makes detection a significant requirement. The purpose of this study is to provide a method for detecting text in natural images. The proposed approach combines the advantages of extremal region (ER) methods with classification by a convolutional neural network (CNN). This significantly reduces false positives and increases detection accuracy. Sliding windows of different sizes are used to determine text candidates. Enhanced ERs are extracted in three consecutive stages on three distinct color channels (R, G, and B), and the results are then combined by an add method. After grouping, the word candidates are classified into two classes, text and non-text, by a CNN classifier. Applying the non-maximum suppression (NMS) algorithm to detections of the same word selects the candidates with the highest probability. The average values of accuracy, recall, precision, and F-measure of the proposed text detection model on the ICDAR2013 database are 0.893, 0.962, 0.948, and 0.955, respectively. The optimal cut point of the proposed method is 0.648, which has the highest average accuracy, 91.93%. The AUC values of the ROC and PR curves for the proposed model are 0.851 and 0.718, respectively. These AUC results show an outstanding improvement over the best detection rate of previous methods. Experimental results on the ICDAR2011, ICDAR2013, and ICDAR2015 databases also demonstrate that our algorithm outperforms state-of-the-art scene text detection methods.
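The NMS step mentioned in this abstract can be illustrated with a short sketch. It shows the common greedy, IoU-based form of non-maximum suppression. The abstract does not say which overlap measure or threshold the authors use, so the `iou_threshold` value, the `(x1, y1, x2, y2)` box format, and the function names are assumptions made for illustration.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0

def non_max_suppression(boxes, scores, iou_threshold=0.5):
    """Greedy NMS: visit boxes by descending score and keep a box only if it
    does not overlap an already kept box by more than iou_threshold.
    Returns the indices of the kept boxes, highest score first."""
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep = []
    for i in order:
        if all(iou(boxes[i], boxes[j]) <= iou_threshold for j in keep):
            keep.append(i)
    return keep

# Two overlapping candidates for the same word, plus one separate word.
boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (50, 50, 60, 60)]
scores = [0.9, 0.8, 0.7]   # e.g. text probabilities from a classifier
kept = non_max_suppression(boxes, scores)   # → [0, 2]
```

In this example the second box overlaps the first with an IoU of about 0.68, so only the higher-scoring candidate for that word is kept, while the non-overlapping third box survives.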

Articles published on Text In Scene Images

Local Resultant Gradient Vector Difference and Inpainting for 3D Text Detection in the Wild

Text Region Conditional Generative Adversarial Network for Text Concealment in the Wild

A New Method for Arabic Text Detection in Natural Scene Images

Scene Text Detection Based on Expanding the Text Center Region for Bilingual Tibetan-Chinese

Unattached irregular scene text rectification with refined objective

Recognition of Devanagari Scene Text Using Autoencoder CNN

AksharaNet: A GPU Accelerated Modified Depth-Wise Separable Convolution for Kannada Text Classification

A Robot Object Recognition Method Based on Scene Text Reading in Home Environments

Movie Title Extraction and Script Separation Using Shallow Convolution Neural Network

An Optimized Feature Selection Technique in Diversified Natural Scene Text for Classification Using Genetic Algorithm

A novel pipeline framework for multi oriented scene text image detection and recognition

AT-Text: Assembling Text Components for Efficient Dense Scene Text Detection

Residual attention-based multi-scale script identification in scene text images

Language identification from multi-lingual scene text images: a CNN based classifier ensemble approach

Scene text detection using enhanced Extremal region and convolutional neural network

Adversarial learning based attentional scene text recognizer

Devanagari Text Detection From Natural Scene Images

Fully Convolutional Networks for Text Understanding in Scene Images

SynthText3D: synthesizing scene text images from 3D virtual worlds

Text detection in natural scene images using morphological component analysis and Laplacian dictionary
