Text Detection In Scene Images Research Articles

Text detection in scene image has become a hot topic in computer vision and artificial intelligence research, due to its wide range of applications and challenges. Most state-of-the-art methods for text detection based on deep learning rely on text bounding box regression. These methods can not well handle the case that if the scene text is curved. In this paper, we propose a new framework for arbitrarily oriented text detection in natural images based on fully convolutional neural networks. The main idea is to represent a text instance by two forms: text center block and word stroke region. These two elements are detected by two fully convolutional networks, respectively. Final detections are produced by the word region surrounding box algorithm. The proposed method does not need to regress the extant bounding box of the text instance, mainly because the predicted text block region itself implicitly contains position and orientation information. Besides, our method can well handle text in different languages, arbitrary orientations, curved shape and various fonts. To validate the effectiveness of the proposed method, we perform experiments on three public datasets: MSRA-TD500, USTB-SV1K and ICDAR2013, and compare it with other state-of-the-art methods. Experiment results demonstrate that the proposed method achieves competitive results. Based on VGG-16, our method achieves an F-measure of 78.84% on MSRA-TD500, 59.34% on USTB-SV1K, and 88.21% on ICDAR2013.

Developing a text detection method which is invariant to scripts in natural scene images is a challenging task due to different geometrical structures of various scripts. Besides, multi-oriented of text lines in natural scene images make the problem more challenging. This paper proposes to explore ring radius transform (RRT) for text detection in multi-oriented and multi-script environments. The method finds component regions based on convex hull to generate radius matrices using RRT. It is a fact that RRT provides low radius values for the pixels that are near to edges, constant radius values for the pixels that represent stroke width, and high radius values that represent holes created in background and convex hull because of the regular structures of text components. We apply k-means clustering on the radius matrices to group such spatially coherent regions into individual clusters. Then the proposed method studies the radius values of such cluster components that are close to the centroid and far from the centroid to detect text components. Furthermore, we have developed a Bangla dataset (named as ISI-UM dataset) and propose a semi-automatic system for generating its ground truth for text detection of arbitrary orientations, which can be used by the researchers for text detection and recognition in the future. The ground truth will be released to public. Experimental results on our ISI-UM data and other standard datasets, namely, ICDAR 2013 scene, SVT and MSRA data, show that the proposed method outperforms the existing methods in terms of multi-lingual and multi-oriented text detection ability.

Text Detection In Scene Images Research Articles

Related Topics

Articles published on Text Detection In Scene Images

A Method for Bilingual Tibetan-Chinese Scene Image Dataset Synthesis and Text Detection

Scene text detection with fully convolutional neural networks

Script independent approach for multi-oriented text detection in scene image

Text detection in scene images based on exhaustive segmentation

Text Detection in Scene Images Based on Interest Points

MULTI-ORIENTED TEXT DETECTION IN SCENE IMAGES

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Text Detection In Scene Images Research Articles

Related Topics

Articles published on Text Detection In Scene Images

A Method for Bilingual Tibetan-Chinese Scene Image Dataset Synthesis and Text Detection

Scene text detection with fully convolutional neural networks

Script independent approach for multi-oriented text detection in scene image

Text detection in scene images based on exhaustive segmentation

Text Detection in Scene Images Based on Interest Points

MULTI-ORIENTED TEXT DETECTION IN SCENE IMAGES