Localized Text Regions Research Articles

To localize text regions and separate close instances, recent scene text detection methods widely use the shrunk polygon. However, two problems remain: 1) existing methods do not account for aspect ratio sensitivity when reconstructing the text instance from the shrunk polygon; 2) texts with extreme aspect ratios cause the shrunk polygons to fracture. To handle these two problems, we propose a novel Adaptive Dilation Network (ADNet) that focuses on the reconstruction process from the shrunk polygon and aims to provide a tight and complete text representation. Firstly, instead of using a fixed dilation factor, ADNet uses an aspect ratio-wise dilation factor to reconstruct the text region from the shrunk polygon for each text instance. Such an instance-wise dilation factor considers the scale correlation between the original and shrunk polygons, and can thus guide an adaptive text region reconstruction for texts with large aspect ratio variance. Secondly, to deal with fractured detection results, a new Efficient Spatial Relationship Module (ESRM) is devised to capture long-range dependencies at low computation cost. ESRM uses a novel Weighted Pooling to reduce the resolution of feature maps without much information loss. Compared with existing methods, ADNet further explores the potential of shrunk polygon-based approaches and obtains excellent detection results at an impressive speed. Extensive experiments on several datasets (Total-Text, CTW1500, MSRA-TD500 and ICDAR2015) verify the superiority of our method. Code will be available at https://github.com/qqqyd/ADNet.
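
The aspect ratio sensitivity mentioned in this abstract can be illustrated with a small calculation on axis-aligned rectangles. The sketch below is not ADNet's method: it assumes the offset rule commonly used by shrunk-polygon detectors, D = A(1 - r^2) / L for shrinking and D' = A' * factor / L' for dilation, with a shrink ratio r = 0.4 and a fixed dilation factor of 1.5. None of these values come from the abstract, and the function names are hypothetical. It computes the dilation factor that would exactly recover the original box and shows how strongly that factor depends on aspect ratio.

```python
def shrink_offset(w, h, r=0.4):
    """Inward offset D = A * (1 - r^2) / L for a w x h rectangle (assumed rule)."""
    area = w * h
    perimeter = 2 * (w + h)
    return area * (1 - r ** 2) / perimeter


def shrunk_size(w, h, r=0.4):
    """Width and height after moving every edge inward by the shrink offset."""
    d = shrink_offset(w, h, r)
    return w - 2 * d, h - 2 * d


def dilate_offset(w_s, h_s, factor):
    """Outward offset D' = A' * factor / L' computed from the shrunk rectangle."""
    return (w_s * h_s) * factor / (2 * (w_s + h_s))


def ideal_factor(w, h, r=0.4):
    """Dilation factor that makes the outward offset equal the inward one."""
    d = shrink_offset(w, h, r)
    w_s, h_s = shrunk_size(w, h, r)
    return d * 2 * (w_s + h_s) / (w_s * h_s)


def reconstructed_height(w, h, factor, r=0.4):
    """Height recovered when the shrunk box is dilated with the given factor."""
    w_s, h_s = shrunk_size(w, h, r)
    return h_s + 2 * dilate_offset(w_s, h_s, factor)


if __name__ == "__main__":
    for w, h in [(100, 100), (400, 20)]:
        print(
            f"{w}x{h}: ideal factor {ideal_factor(w, h):.3f}, "
            f"height with fixed 1.5 -> {reconstructed_height(w, h, 1.5):.2f}"
        )
```

Under these assumptions a square box needs a factor close to 1.45, while a 400x20 box needs a factor above 4, so a single fixed factor reconstructs elongated text far too tightly. This is the kind of gap an instance-wise factor is meant to close.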

Text detection and localization are of great importance for content-based image analysis and text-based image indexing. The effectiveness of text recognition depends on the effectiveness of text localization, so the main goal of the proposed method is to detect and localize text regions with high accuracy. To achieve this goal, a new and efficient method is introduced for the localization of Bangla text in scene images. To improve precision and recall as well as f-measure, a Maximally Stable Extremal Region (MSER) based method is used together with a double filtering technique. Because the MSER algorithm generates many false positives, we introduce a double filtering method that removes these false positives and increases the f-measure to a great extent. Our proposed method works at three basic levels. Firstly, MSER regions are generated from the input color image after converting it into a gray scale image. Secondly, some heuristic features are used to filter out most of the false positives, or non-text regions. Lastly, a Stroke Width Transform (SWT) based filtering method is used to filter out the remaining non-text regions. The remaining components are then grouped into candidate text regions, each marked by a bounding box. As there is no benchmark database for Bangla text, the proposed method is evaluated on our own database of 200 scene images of Bangla text, where it achieves strong performance. To further evaluate the proposed approach, we have also tested it on the International Conference on Document Analysis and Recognition (ICDAR) 2013 benchmark database and obtained better results than the related existing methods.
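
The two filtering stages described in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: candidate regions are assumed to come from an MSER detector run on the gray scale image, each region is assumed to carry a list of per-pixel stroke widths from an SWT step, and the `Region` type, the function names and all thresholds are hypothetical.

```python
from dataclasses import dataclass
from statistics import mean, pstdev
from typing import List


@dataclass
class Region:
    """A candidate region (hypothetical container for MSER output)."""
    w: int                      # bounding-box width in pixels
    h: int                      # bounding-box height in pixels
    pixel_count: int            # number of pixels belonging to the region
    stroke_widths: List[float]  # stroke widths sampled inside the region


def passes_heuristics(r, min_area=30, max_aspect=8.0, min_fill=0.2):
    """First filter: simple geometric checks (thresholds are assumptions)."""
    if min(r.w, r.h) <= 0:
        return False
    aspect = max(r.w, r.h) / min(r.w, r.h)
    fill = r.pixel_count / (r.w * r.h)
    return r.pixel_count >= min_area and aspect <= max_aspect and fill >= min_fill


def passes_stroke_width(r, max_cv=0.5):
    """Second filter: text strokes tend to have nearly constant width, so
    reject regions whose stroke-width coefficient of variation is large."""
    if not r.stroke_widths:
        return False
    m = mean(r.stroke_widths)
    return m > 0 and pstdev(r.stroke_widths) / m <= max_cv


def double_filter(regions):
    """Apply the heuristic filter first, then the stroke-width filter."""
    survivors = [r for r in regions if passes_heuristics(r)]
    return [r for r in survivors if passes_stroke_width(r)]


if __name__ == "__main__":
    letter = Region(w=20, h=30, pixel_count=300, stroke_widths=[3, 3, 4, 3])
    thin_line = Region(w=200, h=3, pixel_count=600, stroke_widths=[3, 3, 3])
    blob = Region(w=40, h=40, pixel_count=1200, stroke_widths=[2, 10, 25, 4])
    kept = double_filter([letter, thin_line, blob])
    print(len(kept), "region(s) kept")
```

Running the cheap geometric checks before the stroke-width check means the costlier test is only evaluated for regions that survive the first pass. Grouping the surviving components into text lines is left out of this sketch.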

Articles published on Localized Text Regions

ADNet: Rethinking the Shrunk Polygon-Based Approach in Scene Text Detection

Dominating set based arbitrary oriented bilingual scene text localization

The Smart Assistant for Library Management and Book Reader for Blind People Using Raspberry Pi

FREE: A Fast and Robust End-to-End Video Text Spotter

An Enhanced MSER Pruning Algorithm for Detection and Localization of Bangla Texts from Scene Images

Text localization in camera captured images using fuzzy distance transform based adaptive stroke filter

Text Localization and Character Extraction in Natural Scene Images using Contourlet Transform and SVM Classifier

Fast and accurate scene text understanding with image binarization and off-the-shelf OCR

Portable Camera-Based Assistive Text and Product Label Reading From Hand-Held Objects for Blind Persons

Text Detection and Localization in Low Quality Video Images through Image Resolution Enhancement Technique
