Text Recognition Tasks Research Articles

Text instances, as one category of self-described objects, provide valuable information for understanding and describing cluttered scenes. The rich and precise high-level semantics embodied in text can greatly benefit our understanding of the world around us. While most recent visual phrase grounding approaches focus on general objects, this paper explores extracting designated texts and predicting unambiguous scene text information, i.e., accurately localizing and recognizing a specific targeted text instance in a cluttered image from natural language descriptions (referring expressions). To address this problem, we first propose a novel recurrent dense text localization network (DTLN) that sequentially decodes the intermediate convolutional representations of a cluttered scene image into a set of distinct text instance detections. Our approach avoids repeated text detections at multiple scales by recurrently memorizing previous detections, and effectively handles crowded text instances in close proximity. Second, we propose a context reasoning text retrieval (CRTR) model, which jointly encodes text instances and their context information through a recurrent network, and ranks localized text bounding boxes by a scoring function of context compatibility. Third, a recurrent text recognition module is introduced to extend the applicability of the aforementioned DTLN and CRTR models via text verification or transcription. Quantitative evaluations on standard scene text extraction benchmarks and a newly collected scene text retrieval dataset demonstrate the effectiveness and advantages of our models for the joint scene text localization, retrieval, and recognition task.
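The ranking step described in the abstract above, where localized text boxes are ordered by a context-compatibility score, can be illustrated with a minimal sketch. This is not the CRTR model itself: the abstract does not specify the scoring function, so cosine similarity between fixed vectors is swapped in here as an assumed stand-in for the learned recurrent encoding, and the names `cosine` and `rank_boxes` are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def rank_boxes(query_vec, box_vecs):
    """Rank candidate text boxes by compatibility with a query.

    query_vec: assumed embedding of the referring expression.
    box_vecs:  assumed embeddings of each localized box plus its context.
    Returns (order, scores), where order lists box indices from best to worst.
    """
    scores = [cosine(query_vec, b) for b in box_vecs]
    order = sorted(range(len(box_vecs)), key=lambda i: scores[i], reverse=True)
    return order, scores


# Toy example: three candidate boxes with 2-D embeddings.
order, scores = rank_boxes([1.0, 0.0], [[0.0, 1.0], [1.0, 0.1], [0.5, 0.5]])
print(order)
```

In this toy input the second box points almost in the same direction as the query, so it is ranked first; a learned model would produce the embeddings and the score jointly rather than from hand-picked vectors.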

Words are among the most indispensable sources of information in human life, and analyzing and understanding their meaning is very important. Compared with general visual elements, text conveys rich, high-level semantic information, which helps computers better understand semantic content. With the rapid development of computer technology, great progress has been made in text detection and recognition. However, detecting and recognizing text in natural scene images still has limitations: because natural scene images contain more interference and complexity than plain text, text detection and recognition in such images faces many challenges. To address this problem, this paper proposes a new text detection and recognition method for natural scene images based on deep convolutional neural networks. For text detection, the method obtains high-level visual features from low-level pixels with a ResNet network, extracts context features from character sequences with a BLSTM layer, and then introduces the vertical anchor idea from Faster R-CNN to find the bounding boxes of the detected text, which effectively improves text detection. For text recognition, a DenseNet model built with Keras is used for character recognition, and the Softmax output is used to classify each character. The method replaces hand-crafted features with automatically learned, context-based features. It improves the efficiency and accuracy of recognition, and achieves text detection and recognition in natural scene images. The method also achieved good experimental results on the PAC2018 competition platform.
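The vertical anchor idea mentioned in the abstract above can be sketched as follows: every anchor shares one fixed width and horizontal position, and only the height varies, so a text line is covered by a chain of narrow slices. The abstract gives no anchor sizes, so the width of 16 pixels, the stride, and the list of heights below are illustrative assumptions, and the function names are hypothetical.

```python
def vertical_anchors(center_x, center_y, width=16, heights=(11, 16, 23, 33, 48)):
    """Return (x1, y1, x2, y2) boxes that share a center and width but differ in height.

    width and heights are assumed values for illustration only.
    """
    boxes = []
    for h in heights:
        boxes.append((
            center_x - width / 2,
            center_y - h / 2,
            center_x + width / 2,
            center_y + h / 2,
        ))
    return boxes


def anchors_for_row(num_columns, center_y, stride=16, **kwargs):
    """Generate vertical anchors for each feature-map column along one row.

    stride is the assumed number of image pixels per feature-map column.
    """
    anchors = []
    for col in range(num_columns):
        center_x = col * stride + stride / 2  # middle of this column in image coordinates
        anchors.extend(vertical_anchors(center_x, center_y, **kwargs))
    return anchors


# Example: 4 feature-map columns, anchors centered on the image row y = 40.
row = anchors_for_row(4, center_y=40)
print(len(row))
```

Fixing the width means the detector only has to predict vertical position and height for each slice; neighbouring slices that are scored as text can then be linked into a full text-line box.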

Articles published on Text Recognition Tasks

Scene text spotting based on end-to-end

Random Blur Data Augmentation for Scene Text Recognition

Neural Network for Handwriting Recognition

Attention-Based Fully Gated CNN-BGRU for Russian Handwritten Text

Advanced Applications on Bilingual Document Analysis and Processing Systems

Unambiguous Text Localization, Retrieval, and Recognition for Cluttered Scenes.

DetReco: Object-Text Detection and Recognition Based on Deep Neural Network

Accurate, data-efficient, unconstrained text recognition with convolutional neural networks

All You Need Is Boundary: Toward Arbitrary-Shaped Text Spotting

Decoupled Attention Network for Text Recognition

A Method of Text Detection and Recognition from Receipt Images Based on CRAFT and CRNN

Toward Arbitrary-Shaped Text Spotting Based on End-to-End

Handwritten Arabic text recognition using multi-stage sub-core-shape HMMs

Mining the displacement of max-pooling for text recognition

Text Detection and Recognition for Natural Scene Images Using Deep Convolutional Neural Networks

A Novel Dataset for English-Arabic Scene Text Recognition (EASTR)-42K and Its Evaluation Using Invariant Feature Extraction on Detected Extremal Regions

Scene text recognition using residual convolutional recurrent neural network

TextBoxes++: A Single-Shot Oriented Scene Text Detector.

Context-Dependent Robust Text Recognition using Large-scale Restricted Bayesian Network

TextBoxes: A Fast Text Detector with a Single Deep Neural Network
