Attention-Based Deep Neural Network and Its Application to Scene Text Recognition

Haizhen He,Jiehan Li

doi:10.1109/iccsn.2019.8905385

Abstract

Recognize text in natural scenes is a challenging task. We proposed an attention-based deep neural network architecture for scene text recognition, which integrates feature extraction, feature attention, feature labeling and transcription into a unified framework. The primary advantages of the proposed model are: (1) it is an end-to-end model, does not require any segmentation of the input image. Convolutional neural network (CNN) is used as encoder to extract features, recurrent neural network (RNN) is used as decoder based on its characteristics of predict sequence, which composed a encoder-decoder architecture; (2) Soft Attention mechanism is introduced in, to further extract features in the input image, and allowing for end-to-end training within a standard back propagation framework; (3) Experiments are performed on several challenging scene text datasets, including IIIT5K, Street View Text, ICDAR2003 and ICDAR2013. Results of the experiments show that the proposed model is comparable or better than other models, which demonstrate the superiority of the proposed algorithm.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Attention-Based Deep Neural Network and Its Application to Scene Text Recognition

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Accurate Scene Text Recognition Based on Recurrent Neural Network
Bolan Su ... Shijian Lu
-
Bolan Su, et. al.Bolan Su ... Shijian Lu
01 Jan 2015
01 Jan 2015

Cursive Text Recognition in Natural Scene Images Using Deep Convolutional Recurrent Neural Network
Asghar Ali Chandio ... Mark R Pickering
IEEE Access | VOL. 10
Asghar Ali Chandio, et. al.Asghar Ali Chandio ... Mark R Pickering
01 Jan 2021
IEEE Access | VOL. 10

Soft set-based MSER end-to-end system for occluded scene text detection, recognition and prediction
Alloy Das ... Umapada Pal
Knowledge-Based Systems | VOL. 305
Alloy Das, et. al.Alloy Das ... Umapada Pal
01 Oct 2024
Knowledge-Based Systems | VOL. 305

An End-to-End Trainable Neural Network for Image-Based Sequence Recognition and Its Application to Scene Text Recognition.
Baoguang Shi ... Xiang Bai
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 39
Baoguang Shi, et. al.Baoguang Shi ... Xiang Bai
29 Dec 2016
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 39

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Attention-Based Deep Neural Network and Its Application to Scene Text Recognition

Abstract

Talk to us

Similar Papers