Reading Scene Text in Deep Convolutional Sequences

Pan He, Chen Loy, Weilin Huang, Xiaoou Tang, Yu Qiao

doi:10.1609/aaai.v30i1.10465

Abstract

We develop a Deep-Text Recurrent Network (DTRN) that regards scene text reading as a sequence labelling problem. We leverage recent advances in deep convolutional neural networks (CNNs) to generate an ordered high-level sequence from a whole word image, avoiding the difficult character segmentation problem. A deep recurrent model, built on long short-term memory (LSTM), is then developed to robustly recognise the generated CNN sequences, departing from most existing approaches, which recognise each character independently. Our model has a number of appealing properties in comparison to existing scene text recognition methods: (i) it can recognise highly ambiguous words by leveraging meaningful context information, allowing it to work reliably without either pre- or post-processing; (ii) the deep CNN feature is robust to various image distortions; (iii) it retains the explicit order information in the word image, which is essential to discriminate word strings; (iv) the model does not depend on a pre-defined dictionary, and it can process unknown words and arbitrary strings. It achieves impressive results on several benchmarks, advancing the state of the art substantially.
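
The sequence-labelling view in the abstract can be illustrated with a minimal sketch. It shows only the two bookkeeping steps around the networks: sliding a window left to right over a word image to obtain an ordered sequence of regions, and collapsing per-window label predictions into a word string. The CNN and LSTM themselves are omitted. The function names, window size, stride, and blank symbol are assumptions made for illustration and are not stated in the abstract; the collapsing rule is a common CTC-style convention used here as an assumption, not a confirmed detail of the authors' method.

```python
from itertools import groupby


def sliding_windows(width, window, stride):
    """Return (start, end) column ranges of windows slid left to right.

    Hypothetical helper: each range would be cropped from the word image
    and passed to a CNN, giving one feature vector per position in order.
    """
    if window > width:
        return [(0, width)]
    return [(s, s + window) for s in range(0, width - window + 1, stride)]


def collapse_labels(frame_labels, blank="-"):
    """Merge consecutive repeated labels, then drop blanks (CTC-style).

    Assumed decoding rule for turning per-window predictions into a string
    without segmenting individual characters.
    """
    merged = [label for label, _ in groupby(frame_labels)]
    return "".join(label for label in merged if label != blank)


# Illustrative values only: a 100-pixel-wide image, 32-pixel windows, stride 8.
windows = sliding_windows(100, 32, 8)
# Hypothetical per-window predictions, as a recurrent model might emit them.
word = collapse_labels(list("--hh-e-ll-ll-oo"))
```

Because the windows are produced in left-to-right order, the order information of the word image is preserved in the sequence, and the blank symbol lets repeated letters (the double "l" here) survive the merge step.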

Journal: Proceedings of the AAAI Conference on Artificial Intelligence
Publication Date: Mar 5, 2016
Citations: 200

Similar Papers

A pooling based scene text proposal technique for scene text reading in the wild
Dinh Nguyenvan ... Mounir Mokhtari
Pattern Recognition | VOL. 87
10 Oct 2018

Object Tracking Based on Deep CNN Features Together with Color Features and Sparse Representation
Yuchi Liu ... Yujuan Qi
01 Dec 2019

Object Tracking Based on Deep CNN Feature and Color Feature
Yujuan Qi ... Yanjiang Wang
01 Aug 2018

Accurate Scene Text Recognition Based on Recurrent Neural Network
Bolan Su ... Shijian Lu
01 Jan 2015
