Scene word recognition from pieces to whole

Anna Zhu,Seiichi Uchida

doi:10.1007/s11704-017-6420-2

Abstract

Convolutional neural networks (CNNs) have had great success with regard to the object classification problem. For character classification, we found that training and testing using accurately segmented character regions with CNNs resulted in higher accuracy than when roughly segmented regions were used. Therefore, we expect to extract complete character regions from scene images. Text in natural scene images has an obvious contrast with its attachments. Many methods attempt to extract characters through different segmentation techniques. However, for blurred, occluded, and complex background cases, those methods may result in adjoined or over segmented characters. In this paper, we propose a scene word recognition model that integrates words from small pieces to entire after-cluster-based segmentation. The segmented connected components are classified as four types: background, individual character proposals, adjoined characters, and stroke proposals. Individual character proposals are directly inputted to a CNN that is trained using accurately segmented character images. The sliding window strategy is applied to adjoined character regions. Stroke proposals are considered as fragments of entire characters whose locations are estimated by a stroke spatial distribution system. Then, the estimated characters from adjoined characters and stroke proposals are classified by a CNN that is trained on roughly segmented character images. Finally, a lexicon-driven integration method is performed to obtain the final word recognition results. Compared to other word recognition methods, our method achieves a comparable performance on Street View Text and the ICDAR 2003 and ICDAR 2013 benchmark databases. Moreover, our method can deal with recognizing text images of occlusion and improperly segmented text images.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Scene word recognition from pieces to whole

Abstract

Talk to us

Similar Papers

More From: Frontiers of Computer Science

Lead the way for us

Journal: Frontiers of Computer Science	Publication Date: Apr 1, 2019
Citations: 2

Similar Papers

Scene text recognition with high performance CNN classifier and efficient word inference
Xinhao Liu ... Takahito Kawanishi
-
Xinhao Liu, et. al.Xinhao Liu ... Takahito Kawanishi
01 Mar 2016
01 Mar 2016

Efficient Approach to Detect and Localize Text in Natural Scene Images
S R Surem Samuel ... C Seldev Christopher
-
S R Surem Samuel, et. al.S R Surem Samuel ... C Seldev Christopher
26 Nov 2014
26 Nov 2014

Detecting of Vertically-Oriented Texts in Images Containing Natural Scenes
Yi Ling Ong ... Almon Chai
-
Yi Ling Ong, et. al.Yi Ling Ong ... Almon Chai
07 Dec 2020
07 Dec 2020

Reading Text in the Wild with Convolutional Neural Networks
Max Jaderberg ... Andrea Vedaldi
International Journal of Computer Vision | VOL. 116
Max Jaderberg, et. al.Max Jaderberg ... Andrea Vedaldi
07 May 2015
International Journal of Computer Vision | VOL. 116

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Scene word recognition from pieces to whole

Abstract

Talk to us

Similar Papers

More From: Frontiers of Computer Science