NESP: Nonlinear enhancement and selection of plane for optimal segmentation and recognition of scene word images

Deepak Kumar,M N Anil Prasad,A G Ramakrishnan

doi:10.1117/12.2008519

Abstract

In this paper, we report a breakthrough result on the difficult task of segmentation and recognition of coloured text from the word image dataset of ICDAR robust reading competition challenge 2: reading text in scene images. We split the word image into individual colour, gray and lightness planes and enhance the contrast of each of these planes independently by a power-law transform. The discrimination factor of each plane is computed as the maximum between-class variance used in Otsu thresholding. The plane that has maximum discrimination factor is selected for segmentation. The trial version of Omnipage OCR is then used on the binarized words for recognition. Our recognition results on ICDAR 2011 and ICDAR 2003 word datasets are compared with those reported in the literature. As baseline, the images binarized by simple global and local thresholding techniques were also recognized. The word recognition rate obtained by our non-linear enhancement and selection of plance method is 72.8% and 66.2% for ICDAR 2011 and 2003 word datasets, respectively. We have created ground-truth for each image at the pixel level to benchmark these datasets using a toolkit developed by us. The recognition rate of benchmarked images is 86.7% and 83.9% for ICDAR 2011 and 2003 datasets, respectively.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

NESP: Nonlinear enhancement and selection of plane for optimal segmentation and recognition of scene word images

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Methods for text segmentation from scene images
Deepak Kumar ... A G Ramakrishnan
ELCVIA Electronic Letters on Computer Vision and Image Analysis | VOL. 13
Deepak Kumar, et. al.Deepak Kumar ... A G Ramakrishnan
07 Jun 2014
ELCVIA Electronic Letters on Computer Vision and Image Analysis | VOL. 13

A feature learning method for scene text recognition
Duong Ho Vu ... Ly Quoc Ngoc
-
Duong Ho Vu, et. al. Duong Ho Vu ... Ly Quoc Ngoc
01 Jan 2012
01 Jan 2012

A Hybrid Approach to Localize Farsi Text in Natural Scene Images
Maryam Darab ... Mohammad Rahmati
Procedia Computer Science | VOL. 13
Maryam Darab, et. al.Maryam Darab ... Mohammad Rahmati
01 Jan 2012
Procedia Computer Science | VOL. 13

Text detection in natural scene images using morphological component analysis and Laplacian dictionary
Shuping Liu ... Zhengtao Yu
IEEE/CAA Journal of Automatica Sinica | VOL. 7
Shuping Liu, et. al.Shuping Liu ... Zhengtao Yu
01 Jan 2020
IEEE/CAA Journal of Automatica Sinica | VOL. 7

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

NESP: Nonlinear enhancement and selection of plane for optimal segmentation and recognition of scene word images

Abstract

Talk to us

Similar Papers