Abstract

Localizing text in natural images is an important stage in many applications, such as content-based image retrieval, assistance systems for the visually impaired, automatic robot navigation in urban environments, and tourist assistance systems. However, due to variations in font, script, scale, orientation, color, shadow, and lighting conditions, robust scene text localization remains a challenging task. In this paper, we propose a novel method that localizes Farsi/Arabic and Latin text of different sizes, fonts, and orientations, including low-luminance-contrast and poor-quality text in natural images taken under uneven illumination. First, fast weighted median filtering, a nonlinear edge-preserving smoothing filter, is applied, followed by color-contrast-preserving decolorization, to make the localization system more robust to low-luminance-contrast and poor-quality text. To extract Farsi/Arabic and Latin scene text and filter out non-text regions, a unified framework is proposed that combines maximally stable extremal regions with a novel region detector called Stable Width Stroke Regions, which is based on closed boundary regions. Phase congruency and Laplacian operators are used to extract the closed boundary regions. Finally, mean shift clustering and the Radon transform are used to extract individual text lines. Experimental results show that the proposed method localizes low-luminance-contrast and low-quality scene text in both Farsi/Arabic and Latin scripts with encouraging accuracy.
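
To illustrate the edge-preserving smoothing used as the first stage, the following is a minimal sketch of a weighted median filter in plain Python. It is not the fast algorithm the paper relies on: it is a naive, self-guided variant, and the names `weighted_median`, `weighted_median_filter`, `radius`, and `sigma`, as well as the Gaussian intensity weighting, are assumptions chosen for illustration only.

```python
import math


def weighted_median(values, weights):
    """Return the smallest value whose cumulative weight reaches half the total."""
    pairs = sorted(zip(values, weights))
    half = sum(weights) / 2.0
    acc = 0.0
    for value, weight in pairs:
        acc += weight
        if acc >= half:
            return value
    return pairs[-1][0]


def weighted_median_filter(img, radius=1, sigma=20.0):
    """Naive weighted median filter on a 2D list of gray levels.

    Assumption: each neighbor is weighted by a Gaussian of its intensity
    difference from the center pixel, so pixels across a strong edge
    contribute almost nothing and the edge stays sharp.
    """
    height, width = len(img), len(img[0])
    out = [row[:] for row in img]
    for y in range(height):
        for x in range(width):
            center = img[y][x]
            vals, wts = [], []
            for dy in range(-radius, radius + 1):
                for dx in range(-radius, radius + 1):
                    # Clamp coordinates so border pixels reuse the nearest valid pixel.
                    yy = min(max(y + dy, 0), height - 1)
                    xx = min(max(x + dx, 0), width - 1)
                    v = img[yy][xx]
                    vals.append(v)
                    wts.append(math.exp(-((v - center) ** 2) / (2.0 * sigma ** 2)))
            out[y][x] = weighted_median(vals, wts)
    return out


# A dark/bright step edge with one mildly perturbed pixel on the dark side.
clean = [[10, 10, 10, 200, 200, 200] for _ in range(5)]
noisy = [row[:] for row in clean]
noisy[2][1] = 30
smoothed = weighted_median_filter(noisy)
```

In this toy case the perturbed pixel is pulled back to its neighborhood value while the step between the dark and bright halves is left untouched. A self-guided weighting like this one would not suppress a strong impulse, because such a pixel gives almost all the weight to itself; practical implementations typically take the weights from a separate guidance image.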
