Abstract

Text detection in natural scene images is challenging due to the significant variations of the appearance of the text itself and its interaction with the context. The popular stroke width transform (SWT) algorithm is highly efficient but sensitive to the defects of the edges extracted from the input image when searching for the matching edge pixels for computing potential stroke width. In this paper, we propose a novel seed-based variant of SWT that enhances significantly the robustness of the original algorithm to complicated image contextual interference and varied text appearance. We first search for the seed segment of strokes, which is defined as a consecutive sequence of neighbouring rays (pairs of edge pixels) with regular length and satisfying certain constraints, and grow from them to localize more stroke segments. We then exploit the principal width and direction information of stroke captured by the stroke segments detected to rectify inaccurate stroke width and recover missed stroke parts, which are resulted from erroneous and noisy edges in complex natural images. The stroke segments detected are also exploited to improve the accuracy of candidate character localization. The experimental results on public datasets demonstrated the effectiveness of the proposed method.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.