Abstract
Text detection and recognition in video frames is a challenging task due to low contrast and noise from background that hinder the processing. In this paper we have proposed a robust multi-oriented text detection approach in video frames. Proposed method uses sub pixel mapping based super resolution approach to enhance the image. Next, in this enhanced image, Histogram of Oriented Moment feature is extracted from connected components and Support Vector Machine (SVM) classifier is used for multi-oriented text/non-text identification. Finally, Recurrent Neural Network (RNN) based classifier is used for recognition of text characters. We have performed our experiment in ICDAR2013 Robust Reading Competition Karatzas et al. (2013). To evaluate the script independent text extraction performance, we tested our framework in IITR dataset Verma et al. (2016) that contains text of multiple scripts in scene images. We have obtained F-measure of 0.82 which surpasses the current state of art techniques in ICDAR2013 Karatzas et al. (2013) and 0.8 in IITR Dataset Verma et al. (2016) for text detection.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
More From: Journal of Visual Communication and Image Representation
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.