Abstract

Content-based image retrieval is an active research area because of vast applications of image and video collections. Text embedded in video can provide significant contribution in identifying the contents of the multimedia data and facilitate the process of video indexing, retrieval and analysis. Text tracking is a vital part of text extraction process. It can speed up the text extraction process and also improves the text localization accuracy for videos. This paper proposes a novel approach for text tracking in videos. This method experimentally proposes Maximally Stable Extremal Regions (MSER) and Speeded Up Robust Features (SURF) descriptors for text tracking in videos. MSER is used for interest point detection as Extremal regions are affine invariant, so it is a suitable option for text tracking which can undergo scale, rotation and translation changes. SURF is chosen as a feature descriptor because of its scale and rotation invariance, speed and robustness. Proposed technique is testified on two datasets; the first is designed to test the tracking methodology as part of this research and second is publicly available Youtube Video Text(YVT) dataset. Succeeding analysis on diverse datasets endures verification to the fact that the projected technique demonstrates visible improvements to text tracking in videos

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.