Abstract
Video generally contains two types of text: edited text (i.e., caption text) and natural text (i.e., scene text), which differ from each other in both nature and characteristics. These differing behaviors of caption and scene text lead to poor accuracy in video text recognition. In this paper, we explore wavelet decomposition and temporal coherency for the classification of caption and scene text. We propose using high-frequency wavelet sub-bands to separate text candidates, which are represented by high-frequency coefficients in an input word image. The proposed method studies the distribution of text candidates over word images based on the observation that the standard deviation of text candidates is high in the first zone, low in the middle zone, and high in the third zone. This distribution is extracted by mapping standard-deviation values to eight equal-sized bins formed over the range of the standard-deviation values. The correlation among bins at the first and second wavelet levels is explored to differentiate caption and scene text and to determine the number of temporal frames to be analyzed. The properties of caption and scene text are validated over the chosen temporal frames to find a stable property for classification. Experimental results on three standard datasets (ICDAR 2015, YVT and License Plate Video) show that the proposed method outperforms existing methods in terms of classification rate and, based on the classification results, significantly improves recognition rate.
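The bin-mapping and correlation steps mentioned in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function names, the choice of Pearson correlation, and the toy inputs are assumptions; the abstract specifies only that standard-deviation values are mapped to eight equal-sized bins over their range and that bins from the two wavelet levels are correlated.

```python
def std_histogram(std_values, n_bins=8):
    """Map standard-deviation values to n_bins equal-sized bins
    spanning [min, max] of the values, as the abstract describes."""
    lo, hi = min(std_values), max(std_values)
    width = (hi - lo) / n_bins or 1.0  # guard against a flat input
    hist = [0] * n_bins
    for v in std_values:
        # Clamp the top edge so the maximum value falls in the last bin.
        idx = min(int((v - lo) / width), n_bins - 1)
        hist[idx] += 1
    return hist

def pearson(a, b):
    """Pearson correlation between two equal-length bin histograms,
    one plausible choice for the 'correlation among bins' step."""
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    sa = sum((x - ma) ** 2 for x in a) ** 0.5
    sb = sum((y - mb) ** 2 for y in b) ** 0.5
    return cov / (sa * sb) if sa and sb else 0.0

# Hypothetical per-zone standard deviations at wavelet levels 1 and 2:
level1 = std_histogram([9.1, 8.7, 2.3, 1.9, 2.1, 8.9, 9.4, 8.5])
level2 = std_histogram([7.8, 7.2, 3.0, 2.6, 2.9, 7.5, 8.1, 7.0])
score = pearson(level1, level2)  # high score -> consistent behavior
```

A high correlation between the two levels' bin histograms would indicate the stable high-low-high pattern the method associates with one text class; a threshold on this score could then drive the caption/scene decision and the choice of how many temporal frames to examine.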