Improving text recognition by distinguishing scene and overlay text

Haojin Yang,Bernhard Quehl,Harald Sack

doi:10.1117/12.2181370

Haojin Yang, Bernhard Quehl + Show 1 more

https://doi.org/10.1117/12.2181370

Copy DOI

Export

Save

Cite

Publication Date: Feb 12, 2015

Citations: 2

Affiliation: Hasso Plattner Institute

Abstract
Full-Text
Similar Papers

Abstract

Listen

Video texts are closely related to the content of a video. They provide a valuable source for indexing and interpretation of video data. Text detection and recognition task in images or videos typically distinguished between overlay and scene text. Overlay text is artificially superimposed on the image at the time of editing and scene text is text captured by the recording system. Typically, OCR systems are specialized on one kind of text type. However, in video images both types of text can be found. In this paper, we propose a method to automatically distinguish between overlay and scene text to dynamically control and optimize post processing steps following text detection. Based on a feature combination a Support Vector Machine (SVM) is trained to classify scene and overlay text. We show how this distinction in overlay and scene text improves the word recognition rate. Accuracy of the proposed methods has been evaluated by using publicly available test data sets.

Full Text