Abstract

Recognizing texts in video is more complex than in other environments such as scanned documents. Video texts appear in various colors, unknown fonts and sizes, often affected by compression artifacts and low quality. In contrast to Latin texts, there are no publicly available datasets which cover all aspects of the Arabic Video OCR domain. This paper describes a new well-defined and annotated Arabic-Text-in-Video dataset called AcTiV 2.0. The dataset is dedicated especially to building and evaluating Arabic video text detection and recognition systems. AcTiV 2.0 contains 189 video clips serving as a raw material for creating 4063 key frames for the detection task and 10,415 cropped text images for the recognition task. AcTiV 2.0 is also distributed with its annotation and evaluation tools that are made open-source for standardization and validation purposes. This paper also reports on the evaluation of several systems tested under the proposed detection and recognition protocols.

Highlights

  • Broadcast news and public-affairs programs are a prominent source of information that provides daily updates on national and world news

  • The present study focuses on the Arabic video OCR problem

  • We present AcTiV 2.0 as an open Arabic-Text-in-Video dataset dedicated to benchmarking and comparison of systems for Arabic text detection, tracking and recognition

Read more

Summary

Introduction

Broadcast news and public-affairs programs are a prominent source of information that provides daily updates on national and world news. Text detection and recognition in video frames is more challenging. Few attempts have been made on the development of detection and recognition systems for overlaid text in Arabic news video [11,12,13]. These systems were tested on private datasets with different evaluation protocols and metrics that make direct comparison and objective benchmarking rather impractical. We present AcTiV 2.0 as an open Arabic-Text-in-Video dataset dedicated to benchmarking and comparison of systems for Arabic text detection, tracking and recognition.

Literature Review
Data Characteristics and Statistics
Annotation Guidelines
Evaluation Protocols and Metrics
Detection Protocols and Metrics
Recognition Protocols and Metrics
Application of AcTiV Datasets
LADI Detector
SID OCR
Experimental Results
Comparison with Other Methods
Conclusions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call