Evaluation of Commercial OCR: A New Goal Directed Methodology for Video Documents

Rémi Landais, Laurent Vinet, Jean-Michel Jolion

doi:10.1007/11551188_74

Abstract

Texts embedded in video streams convey crucial information for documentation. Many text detection and recognition systems have been designed to automatically extract such documentary data from video streams. Most of the research teams involved argue that commercial OCR systems do not work properly on images extracted from a video stream, and they therefore design their own detection systems. Nevertheless, commercial OCR systems have never been evaluated on such corpora. This article details a new methodology to evaluate a commercial OCR system on a video document. The methodology is goal directed: the system is penalized in proportion to the TF-IDF (Term Frequency, Inverse Document Frequency) scores of the texts [1]. We apply our methodology to Abbyy FineReader 6.0.

Keywords: Ground Truth, Video Stream, Text Detection, Scene Text, Recognition Stage (added by machine, not by the authors)
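The abstract does not state the penalty formula, so the following is only a minimal sketch of the general idea of a TF-IDF-weighted penalty, not the authors' method. It assumes each video document is reduced to a list of ground-truth word tokens, uses the common tf × log(N/df) weighting, and scores an OCR output by the share of TF-IDF weight it failed to recover. The function names `tfidf_scores` and `weighted_miss_penalty` and the toy corpus are hypothetical.

```python
import math
from collections import Counter


def tfidf_scores(documents):
    """Return one {term: tf-idf} dict per document (each a list of tokens).

    Assumed weighting: tf = count / document length, idf = ln(N / df).
    """
    n_docs = len(documents)
    doc_freq = Counter()
    for tokens in documents:
        doc_freq.update(set(tokens))  # count each term once per document

    scores = []
    for tokens in documents:
        counts = Counter(tokens)
        total = len(tokens)
        scores.append({
            term: (count / total) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return scores


def weighted_miss_penalty(truth_tokens, ocr_tokens, weights):
    """Share of the ground-truth tf-idf weight that the OCR output missed."""
    recognized = set(ocr_tokens)
    truth_terms = set(truth_tokens)
    total = sum(weights[t] for t in truth_terms)
    missed = sum(weights[t] for t in truth_terms if t not in recognized)
    return missed / total if total else 0.0


# Hypothetical toy corpus: ground-truth words from three video documents.
corpus = [
    ["election", "results", "paris", "news"],
    ["weather", "news", "paris"],
    ["sports", "news"],
]
weights = tfidf_scores(corpus)[0]

# Missing a word that occurs in every document costs nothing;
# missing a word specific to this document is penalized.
miss_common = weighted_miss_penalty(corpus[0], ["election", "results", "paris"], weights)
miss_rare = weighted_miss_penalty(corpus[0], ["results", "paris", "news"], weights)
print(miss_common, miss_rare)
```

Under this assumed weighting, an OCR error on a word that appears in every document contributes nothing to the penalty, while an error on a document-specific word is penalized more heavily, which is the sense in which such an evaluation is goal directed.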
