Challenges of Deep Learning-based Text Detection in the Wild

John Zelek,Mohamed A Naiel,Paul Fieguth,Steven Wardell,Zobeir Raisi

doi:10.15353/jcvis.v6i1.3543

Abstract

The reported accuracy of recent state-of-the-art text detection methods, mostly deep learning approaches, is in the order of 80% to 90% on standard benchmark datasets. These methods have relaxed some of the restrictions of structured text and environment (i.e., "in the wild") which are usually required for classical OCR to properly function. Even with this relaxation, there are still circumstances where these state-of-the-art methods fail. Several remaining challenges in wild images, like in-plane-rotation, illumination reflection, partial occlusion, complex font styles, and perspective distortion, cause exciting methods to perform poorly. In order to evaluate current approaches in a formal way, we standardize the datasets and metrics for comparison which had made comparison between these methods difficult in the past. We use three benchmark datasets for our evaluations: ICDAR13, ICDAR15, and COCO-Text V2.0. The objective of the paper is to quantify the current shortcomings and to identify the challenges for future text detection research.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Challenges of Deep Learning-based Text Detection in the Wild

Abstract

Talk to us

Similar Papers

More From: Journal of Computational Vision and Imaging Systems

Lead the way for us

Journal: Journal of Computational Vision and Imaging Systems	Publication Date: Jan 15, 2021
Citations: 2

Similar Papers

A survey of text detection and recognition algorithms based on deep learning technology
Xiao-Feng Wang ... Zhi-Ze Wu
Neurocomputing | VOL. 556
Xiao-Feng Wang, et. al.Xiao-Feng Wang ... Zhi-Ze Wu
18 Aug 2023
Neurocomputing | VOL. 556

Deformable scene text detection using harmonic features and modified pixel aggregation network
Tanmay Jain ... Cheng-Lin Liu
Pattern Recognition Letters | VOL. 152
Tanmay Jain, et. al.Tanmay Jain ... Cheng-Lin Liu
08 Oct 2021
Pattern Recognition Letters | VOL. 152

Arbitrary Shape Natural Scene Text Detection Method Based on Soft Attention Mechanism and Dilated Convolution
Xiao Qin ... Wei Fan
IEEE Access | VOL. 8
Xiao Qin, et. al.Xiao Qin ... Wei Fan
01 Jan 2020
IEEE Access | VOL. 8

A Text Detection and Recognition Method Based on PSENet and CRNN
Xin He ... Yi He
-
Xin He, et. al.Xin He ... Yi He
01 Sep 2022
01 Sep 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Challenges of Deep Learning-based Text Detection in the Wild

Abstract

Talk to us

Similar Papers

More From: Journal of Computational Vision and Imaging Systems