Fi-Fo Detector: Figure and Formula Detection Using Deformable Networks

Junaid Younas, Faisal Shafait, Muhammad Imran Malik, Shoaib Ahmed Siddiqui, Mohsin Munir, Paul Lukowicz, Sheraz Ahmed

doi:10.3390/app10186460

Abstract

We propose a novel hybrid approach that fuses traditional computer vision techniques with deep learning models to detect figures and formulas in document images. The approach first fuses three computer-vision-based image representations, i.e., color transform, connected component analysis, and distance transform, into what we term the Fi-Fo image representation. The Fi-Fo image representation is then fed to deep models, which learn a further refined representation for detecting figures and formulas. The approach is evaluated on the publicly available ICDAR-2017 Page Object Detection (POD) dataset and its corrected version. It produces state-of-the-art results for formula and figure detection in document images, with F1-scores of 0.954 and 0.922, respectively. Ablation results show that the Fi-Fo image representation achieves superior performance compared with the raw image representation. The results also establish that the hybrid approach helps deep models learn more discriminative and refined features.
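To make the idea of a fused three-channel representation concrete, here is a minimal pure-Python sketch. It is not the authors' implementation: the abstract names the three transforms but not their details, so the luminance weights, the binarization threshold, the use of 4-connectivity, the city-block distance metric, and the function names (`to_gray`, `connected_components`, `distance_transform`, `fuse`) are all assumptions chosen for illustration.

```python
from collections import deque

def to_gray(rgb):
    """Color transform (assumed): RGB rows -> 0..255 luminance, BT.601 weights."""
    return [[round(0.299 * r + 0.587 * g + 0.114 * b) for (r, g, b) in row]
            for row in rgb]

def binarize(gray, threshold=128):
    """Mark dark pixels (ink) as foreground; the threshold is an assumption."""
    return [[1 if v < threshold else 0 for v in row] for row in gray]

def connected_components(binary):
    """Label 4-connected foreground regions with BFS; returns (labels, count)."""
    h, w = len(binary), len(binary[0])
    labels = [[0] * w for _ in range(h)]
    count = 0
    for y in range(h):
        for x in range(w):
            if binary[y][x] and not labels[y][x]:
                count += 1
                labels[y][x] = count
                queue = deque([(y, x)])
                while queue:
                    cy, cx = queue.popleft()
                    for ny, nx in ((cy - 1, cx), (cy + 1, cx),
                                   (cy, cx - 1), (cy, cx + 1)):
                        if (0 <= ny < h and 0 <= nx < w
                                and binary[ny][nx] and not labels[ny][nx]):
                            labels[ny][nx] = count
                            queue.append((ny, nx))
    return labels, count

def distance_transform(binary):
    """City-block distance to the nearest foreground pixel (two-pass scan)."""
    h, w = len(binary), len(binary[0])
    far = h + w  # larger than any reachable distance in the grid
    dist = [[0 if binary[y][x] else far for x in range(w)] for y in range(h)]
    for y in range(h):                      # forward pass: top and left
        for x in range(w):
            if y > 0:
                dist[y][x] = min(dist[y][x], dist[y - 1][x] + 1)
            if x > 0:
                dist[y][x] = min(dist[y][x], dist[y][x - 1] + 1)
    for y in range(h - 1, -1, -1):          # backward pass: bottom and right
        for x in range(w - 1, -1, -1):
            if y < h - 1:
                dist[y][x] = min(dist[y][x], dist[y + 1][x] + 1)
            if x < w - 1:
                dist[y][x] = min(dist[y][x], dist[y][x + 1] + 1)
    return dist

def scale(channel):
    """Rescale a channel to 0..255 so the three channels are comparable."""
    peak = max(max(row) for row in channel) or 1
    return [[round(255 * v / peak) for v in row] for row in channel]

def fuse(rgb):
    """Stack (gray, component labels, distance) into one 3-channel image."""
    gray = to_gray(rgb)
    binary = binarize(gray)
    labels, _ = connected_components(binary)
    cc, dt = scale(labels), scale(distance_transform(binary))
    return [[(gray[y][x], cc[y][x], dt[y][x]) for x in range(len(rgb[0]))]
            for y in range(len(rgb))]
```

Calling `fuse` on a list of rows of `(r, g, b)` tuples returns an image of the same size whose pixels are `(gray, component, distance)` triples, which is the shape of input a standard three-channel detector expects.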

Highlights

Digitization of document images is a growing need for commercial and non-commercial entities such as banks, industries, educational institutes, and libraries.
The ICDAR-2017 Page Object Detection (POD) dataset was released for a competition focused on figure, formula, and table detection in document images.
We evaluate the deformable variants of Faster R-CNN, region-based fully convolutional networks (R-FCN), and FPN.
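Detection results of this kind are reported as F1-scores, which combine precision and recall over matched bounding boxes. The sketch below shows one common way to compute such a score, assuming boxes given as `(x1, y1, x2, y2)`, greedy one-to-one matching, and an intersection-over-union (IoU) threshold of 0.6. The threshold, the matching strategy, and the function names are illustrative assumptions, not the evaluation protocol stated in this text.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix = max(0, min(a[2], b[2]) - max(a[0], b[0]))
    iy = max(0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = ix * iy
    union = ((a[2] - a[0]) * (a[3] - a[1])
             + (b[2] - b[0]) * (b[3] - b[1]) - inter)
    return inter / union if union else 0.0

def f1_score(predictions, ground_truth, threshold=0.6):
    """F1 over boxes; each ground-truth box can be matched at most once."""
    unmatched = list(ground_truth)
    true_pos = 0
    for pred in predictions:
        # Pick the still-unmatched ground-truth box that overlaps most.
        best = max(unmatched, key=lambda gt: iou(pred, gt), default=None)
        if best is not None and iou(pred, best) >= threshold:
            unmatched.remove(best)
            true_pos += 1
    false_pos = len(predictions) - true_pos
    false_neg = len(unmatched)
    precision = true_pos / (true_pos + false_pos) if predictions else 0.0
    recall = true_pos / (true_pos + false_neg) if ground_truth else 0.0
    total = precision + recall
    return 2 * precision * recall / total if total else 0.0
```

With one correct and one spurious prediction against two ground-truth boxes, precision and recall are both 0.5, so the F1-score is also 0.5.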

Summary

Introduction

Digitization of document images is a growing need for commercial and non-commercial entities such as banks, industries, educational institutes, and libraries. Aside from record-keeping, it makes data available at a click or a tap from anywhere in the world, at any time. Digitized documents can be processed automatically, provided that the information they contain can be extracted reliably. Reliable extraction of information from documents has been a major focus of the document analysis community for decades [1,2,3,4].


Journal: Applied Sciences
Publication Date: Sep 16, 2020
Citations: 15
License type: CC BY 4.0
