Abstract

In this paper, a keyword detection scheme is proposed based on deep convolutional neural networks for personal information protection in document images. The proposed scheme is composed of two parts: key character detection and lexicon analysis. The first part, key character detection, is developed based on RetinaNet and transfer learning. RetinaNet, which consists of convolutional layers with a feature pyramid network and two subnets, is exploited to detect key characters within the region of interest in a document image. The second part, lexicon analysis, analyzes and combines several detected key characters to find the keywords. To train the RetinaNet model, synthetic image generation and data augmentation are exploited to yield a large image dataset. To evaluate the proposed scheme, many document images are selected for testing, and two performance measurements, IoU (Intersection over Union) and mAP (Mean Average Precision), are used. Experimental results show that the mAP rates of the proposed scheme are 85.1% and 85.84% for key character detection and keyword detection, respectively. Furthermore, the proposed scheme is superior to Tesseract OCR (Optical Character Recognition) software for detecting key characters in document images. The experimental results demonstrate that the proposed method can effectively localize and recognize keywords within noisy document images containing Mandarin Chinese words.
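The IoU metric used in the evaluation above has a standard definition for axis-aligned bounding boxes; a minimal sketch (not taken from the paper, which does not give code) is:

```python
def iou(box_a, box_b):
    """Intersection over Union of two axis-aligned boxes given as (x1, y1, x2, y2)."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b
    # Corners of the intersection rectangle.
    ix1, iy1 = max(ax1, bx1), max(ay1, by1)
    ix2, iy2 = min(ax2, bx2), min(ay2, by2)
    # Clamp to zero when the boxes do not overlap.
    inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)
    area_a = (ax2 - ax1) * (ay2 - ay1)
    area_b = (bx2 - bx1) * (by2 - by1)
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0
```

A detection is typically counted as correct when its IoU with a ground-truth box exceeds a threshold such as 0.5; mAP then averages the resulting precision over recall levels and classes.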

Highlights

  • Paper documents carry a large amount of information for human communication

  • The experimental results demonstrate that the proposed scheme is superior to Tesseract OCR software for detecting key characters in noisy document images

  • A keyword detection scheme was proposed based on deep convolutional neural networks for personal information protection in document images


Summary

Introduction

Paper documents carry a large amount of information for human communication. They often contain typical elements such as text, tables, stamps, and signatures. Compared with traditional machine-learning methods developed based on handcrafted features, Deep Neural Networks (DNN) [7,8] have received more and more attention due to their excellent performance in image classification, speech recognition, fraud detection, and so on. In [16], a fast CNN-based method is proposed to automatically perform layout analysis for document images: a document image is segmented into blocks, and each block is classified by a CNN into one of three categories, i.e., text, table, or image. Certain Mandarin Chinese words such as “業主” (Property Owner) and “起造人姓名” (Name of Applicant) can be used as special information to distinguish Figure 1a from Figure 1b.
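The keyword examples above suggest how lexicon analysis can follow character detection: detected characters are ordered by position and matched against a list of known keywords. The sketch below is illustrative only (the function name, input format, and greedy matching strategy are assumptions, not the paper's exact algorithm):

```python
def match_keywords(detections, lexicon):
    """Match detected characters against a keyword lexicon.

    detections: list of (character, x_center) pairs from a character detector.
    lexicon: iterable of keyword strings, e.g. {"業主", "起造人姓名"}.
    Returns the keywords that appear in the left-to-right character sequence.
    """
    # Order detected characters by horizontal position (left to right).
    ordered = sorted(detections, key=lambda d: d[1])
    text = "".join(char for char, _ in ordered)
    # Report every lexicon keyword found as a contiguous substring.
    return [kw for kw in lexicon if kw in text]
```

A real system would also need to handle multi-line layouts and detection errors (missed or spurious characters), which a plain substring match does not cover.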

System Description
ROI Localization
CNN-Based Key Character Detection
RetinaNet Architecture
Model Training Procedure
Lexicon Analysis
Experimental Results
Keyword Detection
Comparison with Tesseract for Key Character Recognition
Conclusions and Future Work
