Analyzing the potential of active learning for document image classification

Saifullah Saifullah, Stefan Agne, Andreas Dengel, Sheraz Ahmed

doi:10.1007/s10032-023-00429-8

Abstract

Deep learning has been extensively researched in the field of document analysis and has shown excellent performance across a wide range of document-related tasks. As a result, considerable emphasis is now being placed on its practical deployment and integration into modern industrial document processing pipelines. It is well known, however, that deep learning models are data-hungry and often require large volumes of annotated data to achieve competitive performance. Because data annotation is a costly and labor-intensive process, it remains one of the major hurdles to their practical deployment. This study investigates the possibility of using active learning (AL) to reduce the cost of data annotation in the context of document image classification, one of the core components of modern document processing pipelines. The results demonstrate that, by utilizing AL, deep document classification models can achieve performance competitive with models trained on fully annotated datasets and, in some cases, even surpass them while annotating only 15–40% of the total training dataset. Furthermore, this study demonstrates that modern AL strategies significantly outperform random querying and in many cases achieve performance comparable to models trained on fully annotated datasets, even in the presence of practical deployment issues such as data imbalance and annotation noise. They thus offer tremendous benefits for the real-world deployment of deep document classification models. The code to reproduce our experiments is publicly available at https://github.com/saifullah3396/doc_al.
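The contrast the abstract draws between AL strategies and random querying can be illustrated with a minimal sketch. The abstract does not name the strategies that were evaluated, so the example below assumes entropy-based uncertainty sampling, one common AL query strategy, and compares it with a random baseline. The function names (`entropy`, `query_by_entropy`, `query_at_random`) and the toy probabilities are hypothetical and are not taken from the authors' repository.

```python
import math
import random


def entropy(probs):
    """Shannon entropy (in nats) of one predicted class distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def query_by_entropy(pool_probs, budget):
    """Return indices of the `budget` unlabeled samples with highest entropy.

    Assumption: `pool_probs[i]` holds the model's softmax output for the
    i-th unlabeled document image.
    """
    ranked = sorted(
        range(len(pool_probs)),
        key=lambda i: entropy(pool_probs[i]),
        reverse=True,
    )
    return ranked[:budget]


def query_at_random(pool_size, budget, seed=0):
    """Random-querying baseline: pick `budget` distinct pool indices."""
    rng = random.Random(seed)
    return rng.sample(range(pool_size), budget)


# Toy predictions for four unlabeled documents over three classes.
pool_probs = [
    [0.98, 0.01, 0.01],  # confident prediction, low entropy
    [0.34, 0.33, 0.33],  # near-uniform prediction, highest entropy
    [0.70, 0.20, 0.10],
    [0.50, 0.49, 0.01],
]

# Indices of the two documents to send for annotation in this round.
selected = query_by_entropy(pool_probs, budget=2)
baseline = query_at_random(len(pool_probs), budget=2)
```

In a full AL loop, the selected documents would be annotated, added to the labeled set, and the model retrained before the next query round. The sketch covers only the selection step.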

Journal: International Journal on Document Analysis and Recognition (IJDAR)
Publication Date: Apr 25, 2023
Citations: 2
License type: CC BY 4.0

Similar Papers

Abstract 184: The utility of deep metric learning for breast cancer identification on mammographic images
Justin Du ... Enoch Chang
Cancer Research | VOL. 81
01 Jul 2021

Active Deep Learning Technique for Underwater Target Recognition
Jiankun Lyu ... Chao Yang
17 Oct 2022

Deep Bayesian Active Learning for Learning to Rank: A Case Study in Answer Selection
Qunbo Wang ... Yongchi Zhao
IEEE Transactions on Knowledge and Data Engineering | VOL. 34
01 Nov 2022

How useful is active learning for image‐based plant phenotyping?
Koushik Nagasubramanian ... Arti Singh
The Plant Phenome Journal | VOL. 4
01 Jan 2020
