Classification of binary document images into textual or nontextual data blocks using neural network models

Daniel X Le,George R Thoma,Harry Wechsler

doi:10.1007/s001380050010

Abstract

This paper describes a new method for the classification of binary document images as textual or nontextual data blocks using neural network models. Binary document images are first segmented into blocks by the constrained run-length algorithm (CRLA). The component-labeling procedure is used to label the resulting blocks. The features for each block, calculated from the coordinates of its extremities, are then fed into the input layer of a neural network for classification. Four neural networks were considered, and they include back propagation (BP), radial basis functions (RBF), probabilistic neural network (PNN), and Kohonen's self-organizing feature maps (SOFMs). The performance and behavior of these neural network models are analyzed and compared in terms of training times, memory requirements, and classification accuracy. The experiments carried out on a variety of medical journals show the feasibility of using the neural network approach for textual block classification and indicate that in terms of both accuracy and training time RBF should be preferred.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Classification of binary document images into textual or nontextual data blocks using neural network models

Abstract

Talk to us

Similar Papers

More From: Machine Vision and Applications

Lead the way for us

Journal: Machine Vision and Applications	Publication Date: Oct 1, 1995
Citations: 21

Similar Papers

Document classification using connectionist models
D.X Le ... H Wechsler
-
D.X Le, et. al.D.X Le ... H Wechsler
27 Jun 1994
27 Jun 1994

Neural network models in EMG diagnosis
C.S Pattichis ... L.T Middleton
IEEE Transactions on Biomedical Engineering | VOL. 42
C.S Pattichis, et. al.C.S Pattichis ... L.T Middleton
01 May 1995
IEEE Transactions on Biomedical Engineering | VOL. 42

Classification of power system voltage stability conditions using Kohonen's self-organising feature map and learning vector quantisation
Abhinandan De ... Abhijit Chakrabarti
European Transactions on Electrical Power | VOL. 22
Abhinandan De, et. al.Abhinandan De ... Abhijit Chakrabarti
19 Oct 2011
European Transactions on Electrical Power | VOL. 22

Power flow classification for static security assessment
D Niebur ... A.J Germond
-
D Niebur, et. al.D Niebur ... A.J Germond
23 Jul 1991
23 Jul 1991

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Classification of binary document images into textual or nontextual data blocks using neural network models

Abstract

Talk to us

Similar Papers

More From: Machine Vision and Applications