Balanced Image Data Based Ensemble of Convolutional Neural Networks

Zohaib Md Jan,Brijesh Verma

doi:10.1109/ssci44817.2019.9002726

Abstract

Convolutional Neural Networks (CNN) have been very successful in classifying image datasets predominantly because they can automatically extract features and do not scale in complexity with large images due to the presence of multiple convolutional layers. Though being effective they are constrained with the availability of large amounts of data. CNNs normally require large amounts of input data in order to effectively train a network that can achieve reasonable classification performance. However, training of CNNs with large and unbalanced image data is inefficient in terms of time and accuracy. Therefore in this paper, we propose a novel method to create balanced image data and an ensemble of small CNNs. This will help improving learning and accuracy in deep learning applications. One of the key concepts proposed in this paper is to cluster input images into class pure data clusters. A class pure data cluster is a data cluster that has patterns from a same input class. Further all class pure data clusters are balanced by adding samples from other classes based on a distance measurement generating a pool of balanced data clusters. CNNs are trained on all balanced data clusters and fused using majority voting. We have evaluated the proposed method on well-known benchmark image datasets and the results showed that the proposed technique was able to achieve higher accuracy than the standard CNN.

Full Text