Abstract

The restricted Boltzmann machine (RBM) is a primary building block of deep learning models. As an efficient representation learning approach, deep RBM can effectively extract sophisticated and informative features from raw data. Little research has been undertaken on using deep RBM to extract features from big data however. In this paper, we investigate this problem, and an ensemble approach for big data classification based on Hadoop MapReduce and fuzzy integral is proposed. The proposed method consists of two stages, map and reduce. In the map stage, multiple RBM-based classifiers used for ensemble are trained in parallel. In the reduce stage, the trained multiple RBM-based classifiers are integrated by fuzzy integral. Experiments on five big data sets show that the proposed approach can outperform other baseline methods to achieve state-of-the-art performance.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call