Research on Parallel Classification Algorithms for Large-scale Data

Lijuan Zhou, Wenbo Wang, Hui Wang

doi:10.4156/jcit.vol7.issue21.41

Journal: Journal of Convergence Information Technology
Publication Date: Nov 30, 2012
Citations: 6

Keywords: MapReduce Programming Model, Data Mining

Abstract

As data volumes grow and the demand for personalized data mining increases, traditional centralized data mining methods can no longer keep up. Cloud computing provides an inexpensive solution for storing, analyzing, and processing massive data. To enable parallel data mining in a cloud environment, this paper proposes an improved algorithm based on traditional Naive Bayes. First, the design of the improved algorithm under the MapReduce programming model is presented. The algorithm is then tested on real data. The experimental results show that the new algorithm achieves higher performance and better scalability.

Full Text