Design and implementation of parallel statiatical algorithm based on Hadoop's MapReduce model

Songqing Duan,Juan Yang,Bin Wu,Bai Wang

doi:10.1109/ccis.2011.6045047

Design and implementation of parallel statiatical algorithm based on Hadoop's MapReduce model

Songqing Duan, Juan Yang + Show 2 more

https://doi.org/10.1109/ccis.2011.6045047

Copy DOI

Publication Date: Sep 1, 2011

Citations: 6

Affiliation: Beijing University of Posts and Telecommunications

#Hadoop's MapReduce Model #Hadoop's MapReduce + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

The rapid growth of data promotes the development of parallel computing. MapReduce, which is a simplified programming model of distributed parallel computing, is becoming more and more popular. In this paper, we design and implementation of parallel statistical algorithm based on Hadoop's MapReduce model. The algorithm, which is used to grasp the overall characteristics of massive data, involves the calculation of central tendency, dispersion and distribution tendency. By experiment, we come to the conclusion that the algorithm is suitable for dealing with large-scale data.

Full Text