Prediction of Cancer Disease Using Classification Techniques in Map Reduce Programming Model

M A Saleem Durai, Jaiti Handa, Anbarasi M

doi:10.4018/978-1-5225-2863-0.ch007

Abstract

As the volume of data increases over time, the primary issue is how to store and process such data and extract useful information from it. An analysis of classification algorithms and the MapReduce programming model led to the conclusion that the distributed file system and parallel computing attributes of MapReduce are well suited to designing a classifier model. The main reason is parallel processing: the data is divided into parts that are processed in parallel, and the output from each part is then reduced to a single output. In this paper, we study how to use the MapReduce model to build a classifier model. We use a cancer dataset to predict whether a person has cancer, using the Naive Bayes and KNN classification algorithms. We compare them on the basis of computational time and on factors such as sensitivity, specificity, and accuracy. Finally, we compare the two algorithms and identify which one works better on the MapReduce programming model.
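
The divide, process in parallel, then reduce pattern described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it uses plain Python in place of a MapReduce framework, a categorical Naive Bayes with Laplace smoothing as one possible variant, and a tiny made-up dataset. All function names (`map_partition`, `reduce_counts`, `predict`, `confusion_metrics`) and the data splits are hypothetical and chosen only for illustration.

```python
import math
from collections import Counter
from functools import reduce


def map_partition(rows):
    """Map step: count class labels and (label, feature index, value) triples in one split."""
    counts = Counter()
    for features, label in rows:
        counts[("class", label)] += 1
        for i, value in enumerate(features):
            counts[("feat", label, i, value)] += 1
    return counts


def reduce_counts(a, b):
    """Reduce step: merge the partial counts produced by two splits."""
    return a + b


def predict(counts, features, labels=(0, 1), n_values=2):
    """Pick the label with the highest Naive Bayes log-score (Laplace smoothing).

    Assumes every label in `labels` occurs at least once in the training data.
    """
    total = sum(counts[("class", c)] for c in labels)
    scores = {}
    for c in labels:
        score = math.log(counts[("class", c)] / total)
        for i, value in enumerate(features):
            numerator = counts[("feat", c, i, value)] + 1
            score += math.log(numerator / (counts[("class", c)] + n_values))
        scores[c] = score
    return max(scores, key=scores.get)


def confusion_metrics(y_true, y_pred, positive=1):
    """Return (sensitivity, specificity, accuracy) for binary labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    return tp / (tp + fn), tn / (tn + fp), (tp + tn) / len(pairs)


# Hypothetical toy data: two splits of (binary features, label) records.
split_a = [((1, 1), 1), ((1, 0), 1), ((0, 0), 0)]
split_b = [((0, 1), 0), ((1, 1), 1), ((0, 0), 0)]

# Each split is mapped independently, then the partial counts are reduced to one model.
model = reduce(reduce_counts, map(map_partition, [split_a, split_b]))
```

Because the map step only produces counts, each split can be processed on a separate node and the partial results merged in any order. KNN does not reduce to simple counts in the same way, which is one reason the two algorithms may differ in computational time under this model.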

Similar Papers

Image filtering with MapReduce in pseudo-distribution mode
Tharindu D Gamage ... Ajith A Pasqual
01 Apr 2015

A flexible and concurrent MapReduce programming model for shared-data applications
Fan Zhang ... Qutaibah M Malluhi
01 Jan 2012

Large-scale time series data down-sampling based on Map-Reduce programming mode
Jiajia Xu ... Yichang Qiu
01 Mar 2017

Imbalanced Classification for Big Data
Alberto Fernández ... Ronaldo C Prati
01 Jan 2018
