A new MapReduce solution for associative classification to handle scalability and skewness in vertical data structure

Devendra K Tayal,Kanak Meena

doi:10.1016/j.future.2019.09.040

Abstract

Associative classification is a promising methodology in information mining that uses the association rule discovery procedures to build the classifier. But they have some limitations like: they are not able to handle big data as they have memory constraints, high time complexity, load imbalance and data skewness. Data skewness occurs invariably when big data analytics comes in picture and affects the efficiency of an approach. This paper presents the MapReduce solution for associative classification in respect of vertical data layout. To handle these problems we have proposed two algorithms MR-MCAR-F (MapReduce-Multi Class Associative Classifier-MapReduce fast algorithm) and MR-MCAR-L (MapReduce-Multi Class Associative Classifier Load parallel frequent pattern growth algorithm). Also in this paper, MapReduce solution of Tid List and Database coverage has been proposed. We have used three type of pruning techniques viz. database coverage, global and distributed pruning. The proposed approaches have been compared with latest approach from the literature survey in terms of accuracy, computation time and data skewness. The existing scalable approaches cannot handle skewness while, our proposed method handles it in a very effective manner. All the experiments have been performed on six datasets which have been extracted from UCI repositories on the Hadoop framework. Proposed algorithms are scalable solutions for associative classification to handle big data and data skewness.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A new MapReduce solution for associative classification to handle scalability and skewness in vertical data structure

Abstract

Talk to us

Similar Papers

More From: Future Generation Computer Systems

Lead the way for us

Journal: Future Generation Computer Systems	Publication Date: Sep 24, 2019
Citations: 7

Similar Papers

A MapReduce solution for associative classification of big data
Alessio Bechini ... Armando Segatori
Information Sciences | VOL. 332
Alessio Bechini, et. al.Alessio Bechini ... Armando Segatori
31 Oct 2015
Information Sciences | VOL. 332

Analysis and processing of academic data from a higher institution with tools for big data
Juan-Pablo Urena-Torres ... Maria Belen Mora Arciniegas
-
Juan-Pablo Urena-Torres, et. al.Juan-Pablo Urena-Torres ... Maria Belen Mora Arciniegas
01 Jun 2017
01 Jun 2017

An optimized approach for unbalanced big data categorizing using fuzzy clustering
Saman Fallah Mehneh ... Jalilgazalan Toosi
-
Saman Fallah Mehneh, et. al.Saman Fallah Mehneh ... Jalilgazalan Toosi
01 Nov 2014
01 Nov 2014

Impact of Big Data on Innovation, Competitive Advantage, Productivity, and Decision Making: Literature Review
Nadeem U Shahid ... Nasir J Sheikh
Open Journal of Business and Management | VOL. 09
Nadeem U Shahid, et. al.Nadeem U Shahid ... Nasir J Sheikh
01 Jan 2020
Open Journal of Business and Management | VOL. 09

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A new MapReduce solution for associative classification to handle scalability and skewness in vertical data structure

Abstract

Talk to us

Similar Papers

More From: Future Generation Computer Systems