Improving Efficiency of Classification using PCA and Apriori based Attribute Selection Technique

K Rajeswari,V Vaithiyanathan,Rohit Garud

doi:10.19026/rjaset.6.3485

K Rajeswari, V Vaithiyanathan + Show 1 more

Open Access

https://doi.org/10.19026/rjaset.6.3485

Copy DOI

Abstract

The aim of this study is to select significant features that contribute for accuracy in classification. Data mining is a field where we find lots of data which can be useful or useless in any form available in Data Warehouse. Implementing classification on these huge, uneven, useless data sets with large number of features is just a waste of time degrading the efficiency of classification algorithms and hence the results are not much accurate. Hence we propose a system in which we first use PCA (Principal Component Analysis) for selection of the attributes on which we perform Classification using Bayes theorem, Multi-Layer Perceptron, Decision tree J48 which indeed has given us better result than that of performing Classification on the huge complete data sets with all the attributes. Also association rule mining using traditional Apriori algorithm is experimented to find out sub set of features related to class label. The experiments are conducted using WEKA 3.6.0 Tool.

Highlights

Data mining is a field were huge amount of data which is been mined form data warehouse
Classification is divided into two categories supervised and unsupervised, Supervised classification is the technique in which label is already known before Classification and in Unsupervised we need to find it based on the training sets and apply it on test data
This study proposes a method where classification technique is used only with the important attributes using feature selection techniques namely PCA (Principal Component Analysis) and Association rule mining technique which will select the subset attributes significant for classification

Summary

INTRODUCTION

Data mining is a field were huge amount of data which is been mined form data warehouse. This study proposes a method where classification technique is used only with the important attributes using feature selection techniques namely PCA (Principal Component Analysis) and Association rule mining technique which will select the subset attributes significant for classification. Multi-layer-perceptron: It is the classification algorithm based on neural network which takes a lot of time to execute but the result accuracy is efficient. In our proposal we chose J48 as a decision tree algorithm, Bayes as Bayesian type and Multi-layerPerceptron as Neural Network based classification algorithm because they are the best in their fields of classification techniques. The study (Phyu, 2009) concludes the comparisons of the classification algorithms based on accurate system results and depicts how decision tree and bayes classification technique is well suited for good accuracy. The time taken for training to model the independent variables to dependent variables is large (Rajeswari and Vaithiyanathan, 2012c)

PROPOSED METHODOLOGY

Findings

CONCLUSION

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Research Journal of Applied Sciences, Engineering and Technology	Publication Date: Dec 25, 2013
Citations: 2	License type: cc-by

R Discovery Prime

R Discovery Prime

Improving Efficiency of Classification using PCA and Apriori based Attribute Selection Technique

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Research Journal of Applied Sciences, Engineering and Technology

Lead the way for us

Similar Papers

An Analysis of students’ performance using classification algorithms
Mrs M.S Mythili ... Dr A.R.Mohamed Shanavas
IOSR Journal of Computer Engineering | VOL. 16
Mrs M.S Mythili, et. al.Mrs M.S Mythili ... Dr A.R.Mohamed Shanavas
01 Jan 2014
IOSR Journal of Computer Engineering | VOL. 16

A Novel Approach for Association Rule Mining using Pattern Generation
Deepa S Deshpande
International Journal of Information Technology and Computer Science | VOL. 6
Deepa S DeshpandeDeepa S Deshpande
08 Oct 2014
International Journal of Information Technology and Computer Science | VOL. 6

The Development of Data Warehouse and Data Mining System for Serious Mental Illness with High Risk to Violence (SMI-V) Psychiatric Patients: A Case Study of Thailand
Phichayasini Kitwatthanathawon ... Prachasan Vaenthaisong
Global Conference on Business and Social Sciences Proceeding | VOL. 15
Phichayasini Kitwatthanathawon, et. al.Phichayasini Kitwatthanathawon ... Prachasan Vaenthaisong
14 Sep 2023
Global Conference on Business and Social Sciences Proceeding | VOL. 15

Multi Filtration Feature Selection (MFFS) to improve discriminatory ability in clinical data set
S Sasikala ... S Geetha
Applied Computing and Informatics | VOL. 12
S Sasikala, et. al.S Sasikala ... S Geetha
05 Apr 2014
Applied Computing and Informatics | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Improving Efficiency of Classification using PCA and Apriori based Attribute Selection Technique

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Research Journal of Applied Sciences, Engineering and Technology