Abstract

Recent years have seen wide efforts in attribute selection research. Attribute selection can efficiently reduce the hypothesis space by removing irrelevant and redundant attributes, and attribute reduction of an information system is a key problem in rough set theory and its applications. In this paper, we compare the performance of attribute selection using two tools, WEKA 3.7 and ROSE2. Filter methods score a feature subset with an alternative measure instead of the error rate; this measure is chosen to be fast to compute while still capturing the usefulness of the feature set. Many filters provide a feature ranking rather than an explicit best feature subset, and the cutoff point in the ranking is chosen via cross-validation. We used search methods such as Best First and Greedy Stepwise to evaluate the suitability of a subset of features as a group. We use an internet usage data set for this purpose, and the comparison results are tabulated for the various methods of searching the solution space to eliminate irrelevant attributes. The results of this research reveal some noteworthy issues in attribute selection tools and point to better ways of identifying irrelevant attributes. Comparing the attribute reduction tools shows considerable differences between them.

Keywords: Classifications, Data Mining, Rough Set Explorer, Search Methods, Selected Attributes, WEKA
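The filter approach described above ranks each attribute by a fast-to-compute merit score rather than by classifier error. As a minimal illustrative sketch (not the paper's actual WEKA/ROSE2 implementation), the gain ratio used by rankers of this kind can be computed from scratch for discrete attributes; the toy feature names and values below are hypothetical:

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy (bits) of a sequence of discrete values."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def gain_ratio(feature_values, labels):
    """Information gain of a discrete feature over the class labels,
    normalised by the feature's own entropy (split information)."""
    n = len(labels)
    groups = {}  # class labels grouped by feature value
    for v, y in zip(feature_values, labels):
        groups.setdefault(v, []).append(y)
    # Expected class entropy after splitting on this feature
    cond = sum((len(g) / n) * entropy(g) for g in groups.values())
    gain = entropy(labels) - cond
    split_info = entropy(feature_values)
    return gain / split_info if split_info > 0 else 0.0

# Toy data: f1 predicts the class perfectly, f2 is pure noise
labels = ['yes', 'yes', 'no', 'no']
f1 = ['a', 'a', 'b', 'b']
f2 = ['x', 'y', 'x', 'y']
print(gain_ratio(f1, labels))  # 1.0 — highly relevant
print(gain_ratio(f2, labels))  # 0.0 — irrelevant
```

Ranking all attributes by such a score and keeping those above a cutoff chosen by cross-validation is one common way to turn a ranker into a subset selector.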

Highlights

  • Attribute set reduction is essentially a task to remove irrelevant and/or redundant features

  • We describe the data set with various search methods and compare the accuracy obtained by the tools on the whole dataset with that obtained on the selected attributes

  • The first step of our analysis was to reduce the high dimensionality of the data[7]. For this purpose we used the ROSE tool and the WEKA tool for attribute selection based on various search methods applied in the attribute space, as shown in Table 1 and Table 2
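The Greedy Stepwise search mentioned in the highlights explores the attribute space one step at a time. As a hedged sketch of the general forward variant (WEKA's GreedyStepwise can also search backward; the merit function and feature names below are hypothetical, not the paper's):

```python
def greedy_stepwise(features, score):
    """Greedy forward selection: starting from the empty set, repeatedly
    add the single feature that most improves the subset score; stop
    when no addition improves it. `score` maps a frozenset of feature
    names to a merit value (higher is better)."""
    selected = frozenset()
    best = score(selected)
    improved = True
    while improved:
        improved = False
        for f in features - selected:
            candidate = selected | {f}
            s = score(candidate)
            if s > best:
                best, selected, improved = s, candidate, True
    return selected, best

# Hypothetical merit: rewards the target pair, penalises extras
def merit(subset):
    target = {'age', 'hours_online'}
    return len(subset & target) - 0.5 * len(subset - target)

feats = {'age', 'hours_online', 'browser', 'os'}
print(greedy_stepwise(feats, merit))  # (frozenset({'age', 'hours_online'}), 2.0)
```

Best First search differs in that it keeps a priority queue of partially expanded subsets and may backtrack, rather than committing to each greedy step.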



Introduction

Attribute set reduction is essentially the task of removing irrelevant and/or redundant features. Irrelevant attributes can be removed without affecting learning performance. The distinction is that a redundant attribute implies the co-presence of another attribute: individually each feature is relevant, but removing one of them does not affect learning performance[1]. In this paper we use data mining methods to improve the survey results and, at the same time, compare the tools Rough Set Explorer ROSE 2.2 and WEKA. The methods help us to identify attributes that can be classified as highly useful. The parameters describe how effective internet use is and indicate the extent to which a particular student needs to be restricted from using certain web sites.
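The irrelevant/redundant distinction drawn above can be made concrete: an attribute is redundant with respect to another when its value is fully determined by that other attribute, so it adds no information. A minimal sketch, with hypothetical internet-usage feature names (not taken from the paper's data set):

```python
def is_redundant(f_a, f_b):
    """Return True if f_b is redundant given f_a, i.e. f_b's value is a
    function of f_a's value (a functional dependency f_a -> f_b)."""
    mapping = {}
    for a, b in zip(f_a, f_b):
        # If one f_a value ever maps to two different f_b values,
        # f_b carries information beyond f_a
        if mapping.setdefault(a, b) != b:
            return False
    return True

connection  = ['wifi', 'wifi', 'wired', 'wired']
is_wireless = [True, True, False, False]   # mere recoding of `connection`
daily_hours = [1, 3, 2, 3]

print(is_redundant(connection, is_wireless))  # True  — removable
print(is_redundant(connection, daily_hours))  # False — carries new information
```

In contrast, an irrelevant attribute is one whose value tells us nothing about the class, as in the zero-gain-ratio case; both kinds are targets of attribute set reduction.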

Methods and Materials
Attribute Description
Methodology
Rose 2 Tool
Gain ratio Ranker
Weka Tool
Results and Comparison

