Abstract

Feature selection (FS) is an important preprocessing step in machine learning and data mining. In this paper, a new feature subset evaluation method is proposed that constructs a sample graph (SG) over different k-feature subsets and applies community modularity to select features that are highly informative as a group, even though they may not be relevant individually. The relevant independency among the selected features, rather than their irrelevant redundancy, is effectively measured by the community modularity Q value of the sample graph built on the k features. On this basis, an efficient FS method called k-features sample graph feature selection is presented. A key property of this approach is that the discriminative cues of a feature subset with the maximum relevant independency among features can be accurately determined. This community modularity-based method is then verified against the theory of k-means clustering. As the results of several experiments show, the proposed approach is more effective than other state-of-the-art methods.
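The abstract does not spell out the exact graph construction, so the following is only a minimal sketch of the core idea of scoring a candidate feature subset by the community modularity Q of a sample graph. The k-nearest-neighbor graph, the Euclidean distance, the use of class labels as the community assignment, and the greedy forward search are illustrative assumptions, not the paper's exact procedure.

```python
import numpy as np


def sample_graph_modularity(X, y, feature_subset, k=5):
    """Score a candidate feature subset: build a k-NN sample graph on the
    selected features and compute Newman's modularity Q, treating the class
    labels as the community assignment (illustrative sketch only)."""
    Xs = X[:, feature_subset]                     # samples restricted to the subset
    n = Xs.shape[0]

    # Pairwise Euclidean distances in the reduced feature space
    dist = np.linalg.norm(Xs[:, None, :] - Xs[None, :, :], axis=2)
    np.fill_diagonal(dist, np.inf)                # exclude self-neighbors

    # Symmetric k-nearest-neighbor adjacency matrix over the samples
    A = np.zeros((n, n))
    for i in range(n):
        for j in np.argsort(dist[i])[:k]:
            A[i, j] = A[j, i] = 1.0

    # Modularity Q = (1/2m) * sum_ij [A_ij - k_i*k_j/(2m)] * delta(c_i, c_j)
    degrees = A.sum(axis=1)
    two_m = A.sum()
    same_class = (y[:, None] == y[None, :])
    return ((A - np.outer(degrees, degrees) / two_m) * same_class).sum() / two_m


def greedy_modularity_selection(X, y, num_features, k=5):
    """Greedy forward search (hypothetical helper): repeatedly add the feature
    whose inclusion yields the largest modularity Q of the sample graph."""
    selected, remaining = [], list(range(X.shape[1]))
    while len(selected) < num_features and remaining:
        best = max(remaining,
                   key=lambda f: sample_graph_modularity(X, y, selected + [f], k))
        selected.append(best)
        remaining.remove(best)
    return selected
```

The hypothetical greedy_modularity_selection helper merely shows how such a subset score could drive a forward search; any other subset search strategy could be plugged in instead.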

Highlights

  • Feature selection (FS) is widely investigated and utilized in machine learning and data mining research

  • The proposed approach was compared with several popular FS algorithms, including MIFS-U, mRMR, CMIM, Fisher score, Laplacian score[33], RELIEF[62], Simba-sig[63], and Greedy Feature Flip (G-Flip-sig)[63]

  • To address the redundancy problem of ranking-based filter methods, the sample graph over k features, which captures the relevant independency among feature subsets, is utilized rather than conditional mutual information (MI) criteria (a minimal MI-ranking sketch follows this list for contrast)
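For contrast with the subset-level evaluation sketched under the abstract, the kind of univariate ranking this highlight refers to can be illustrated with a simple mutual-information filter. scikit-learn's mutual_info_classif is used here as a stand-in for the MIFS-U / mRMR / CMIM family; it is not the paper's method.

```python
import numpy as np
from sklearn.feature_selection import mutual_info_classif


def mi_ranking(X, y, num_features):
    """Univariate mutual-information filter: score each feature against the
    labels independently and keep the top-ranked ones. Because features are
    scored one at a time, several top-ranked features may carry the same
    information -- the redundancy problem that subset-level evaluation avoids."""
    scores = mutual_info_classif(X, y)            # MI of each feature with y
    return np.argsort(scores)[::-1][:num_features]
```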

Introduction

Feature selection (FS) is widely investigated and utilized in machine learning and data mining research. In this context, a feature, also called an attribute or a variable, represents a property of a process or system. The goal of FS is to select subsets of informative features to build models that describe the data and to eliminate redundant or irrelevant noise features so as to improve predictive accuracy[1]. FS maintains the original intrinsic properties of the selected features and facilitates data visualization and understanding[2]. FS has been extensively applied in many areas, such as bioinformatics[3], image retrieval[4], and text classification[5], because of its capabilities

