An algorithm acceleration framework for correlation-based feature selection

Xuefeng Yan,Yuqing Zhang,Arif Ali Khan,I Barukčić

doi:10.1051/matecconf/202133607011

Abstract

Repeated calculations lead to a sharp increase in the time of correlation-based feature selection. Incremental iteration has been applied in some algorithms to improve the efficiency. However, the computational efficiency of correlation has generally be ignored. An algorithm acceleration framework for correlation-based feature selection (AFCFS) is proposed. In AFCFS, the criterion of the feature selection will be analyzed and reconstructed based on entropy granularity, and the algorithm structure will also be adjusted accordingly. Specifically, all repeated part of calculation will be saved in mapping tables and can be accessed in next time directly, so as to further reduce the calculation repetition rate and improve the efficiency. The experimental results show that AFCFS can greatly reduce the cost time of these algorithms, and keep the corresponding classification accuracy basically unchanged.

Highlights

Correction-based feature selection has been widely used in software defect prediction to construct the feature subset due to its simple principle and good stability
A few researchers have realized the expensive time cost of the feature selection based on correlation and optimized their algorithm structure such as Feature selection with redundancy-complementariness dispersion (RCDFS)[2], Interaction Weight based Feature Selection algorithm (IWFS) [3], and fast greedy feature selection algorithm (FGS_KDE) [4].most of them only optimize the number of iterations by incremental iteration or weight update, and pay little attention on the calculation process of correlation itself
In order to avoid the influence of classifiers, two typical classifier algorithms, K-Nearest Neighbor (KNN) and Naive Bayes (NB) classifiers, are used

Summary

Introduction

Correction-based feature selection has been widely used in software defect prediction to construct the feature subset due to its simple principle and good stability. A few researchers have realized the expensive time cost of the feature selection based on correlation and optimized their algorithm structure such as Feature selection with redundancy-complementariness dispersion (RCDFS)[2], Interaction Weight based Feature Selection algorithm (IWFS) [3], and fast greedy feature selection algorithm (FGS_KDE) [4].most of them only optimize the number of iterations by incremental iteration or weight update, and pay little attention on the calculation process of correlation itself. The repeated calculations can be avoided to the greatest extent, so as to improve the operating efficiency without changing the performance of the algorithm itself

Correlation-based feature selection

Optimization on number of iterations

Optimization on correlation calculations

Experiment design

Experiment result and analysis

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An algorithm acceleration framework for correlation-based feature selection

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: MATEC Web of Conferences

Lead the way for us

Journal: MATEC Web of Conferences	Publication Date: Jan 1, 2021
License type: CC BY 4.0

Similar Papers

A New Swarm-Based Framework for Handwritten Authorship Identification in Forensic Document Analysis
Satrya Fajri Pratama ... Azah Kamilah Muda
-
Satrya Fajri Pratama, et. al.Satrya Fajri Pratama ... Azah Kamilah Muda
01 Jan 2014
01 Jan 2014

A New Framework for Automatic Feature Selection for Tracking
Ming Z Zhang ... Vijayan K Asari
-
Ming Z Zhang, et. al.Ming Z Zhang ... Vijayan K Asari
01 Aug 2007
01 Aug 2007

A framework for cost-based feature selection
V Bolón-Canedo ... A Alonso-Betanzos
Pattern Recognition | VOL. 47
V Bolón-Canedo, et. al.V Bolón-Canedo ... A Alonso-Betanzos
28 Jan 2014
Pattern Recognition | VOL. 47

A joint feature selection framework for multivariate resource usage prediction in cloud servers using stability and prediction performance
Shaifu Gupta ... A D Dileep
The Journal of Supercomputing | VOL. 74
Shaifu Gupta, et. al.Shaifu Gupta ... A D Dileep
27 Jul 2018
The Journal of Supercomputing | VOL. 74

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An algorithm acceleration framework for correlation-based feature selection

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: MATEC Web of Conferences