The Impact of Feature Selection Techniques on Software Defect Identification Models

Huiquan Gong, Yuwei Zhang

doi:10.1109/icsess52187.2021.9522337

Abstract

Defect identification is an important task for ensuring software quality. Recently, researchers have begun to use artificial intelligence techniques to improve the usability of static analysis (SA) tools by automatically identifying true defects among the reported SA alarms. Existing methods mainly use static code features to represent the defective code. However, irrelevant and redundant features threaten the performance of these machine learning methods. Feature selection techniques can be applied to alleviate this problem. Since many feature selection methods have been proposed, this paper conducts a rigorous experimental evaluation of the impact of feature selection techniques on defect identification and explores whether there is a smallest ratio of selected features that still yields defect identification models with acceptable performance. Additionally, this paper proposes an effective feature selection approach based on majority voting, which combines the outputs of different feature selection techniques. The experimental results on five open-source projects show that there is a best ratio (20%) for feature selection, which achieves satisfactory performance while using far fewer features for defect identification. This finding can serve as a practical guideline for software defect identification.
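The majority-voting idea mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the voting rule, so everything below is an assumption made for illustration: each technique votes for the top 20% of its own ranking, and a feature is kept when a strict majority of techniques vote for it. The technique names (`chi_square`, `info_gain`, `relief`), the feature names, and the rankings are hypothetical and are not taken from the paper.

```python
from collections import Counter


def top_k(ranking, ratio):
    """Return the top `ratio` fraction of a best-first feature ranking."""
    # Always keep at least one feature, even for very small feature sets.
    k = max(1, round(ratio * len(ranking)))
    return ranking[:k]


def majority_vote_select(rankings, ratio=0.2):
    """Keep features that a strict majority of techniques place in their top-k.

    `rankings` maps a technique name to a best-first list of feature names.
    This voting rule is an assumption, not the paper's documented procedure.
    """
    votes = Counter()
    for ranking in rankings.values():
        votes.update(top_k(ranking, ratio))
    threshold = len(rankings) / 2
    selected = [name for name, count in votes.items() if count > threshold]
    # Order by vote count (descending), then by name, for a deterministic result.
    return sorted(selected, key=lambda name: (-votes[name], name))


# Hypothetical best-first rankings of ten static code features.
rankings = {
    "chi_square": ["cyclomatic", "loc", "nesting", "fan_in", "fan_out",
                   "params", "returns", "casts", "loops", "calls"],
    "info_gain": ["loc", "cyclomatic", "fan_out", "nesting", "params",
                  "fan_in", "loops", "returns", "calls", "casts"],
    "relief": ["nesting", "cyclomatic", "loc", "loops", "fan_in",
               "calls", "fan_out", "params", "casts", "returns"],
}

selected = majority_vote_select(rankings, ratio=0.2)
print(selected)
```

With a 20% ratio and ten features, each technique votes for two features, so only features that at least two of the three techniques rank in their top two survive. Other combination rules, such as averaging ranks, would be equally plausible readings of the abstract.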

Similar Papers

A Large-Scale Study of the Impact of Feature Selection Techniques on Defect Classification Models
Baljinder Ghotra ... Shane Mcintosh
01 May 2017

A comprehensive investigation of the impact of feature selection techniques on crashing fault residence prediction models
Kunsong Zhao ... Dan Yang
Information and Software Technology | VOL. 139
01 Nov 2021

A Fraud Detection Model Based on Feature Selection and Undersampling Applied to Web Payment Systems
Rafael Franca Lima ... Adriano Cesar Machado Pereira
01 Dec 2015

The impact of feature reduction techniques on defect prediction models
Masanari Kondo ... Ahmed E Hassan
Empirical Software Engineering | VOL. 24
22 Jan 2019
