Abstract

Cross-project fault prediction (CPFP) is a current field of research in the realm of software engineering. CPFP comes into play when there is a scarcity of within-project training data. In particular, it involves constructing a fault prediction model for software project ‘X’ using the defect/fault data of software project ‘Y’. However, the distribution dissimilarity between the two project's data creates a bottleneck in its success. Many existing approaches addressed this issue by selecting relevant instances from the training data without giving any attention to feature selection (FS). Thus, to assess the power of FS for effective CPFP, we investigated two search-based FS algorithms namely Binary Genetic Algorithm (BGA) and Binary Particle Swarm Optimization (BPSO) algorithm. We performed 26 CPFP experiments based on 8 software projects and compared their performance with a CPFP model (ALL_CPFP), built with all features. Although both BPSO _CPFP and BGA _CPFP showed their potential over ALL_CPFP, BPSO_CPFP performed better than BGA_CPFP in capturing the important features for effective CPFP.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.