Abstract

As software systems grow increasingly complex, identifying bugs and defects becomes pivotal for ensuring a seamless user experience and averting costly post-release issues. This study addresses this need by focusing on the application of active learning methods to code defect prediction. The investigation examines the efficacy of active learning combined with ensemble methods, leveraging the dynamic selection and labeling of training instances to improve model performance while reducing the demand for exhaustive labeling effort. Various traditional and ensemble methods are deployed with diverse query strategies (uncertainty, margin, and entropy sampling) to assess whether the active variants can rival the original approaches while substantially shrinking the training set. Evaluation encompasses classical classification metrics (AUC, Kappa, and MCC), supplemented by a proposed easy-to-interpret performance index that accounts not only for the traditional metric outcomes but also for the percentage of the initial dataset utilized, reflecting the dual nature of the problem. The results, presented through graphical representations and statistical tests, reveal the advantage of the active methods, which reduce the initial training set by at least 75% in approximately 64% of cases.
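The exact pipeline used in the study is not detailed in the abstract, but the three query strategies it names (uncertainty, margin, and entropy sampling) have standard definitions. The sketch below illustrates them under the assumption of a scikit-learn-style classifier exposing `predict_proba`; the pool, batch size, and `active_learning_round` loop are hypothetical and only meant to show how instances would be selected for labeling.

```python
import numpy as np

def uncertainty_sampling(probs: np.ndarray, n: int) -> np.ndarray:
    """Pick the n instances whose most likely class has the lowest probability."""
    uncertainty = 1.0 - probs.max(axis=1)
    return np.argsort(uncertainty)[-n:]

def margin_sampling(probs: np.ndarray, n: int) -> np.ndarray:
    """Pick the n instances with the smallest gap between the top two class probabilities."""
    sorted_probs = np.sort(probs, axis=1)
    margins = sorted_probs[:, -1] - sorted_probs[:, -2]
    return np.argsort(margins)[:n]

def entropy_sampling(probs: np.ndarray, n: int) -> np.ndarray:
    """Pick the n instances with the highest predictive entropy."""
    entropy = -np.sum(probs * np.log(probs + 1e-12), axis=1)
    return np.argsort(entropy)[-n:]

def active_learning_round(model, X_labeled, y_labeled, X_pool, y_pool,
                          query_fn, batch_size=10):
    """One hypothetical active-learning iteration: fit, query, move instances
    from the unlabeled pool into the labeled set (the oracle here is y_pool)."""
    model.fit(X_labeled, y_labeled)
    probs = model.predict_proba(X_pool)
    idx = query_fn(probs, batch_size)
    X_labeled = np.vstack([X_labeled, X_pool[idx]])
    y_labeled = np.concatenate([y_labeled, y_pool[idx]])
    X_pool = np.delete(X_pool, idx, axis=0)
    y_pool = np.delete(y_pool, idx, axis=0)
    return model, X_labeled, y_labeled, X_pool, y_pool
```

In such a loop, `model` could be any of the traditional or ensemble classifiers mentioned in the abstract; repeating the round until a budget is exhausted yields the reduced training sets whose size the proposed performance index takes into account.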
