Comparing and experimenting machine learning techniques for code smell detection

Francesca Arcelli Fontana,Marco Zanoni,Mika V Mäntylä,Alessandro Marino

doi:10.1007/s10664-015-9378-4

Abstract

Several code smell detection tools have been developed providing different results, because smells can be subjectively interpreted, and hence detected, in different ways. In this paper, we perform the largest experiment of applying machine learning algorithms to code smells to the best of our knowledge. We experiment 16 different machine-learning algorithms on four code smells (Data Class, Large Class, Feature Envy, Long Method) and 74 software systems, with 1986 manually validated code smell samples. We found that all algorithms achieved high performances in the cross-validation data set, yet the highest performances were obtained by J48 and Random Forest, while the worst performance were achieved by support vector machines. However, the lower prevalence of code smells, i.e., imbalanced data, in the entire data set caused varying performances that need to be addressed in the future studies. We conclude that the application of machine learning to the detection of these code smells can provide high accuracy (>96 %), and only a hundred training examples are needed to reach at least 95 % accuracy.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Comparing and experimenting machine learning techniques for code smell detection

Abstract

Talk to us

Similar Papers

More From: Empirical Software Engineering

Lead the way for us

Journal: Empirical Software Engineering	Publication Date: Jun 6, 2015
Citations: 320

Similar Papers

An Exploratory Evaluation of Continuous Feedback to Enhance Machine Learning Code Smell Detection
Daniel Cruz ... Eduardo Figueiredo
-
Daniel Cruz, et. al.Daniel Cruz ... Eduardo Figueiredo
06 May 2024
06 May 2024

Machine Learning-Based Methods for Code Smell Detection: A Survey
Pravin Singh Yadav ... Manjari Gupta
Applied Sciences | VOL. 14
Pravin Singh Yadav, et. al.Pravin Singh Yadav ... Manjari Gupta
15 Jul 2024
Applied Sciences | VOL. 14

Machine learning techniques for code smell detection: A systematic literature review and meta-analysis
Muhammad Ilyas Azeem ... Qing Wang
Information and Software Technology | VOL. 108
Muhammad Ilyas Azeem, et. al.Muhammad Ilyas Azeem ... Qing Wang
05 Jan 2019
Information and Software Technology | VOL. 108

Design of testing framework for code smell detection (OOPS) using BFO algorithm
Pratiksha Sharma ... Er Arshpreet Kaur
International Journal of Engineering & Technology | VOL. 7
Pratiksha Sharma, et. al.Pratiksha Sharma ... Er Arshpreet Kaur
06 Aug 2018
International Journal of Engineering & Technology | VOL. 7

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Comparing and experimenting machine learning techniques for code smell detection

Abstract

Talk to us

Similar Papers

More From: Empirical Software Engineering