God Class Research Articles

Code smells indicate potential symptoms or problems in software due to inefficient design or incomplete implementation. These problems can affect software quality in the long-term. Code smell detection is fundamental to improving software quality and maintainability, reducing software failure risk, and helping to refactor the code. Previous works have applied several prediction methods for code smell detection. However, many of them show that machine learning (ML) and deep learning (DL) techniques are not always suitable for code smell detection due to the problem of imbalanced data. So, data imbalance is the main challenge for ML and DL techniques in detecting code smells. To overcome these challenges, this study aims to present a method for detecting code smell based on DL algorithms (Bidirectional Long Short-Term Memory (Bi-LSTM) and Gated Recurrent Unit (GRU)) combined with data balancing techniques (random oversampling and Tomek links) to mitigate data imbalance issue. To establish the effectiveness of the proposed models, the experiments were conducted on four code smells datasets (God class, data Class, feature envy, and long method) extracted from 74 open-source systems. We compare and evaluate the performance of the models according to seven different performance measures accuracy, precision, recall, f-measure, Matthew’s correlation coefficient (MCC), the area under a receiver operating characteristic curve (AUC), the area under the precision–recall curve (AUCPR) and mean square error (MSE). After comparing the results obtained by the proposed models on the original and balanced data sets, we found out that the best accuracy of 98% was obtained for the Long method by using both models (Bi-LSTM and GRU) on the original datasets, the best accuracy of 100% was obtained for the long method by using both models (Bi-LSTM and GRU) on the balanced datasets (using random oversampling), and the best accuracy 99% was obtained for the long method by using Bi-LSTM model and 99% was obtained for the data class and Feature envy by using GRU model on the balanced datasets (using Tomek links). The results indicate that the use of data balancing techniques had a positive effect on the predictive accuracy of the models presented. The results show that the proposed models can detect the code smells more accurately and effectively.

Read full abstract

Background: Continuous modifications, suboptimal software design practices, and stringent project deadlines contribute to the proliferation of code smells. Detecting and refactoring these code smells are pivotal to maintaining complex and essential software systems. Neglecting them may lead to future software defects, rendering systems challenging to maintain, and eventually obsolete. Supervised machine learning techniques have emerged as valuable tools for classifying code smells without needing expert knowledge or fixed threshold values. Further enhancement of classifier performance can be achieved through effective feature selection techniques and the optimization of hyperparameter values. Aim: Performance measures of multiple machine learning classifiers are improved by fine tuning its hyperparameters using various type of meta-heuristic algorithms including swarm intelligent, physics, math, and bio-based etc. Their performance measures are compared to find the best meta-heuristic algorithm in the context of code smell detection and its impact is evaluated based on statistical tests. Method: This study employs sixteen contemporary and robust meta-heuristic algorithms to optimize the hyperparameters of two machine learning algorithms: Support Vector Machine (SVM) and k-nearest Neighbors (k-NN). The No Free Lunch theorem underscores that the success of an optimization algorithm in one application may not necessarily extend to others. Consequently, a rigorous comparative analysis of these algorithms is undertaken to identify the best-fit solutions for code smell detection. A diverse range of optimization algorithms, encompassing Arithmetic, Jellyfish Search, Flow Direction, Student Psychology Based, Pathfinder, Sine Cosine, Jaya, Crow Search, Dragonfly, Krill Herd, Multi-Verse, Symbiotic Organisms Search, Flower Pollination, Teaching Learning Based, Gravitational Search, and Biogeography-Based Optimization, have been implemented. Results: In the case of optimized SVM, the highest attained accuracy, AUC, and F-measure values are 98.75%, 100%, and 98.57%, respectively. Remarkably, significant increases in accuracy and AUC, reaching 32.22% and 45.11% respectively, are observed. For k-NN, the best accuracy, AUC, and F-measure values are all perfect at 100%, with noteworthy hikes in accuracy and ROC-AUC values, amounting to 43.89% and 40.83%, respectively. Conclusion: Optimized SVM exhibits exceptional performance with the Sine Cosine Optimization algorithm, while k-NN attains its peak performance with the Flower Optimization algorithm. Statistical analysis underscores the substantial impact of employing meta-heuristic algorithms for optimizing machine learning classifiers, enhancing their performance significantly. Optimized SVM excels in detecting the God Class, while optimized k-NN is particularly effective in identifying the Data Class. This innovative fusion automates the tuning process and elevates classifier performance, simultaneously addressing multiple longstanding challenges.

Read full abstract

God Class Research Articles

Related Topics

Articles published on God Class

Improving accuracy of code smells detection using machine learning with data balancing techniques

Optimizing LSTM for Code Smell Detection: The Role of Data Balancing

Boosting and Comparing Performance of Machine Learning Classifiers with Meta-heuristic Techniques to Detect Code Smell

Code smells in pull requests: An exploratory study

A study of dealing class imbalance problem with machine learning methods for code smell severity detection using PCA-based feature selection technique

A Systematic Literature Review on the Code Smells Datasets and Validation Mechanisms

Code Smell Detection Using Ensemble Machine Learning Algorithms

Prioritization of god class design smell: A multi-criteria based approach

Deep convolutional neural network model for bad code smells detection based on oversampling method

Automatic detection of Long Method and God Class code smells through neural source code embeddings

Crowdsmelling: A preliminary study on using collective knowledge in code smells detection

A comparison of machine learning algorithms on design smell detection using balanced and imbalanced dataset: A study of God class

Approach of God Class Detection Based on Evolutionary and Semantic Features

MARS: Detecting brain class/method code smell based on metric–attention mechanism and residual network

Code Smell Identification As The Basis For Code Refactoring in The Agricultural Information System Portal Case Study at: Gilangharjo Village, Bantul Regency, Indonesia

Exploratory study of the impact of project domain and size category on the detection of the God class design smell

Code Smells Detection and Visualization: A Systematic Literature Review

Analysing Agreement Among Different Evaluators in God Class and Feature Envy Detection

Технический долг в жизненном цикле разработки ПО: запахи кода

God Class Refactoring Recommendation and Extraction Using Context based Grouping

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

God Class Research Articles

Related Topics

Articles published on God Class

Improving accuracy of code smells detection using machine learning with data balancing techniques

Optimizing LSTM for Code Smell Detection: The Role of Data Balancing

Boosting and Comparing Performance of Machine Learning Classifiers with Meta-heuristic Techniques to Detect Code Smell

Code smells in pull requests: An exploratory study

A study of dealing class imbalance problem with machine learning methods for code smell severity detection using PCA-based feature selection technique

A Systematic Literature Review on the Code Smells Datasets and Validation Mechanisms

Code Smell Detection Using Ensemble Machine Learning Algorithms

Prioritization of god class design smell: A multi-criteria based approach

Deep convolutional neural network model for bad code smells detection based on oversampling method

Automatic detection of Long Method and God Class code smells through neural source code embeddings

Crowdsmelling: A preliminary study on using collective knowledge in code smells detection

A comparison of machine learning algorithms on design smell detection using balanced and imbalanced dataset: A study of God class

Approach of God Class Detection Based on Evolutionary and Semantic Features

MARS: Detecting brain class/method code smell based on metric–attention mechanism and residual network

Code Smell Identification As The Basis For Code Refactoring in The Agricultural Information System Portal Case Study at: Gilangharjo Village, Bantul Regency, Indonesia

Exploratory study of the impact of project domain and size category on the detection of the God class design smell

Code Smells Detection and Visualization: A Systematic Literature Review

Analysing Agreement Among Different Evaluators in God Class and Feature Envy Detection

Технический долг в жизненном цикле разработки ПО: запахи кода

God Class Refactoring Recommendation and Extraction Using Context based Grouping