Code Smell Detection Research Articles

Code review plays an important role in software quality control. A typical review process involves a careful check of a piece of code in an attempt to detect and locate defects and other quality issues or violations. One type of issue that may impact the quality of software is code smells, i.e., bad coding practices that may lead to defects or maintenance issues. Yet little is known about the extent to which code smells are identified during modern code review. To investigate the code smells identified in modern code review, and the actions reviewers suggest and developers take in response to them, we conducted an empirical study of code smells in code reviews by analysing reviews from four large open source projects from the OpenStack (Nova and Neutron) and Qt (Qt Base and Qt Creator) communities. We manually checked a total of 25,415 code review comments obtained by keyword search and random selection, which resulted in the identification of 1,539 smell-related reviews. These reviews allowed us to study the causes of code smells, the actions taken against identified smells, the time taken to fix them, and the reasons why developers ignored identified smells. Our analysis found that 1) code smells were not commonly identified in code reviews, 2) smells were usually caused by violations of coding conventions, 3) reviewers usually provided constructive feedback, including fixing (refactoring) recommendations to help developers remove smells, 4) developers generally followed those recommendations and actioned the changes, 5) once smells were identified by reviewers, developers usually took less than one week to fix them, and 6) the main reason developers chose to ignore identified smells was that they considered the smell not worth fixing.
Our results suggest the following: 1) developers should closely follow the coding conventions of their projects to avoid introducing code smells, 2) review-based detection of code smells is perceived by developers to be a trustworthy approach, mainly because reviews are context-sensitive (reviewers are more aware of the context of the code given that they are part of the project’s development team), and 3) program context needs to be fully considered when deciding whether to fix an identified code smell immediately.

Code smells are symptoms of wrong design decisions or coding shortcuts that may increase defect rate and decrease maintainability. Research on code smells is accelerating, focusing on code smell detection and on using code smells as defect predictors. Recent research shows that agreement on what constitutes a code smell is low even among software developers, yet several publications claim high performance for detection algorithms. This seems counterintuitive, considering that the algorithms are trained on data labeled by developers. This paper aims to investigate the possible reasons for the inconsistencies between studies in the performance of applied machine learning algorithms compared to developers. It focuses on the reproducibility of existing studies. A systematic literature review was performed among conference and journal articles published between 1999 and 2020 to assess the state of reproducibility of the research performed in those papers. A quasi-gold standard procedure was used to validate the search. Modeling process descriptions, reproduction scripts, data sets, and the techniques used for their creation were analyzed. We obtained data from 46 publications. Of these, 22 contained a detailed description of the modeling process, 17 included some reproduction data (data set, results, or scripts), and 15 used existing data sets. In most of the publications, the analyzed projects were hand-picked by the researchers. Most studies do not include any form of online reproduction package, although this has started to change recently: 8% of the analyzed studies published before 2018 included a full reproduction package, compared to 22% in 2018–2019. Studies that do include a package usually host it on a research group website or even a personal one. Dedicated archives are still rarely used for data packages. We recommend that researchers include complete reproduction packages for their studies and use well-established research data archives instead of their own websites.
• Full reproduction package was included in 22% of the studies in 2018–2019
• Model description was inadequate in 28% of studies before 2018 and 43% in 2018–2019
• 33% of reviewed studies use an existing data set (most common strategy)
• Custom data set creation is usually informal. Criteria are usually vague

Articles published on Code Smell Detection

What really changes when developers intend to improve their source code: a commit-level study of static metric value and static analysis warning changes

On the Assessment of Interactive Detection of Code Smells in Practice: A Controlled Experiment

Metric-based rule optimizing system for code smell detection using Salp Swarm and Cockroach Swarm algorithm

Hybrid Model with Multi-Level Code Representation for Multi-Label Code Smell Detection (077)

Code Smell Detection Using Ensemble Machine Learning Algorithms

An empirical study of Android behavioural code smells detection

DeleSmell: Code smell detection based on deep learning and latent semantic analysis

Code smells detection via modern code review: a study of the OpenStack and Qt communities

Handling uncertainty in SBSE: a possibilistic evolutionary approach for code smells detection

Deep convolutional neural network model for bad code smells detection based on oversampling method

Automatic detection of Long Method and God Class code smells through neural source code embeddings

HBSniff: A static analysis tool for Java Hibernate object-relational mapping code smell detection

How far are we from reproducible research on code smell detection? A systematic literature review

Crowdsmelling: A preliminary study on using collective knowledge in code smells detection

On the adequacy of static analysis warnings with respect to code smell prediction

Exploring the relationship between refactoring and code debt indicators

Feature reduction techniques based code smell prediction

Improving performance with hybrid feature selection and ensemble machine learning techniques for code smell detection

MARS: Detecting brain class/method code smell based on metric–attention mechanism and residual network

Code smell detection using feature selection and stacking ensemble: An empirical investigation
