Towards using visual, semantic and structural features to improve code readability classification

Qing Mi,Yiqun Hao,Liwei Ou,Wei Ma

doi:10.1016/j.jss.2022.111454

Abstract

Context:Code readability, which correlates strongly with software quality, plays a critical role in software maintenance and evolvement. Although existing deep learning-based code readability models have reached a rather high classification accuracy, only structural features are utilized which inevitably limits their model performance. Objective:To address this problem, we propose to extract readability-related features from visual, semantic, and structural aspects from source code in an attempt to further improve code readability classification. Method:First, we convert a code snippet into a RGB matrix (for visual feature extraction), a token sequence (for semantic feature extraction) and a character matrix (for structural feature extraction). Then, we input them into a hybrid neural network that is composed of BERT, CNN, and BiLSTM for feature extraction. Finally, the extracted features are concatenated and input into a classifier to make a code readability classification. Result:A series of experiments are conducted to evaluate our method. The results show that the average accuracy could reach 85.3%, which outperforms all existing models. Conclusion:As an innovative work of extracting readability-related features automatically from visual, semantic, and structural aspects, our method is proved to be effective for the task of code readability classification.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Towards using visual, semantic and structural features to improve code readability classification

Abstract

Talk to us

Similar Papers

More From: Journal of Systems and Software

Lead the way for us

Journal: Journal of Systems and Software	Publication Date: Jul 23, 2022
Citations: 6

Similar Papers

Static and Dynamic Isolated Indian and Russian Sign Language Recognition with Spatial and Temporal Feature Detection Using Hybrid Neural Network
E Rajalakshmi ... Maxim A Bakaev
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. 22
E Rajalakshmi, et. al.E Rajalakshmi ... Maxim A Bakaev
25 Nov 2022
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. 22

The Effect of Feature Extraction Based on Dictionary Learning on ECG Signal Classification
Rahime Ceylan
International Journal of Intelligent Systems and Applications in Engineering | VOL. 1
Rahime CeylanRahime Ceylan
29 Mar 2018
International Journal of Intelligent Systems and Applications in Engineering | VOL. 1

An Approach to Semantic and Structural Features Learning for Software Defect Prediction
Shi Meilong ... Cheng Zeng
Mathematical Problems in Engineering | VOL. 2020
Shi Meilong, et. al.Shi Meilong ... Cheng Zeng
06 Apr 2020
Mathematical Problems in Engineering | VOL. 2020

Structure and semantics in OODB class specifications
J Geller ... E J Neuhold
ACM SIGMOD Record | VOL. 20
J Geller, et. al.J Geller ... E J Neuhold
01 Dec 1991
ACM SIGMOD Record | VOL. 20

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Towards using visual, semantic and structural features to improve code readability classification

Abstract

Talk to us

Similar Papers

More From: Journal of Systems and Software