One- and Two-Phase Software Requirement Classification Using Ensemble Deep Learning.

Nouf Rahimi,Lamiaa Elrefaei,Fathy Eassa

doi:10.3390/e23101264

Nouf Rahimi, Lamiaa Elrefaei + Show 1 more

Open Access

https://doi.org/10.3390/e23101264

Copy DOI

Journal: Entropy (Basel, Switzerland)	Publication Date: Sep 28, 2021
Citations: 10	License type: CC BY 4.0

Affiliation: Benha University, Jeddah University

Abstract

Recently, deep learning (DL) has been utilized successfully in different fields, achieving remarkable results. Thus, there is a noticeable focus on DL approaches to automate software engineering (SE) tasks such as maintenance, requirement extraction, and classification. An advanced utilization of DL is the ensemble approach, which aims to reduce error rates and learning time and improve performance. In this research, three ensemble approaches were applied: accuracy as a weight ensemble, mean ensemble, and accuracy per class as a weight ensemble with a combination of four different DL models—long short-term memory (LSTM), bidirectional long short-term memory (BiLSTM), a gated recurrent unit (GRU), and a convolutional neural network (CNN)—in order to classify the software requirement (SR) specification, the binary classification of SRs into functional requirement (FRs) or non-functional requirements (NFRs), and the multi-label classification of both FRs and NFRs into further experimental classes. The models were trained and tested on the PROMISE dataset. A one-phase classification system was developed to classify SRs directly into one of the 17 multi-classes of FRs and NFRs. In addition, a two-phase classification system was developed to classify SRs first into FRs or NFRs and to pass the output to the second phase of multi-class classification to 17 classes. The experimental results demonstrated that the proposed classification systems can lead to a competitive classification performance compared to the state-of-the-art methods. The two-phase classification system proved its robustness against the one-phase classification system, as it obtained a 95.7% accuracy in the binary classification phase and a 93.4% accuracy in the second phase of NFR and FR multi-class classification.

Highlights

Software requirement (SR) classification is carried out by software specialists to directly determine the requirements that they need or are interested in
The two-phase classification system proved its robustness against the one-phase classification system, as it obtained a 95.7% accuracy in the binary classification phase and a 93.4% accuracy in the second phase of non-functional requirements (NFRs) and functional requirements (FRs) multi-class classification
The results showed that the convolutional neural network (CNN) classifier scored recall values of 0.945, 0.911, and 0.772 for the binary option of FRs or NFRs, the bidirectional encoder representations from transformers binary option of being security-related or not, and for NFR multi-label classification, respectively

Summary

Introduction

Software requirement (SR) classification is carried out by software specialists to directly determine the requirements that they need or are interested in. User interface designers are probably interested in “look and feel” requirements. The manual labeling or classification of each requirement to determine which category it belongs to is expensive and requires experts in different fields, which results in high costs. This provides motivation for finding a complete system with high performance to label and classify SR into functional requirements (FRs) or non-functional requirement (NFRs), as well as both FRs and NFRs into further classes [1].

Objectives

Results

Conclusion