Software Requirements Classification Using Machine Learning Algorithms.

Edna Dias Canedo,Bruno Cordeiro Mendes

doi:10.3390/e22091057

Abstract

The correct classification of requirements has become an essential task within software engineering. This study shows a comparison among the text feature extraction techniques, and machine learning algorithms to the problem of requirements engineer classification to answer the two major questions “Which works best (Bag of Words (BoW) vs. Term Frequency–Inverse Document Frequency (TF-IDF) vs. Chi Squared ()) for classifying Software Requirements into Functional Requirements (FR) and Non-Functional Requirements (NF), and the sub-classes of Non-Functional Requirements?” and “Which Machine Learning Algorithm provides the best performance for the requirements classification task?”. The data used to perform the research was the PROMISE_exp, a recently made dataset that expands the already known PROMISE repository, a repository that contains labeled software requirements. All the documents from the database were cleaned with a set of normalization steps and the two feature extractions, and feature selection techniques used were BoW, TF-IDF and respectively. The algorithms used for classification were Logist Regression (LR), Support Vector Machine (SVM), Multinomial Naive Bayes (MNB) and k-Nearest Neighbors (kNN). The novelty of our work is the data used to perform the experiment, the details of the steps used to reproduce the classification, and the comparison between BoW, TF-IDF and for this repository not having been covered by other studies. This work will serve as a reference for the software engineering community and will help other researchers to understand the requirement classification process. We noticed that the use of TF-IDF followed by the use of LR had a better classification result to differentiate requirements, with an F-measure of 0.91 in binary classification (tying with SVM in that case), 0.74 in NF classification and 0.78 in general classification. As future work we intend to compare more algorithms and new forms to improve the precision of our models.

Highlights

Text classification is the attempt to organize text documents into categories based on properties and attributes belonging to each text
To answer research questions (RQ).1, we performed the conversion of the requirements into numerical vectors, each of these vectors was combined with the classification algorithms, and we looked at Bag of Words (BoW), Term Frequency and Inverse Document Frequency (TF-IDF)
We conducted three evaluations to determine the best combination of Machine Learning algorithm and feature extraction technique to classify requirements: (1) Effectiveness in binary classification of requirements; (2) effectiveness in multiclass classification of nonfunctional requirements; and (3) effectiveness in multiclass classification of requirements, including Non-Functional Requirements (NFRs) and Functional Requirements (FRs)

Summary

Introduction

Text classification is the attempt to organize text documents into categories based on properties and attributes belonging to each text. The concept may seem simple and, with a small number of documents, it is possible to analyze each document manually and get an idea of the category in which the document belongs Based on this knowledge, it is possible to group similar documents into categories or classes. It is possible to group similar documents into categories or classes It is a more challenging activity when the number of documents to be classified increases to several hundred thousand or millions. The results showed that in most cases, PROMISE_exp has improved the rankings

Objectives

Methods

Results

Conclusion