Abstract

The process of extraction of software entities such as system, use case, and actor from an English natural language description of a user’s software requirements is a linguistic and semantic process of a natural language processing application. Entity extraction is known to be a complicated and challenging problem by researchers in the fields of linguistics or computation, due to the ambiguities in natural languages. This paper presents a named entity recognition method called SyAcUcNER (System Actor Use-Case Named Entity Recognizer), for extracting the system, actor, and use case entities from unstructured English descriptions of user requirements for the software. SyAcUcNER uses one of the Machine Learning (ML) approaches, that is, the Support Vector Machine (SVM) as an effective classifier. Also, SyAcUcNER uses a semantic role labeling process to tag the words in the text of user software requirements. SyAcUcNER is the first work that defines the structure of a requirements engineering specialized NER, the first work that uses a specialized NER model as an approach for extracting actor and use case entities from English language requirements description, and the first time an SVM has been used to specify the semantic meanings of words in a certain domain of discourse; that is the Software Requirements Specification (SRS). The performance of SyAcUcNER, which utilizes WEKA’s SVM, is evaluated using a binomial technique, and the results gained from running SyAcUcNER on text corpora from assorted sources give weighted averages of 76.2% for precision, 76% for recall, and 72.1% for the F-measure.

Highlights

  • The system, use case, and actor are the main entities of the Software Requirements Specification (SRS), which is an unformatted Natural Language (NL) text description of a system

  • The extracting of these entities is considered the first step in the development of desired information system, as the actors are the individuals that use the system like humans, external software, etc., in which each actor has certain roles, and the use cases are used to (1) identify the functional requirements of the developed system that would be used by actors, (2) design the system's architecture, (3) control the implementation of the system, (4) verify and validate the developed system via generating test cases [1]

  • Because of the problems that Artificial Neural Networks (ANN) has, this paper proposes the use of another Machine Learning (ML) method that is the Support Vector Machine (SVM) method to automatically extract system, actor, and use case entities from unstructured NL requirements text documents in English

Read more

Summary

INTRODUCTION

The system, use case, and actor are the main entities of the Software Requirements Specification (SRS), which is an unformatted Natural Language (NL) text description of a system. To facilitate and speed up the performing of extracting the SRS elements from an unstructured and natural languageformed user requirement text, a set of solutions have been proposed to automate this process. Because of the problems that ANN has, this paper proposes the use of another ML method that is the Support Vector Machine (SVM) method to automatically extract system, actor, and use case entities from unstructured NL requirements text documents in English. SVM works stably and it generalizes well to data not included in the training data set or that data that its features would be changed This is because the SVM classification approach is principally reliant on a subset of points only in its work to maximize the gap (margin) between nearby points of classes.

BACKGROUND
RELATED WORKS AND APPROACHES
PROPOSED SYACUCNER APPROACH
NL Functional Requirements
Annotation of the Tokens of the Sentence
Set of Tokens with Annotation
SVM Data Mining Model
RESULTS AND EVALUATION
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.