An Active Learning Approach for Improving the Accuracy of Automated Domain Model Extraction

Chetan Arora,Mehrdad Sabetzadeh,Shiva Nejati,Lionel Briand

doi:10.1145/3293454

Chetan Arora, Mehrdad Sabetzadeh + Show 2 more

Open Access

https://doi.org/10.1145/3293454

Copy DOI

Abstract

Domain models are a useful vehicle for making the interpretation and elaboration of natural-language requirements more precise. Advances in natural-language processing (NLP) have made it possible to automatically extract from requirements most of the information that is relevant to domain model construction. However, alongside the relevant information, NLP extracts from requirements a significant amount of information that is superfluous (not relevant to the domain model). Our objective in this article is to develop automated assistance for filtering the superfluous information extracted by NLP during domain model extraction. To this end, we devise an active-learning-based approach that iteratively learns from analysts’ feedback over the relevance and superfluousness of the extracted domain model elements and uses this feedback to provide recommendations for filtering superfluous elements. We empirically evaluate our approach over three industrial case studies. Our results indicate that, once trained, our approach automatically detects an average of ≈ 45% of the superfluous elements with a precision of ≈ 96%. Since precision is very high, the automatic recommendations made by our approach are trustworthy. Consequently, analysts can dispose of a considerable fraction – nearly half – of the superfluous elements with minimal manual work. The results are particularly promising, as they should be considered in light of the non-negligible subjectivity that is inherently tied to the notion of relevance.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: ACM Transactions on Software Engineering and Methodology	Publication Date: Jan 9, 2019
Citations: 25	License type: other-oa

R Discovery Prime

R Discovery Prime

An Active Learning Approach for Improving the Accuracy of Automated Domain Model Extraction

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Software Engineering and Methodology

Lead the way for us

Similar Papers

Advancements in natural language processing: Implications, challenges, and future directions
Supriyono ... Fachrul Kurniawan
Telematics and Informatics Reports | VOL. 16
Supriyono, et. al. Supriyono ... Fachrul Kurniawan
07 Nov 2024
Telematics and Informatics Reports | VOL. 16

Generating Abstract Test Cases from User Requirements using MDSE and NLP
Sai Chaithra Allala ... Peter J Clarke
-
Sai Chaithra Allala, et. al.Sai Chaithra Allala ... Peter J Clarke
01 Dec 2022
01 Dec 2022

Text to software
Walter F Tichy ... Sven J Koerner
-
Walter F Tichy, et. al.Walter F Tichy ... Sven J Koerner
07 Nov 2010
07 Nov 2010

Proceedings of the ACL-2000 workshop on Recent advances in natural language processing and information retrieval held in conjunction with the 38th Annual Meeting of the Association for Computational Linguistics -
-
-
--
01 Jan 1999
01 Jan 1999

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An Active Learning Approach for Improving the Accuracy of Automated Domain Model Extraction

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Software Engineering and Methodology