Concept extraction from business documents for software engineering projects

Pierre André Ménard,Sylvie Ratté

doi:10.1007/s10515-015-0184-4

Abstract

Acquiring relevant business concepts is a crucial first step for any software project for which the software experts are not domain experts. The wealth of information buried within an organization's written documentation is a precious source of concepts, relationships and attributes which can be used to model the enterprise's domain. The lack of targeted extraction tools can make perusing through this type of resource a lengthy and costly process. We propose a domain model focused extraction process aimed at the rapid discovery of knowledge relevant to the software expert. To avoid undesirable noise from high-level linguistic tools, the process is mainly composed of positive and negative base filters that are less error prone and more robust. The extracted candidates are then reordered using a weight propagation algorithm based on structural hints from source documents. When tested on French text corpora from public organizations, our process performs 2.7 times better than a statistical baseline for relevant concept discovery. A new metric to assess the performance discovery speed of relevant concepts is introduced. The annotation of a gold standard definition of software engineering oriented concepts for knowledge extraction tasks is also presented.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Concept extraction from business documents for software engineering projects

Abstract

Talk to us

Similar Papers

More From: Automated Software Engineering

Lead the way for us

Journal: Automated Software Engineering	Publication Date: Aug 21, 2015
Citations: 14

Similar Papers

Enhancing relevant concepts extraction for ontology learning using domain time relevance
Fatima N Al-Aswadi ... Wafa’ Za'Al Alma'Aitah
Information Processing & Management | VOL. 60
Fatima N Al-Aswadi, et. al.Fatima N Al-Aswadi ... Wafa’ Za'Al Alma'Aitah
08 Nov 2022
Information Processing & Management | VOL. 60

3 - Software Engineering for Model-Based Development by Domain Experts
M Bialy ... A Wassyng
Handbook of System Safety and Security | VOL. -
M Bialy, et. al.M Bialy ... A Wassyng
14 Oct 2016
Handbook of System Safety and Security | VOL. -

Text-mined Data Improves Machine Learning Predictions for Detecting Inborn Errors of Immunity
Nicholas Rider ... Kirk Roberts
Clinical Immunology | VOL. 250
Nicholas Rider, et. al.Nicholas Rider ... Kirk Roberts
01 May 2023
Clinical Immunology | VOL. 250

Corporate Governance Issues in the Public Sector: Board Perspective and Peculiarities
Peter Yao Lartey ... Fatoumata Binta Maci Bah
Brazilian Journal of Operations & Production Management | VOL. 17
Peter Yao Lartey, et. al.Peter Yao Lartey ... Fatoumata Binta Maci Bah
01 Jan 2020
Brazilian Journal of Operations & Production Management | VOL. 17

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Concept extraction from business documents for software engineering projects

Abstract

Talk to us

Similar Papers

More From: Automated Software Engineering