Abstract

The domain ontology, which plays a significant role in knowledge-based systems, still needs the manual work of domain experts to be constructed currently. The main motivation of this paper is to provide a semi-automatic platform which can construct fairly comprehensive domain ontology from unstructured data. Meanwhile, a brief QA System is proposed to simplify the interaction with the domain experts. A novel algorithm MPVW, which extends from the classical algorithm TF-IDF, is proposed to extract the terminologies from domain documents. MPVW balanced more parameters and factors to evaluate the feature of terminologies. The 3-layers taxonomy and terminology hyponymy height provide sufficient guide and prompt for domain experts to construct ontology from terminologies. According to our approach we have developed ROCP, a rapid ontology construction platform which has been applied in the space debris mitigation domain. The experimental data indicates that ROCP has sufficient accuracy to extract terminologies. Meanwhile, it is effective to relieve the labor of domain experts to construct domain ontology.

Highlights

  • The data integration brings great convenience for knowledge acquisition and association in many domains

  • We have proposed an approach to establish a rapid domain ontology construction platform ROCP

  • ROCP uses a QA mechanism to enable domain experts to achieve the ontology construction from unstructured data, which consists of two main steps as follows

Read more

Summary

Introduction

The data integration brings great convenience for knowledge acquisition and association in many domains. In our previous work (Zhao et al, 2016), we proposed a method of the semi-automatic mapping between a domain database and an existing ontology. There still exist many difficulties in the process of the semi-automatic ontology construction from unstructured data, e.g. Zhao et al: ROCP ontologies are constructed by domain-informatics experts such as the MaterialInformation Ontology (Ashino, 2010) This communication, just as the server and the client, needs a standard “protocol” to ensure efficiency and reliability. The Terminology Hyponymy Height (THH) algorithm and 3-layers nodes taxonomy are proposed for ontology construction. These methods can provide clear guidelines and relieve the labor of domain experts.

Related Work
Overview and Methodology
Document Validation
Ontology assembly
The accuracy comparison of algorithm MPVW and TF-IDF
The time test of the semi-automatic ontology construction
Conclusions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call