Abstract
The domain ontology, which plays a significant role in knowledge-based systems, still needs the manual work of domain experts to be constructed currently. The main motivation of this paper is to provide a semi-automatic platform which can construct fairly comprehensive domain ontology from unstructured data. Meanwhile, a brief QA System is proposed to simplify the interaction with the domain experts. A novel algorithm MPVW, which extends from the classical algorithm TF-IDF, is proposed to extract the terminologies from domain documents. MPVW balanced more parameters and factors to evaluate the feature of terminologies. The 3-layers taxonomy and terminology hyponymy height provide sufficient guide and prompt for domain experts to construct ontology from terminologies. According to our approach we have developed ROCP, a rapid ontology construction platform which has been applied in the space debris mitigation domain. The experimental data indicates that ROCP has sufficient accuracy to extract terminologies. Meanwhile, it is effective to relieve the labor of domain experts to construct domain ontology.
Highlights
The data integration brings great convenience for knowledge acquisition and association in many domains
We have proposed an approach to establish a rapid domain ontology construction platform ROCP
ROCP uses a QA mechanism to enable domain experts to achieve the ontology construction from unstructured data, which consists of two main steps as follows
Summary
The data integration brings great convenience for knowledge acquisition and association in many domains. In our previous work (Zhao et al, 2016), we proposed a method of the semi-automatic mapping between a domain database and an existing ontology. There still exist many difficulties in the process of the semi-automatic ontology construction from unstructured data, e.g. Zhao et al: ROCP ontologies are constructed by domain-informatics experts such as the MaterialInformation Ontology (Ashino, 2010) This communication, just as the server and the client, needs a standard “protocol” to ensure efficiency and reliability. The Terminology Hyponymy Height (THH) algorithm and 3-layers nodes taxonomy are proposed for ontology construction. These methods can provide clear guidelines and relieve the labor of domain experts.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.