Abstract

Applied to search, question answering, and semantic web of close-or-open domain, knowledge graph (KG) is known for its incompleteness subject to the rapid knowledge growing pace. Inspired by the agricultural grafting technology to fruit variety, this paper proposes a heuristic knowledge grafting strategy (HGS) for manufacturing knowledge graph (MKG) named KnowTree extending and completion with natural language processing (NLP) mining engineering cases document. Based on similarity analysis, firstly the grafting related definitions and mechanisms (completeness, relatedness, connectivity and reutilization) are defined. Then, focused on the four mechanisms, HGS takes a pair same engineering documents as input. KnowWords is built as a collection of KnowScion, and each scion is mined from engineering documents based on the SAO structure network, whose importance is evaluated by SAORank counting the in-out degree of centrality. On another hand, the KnowRoot system is designed based on the extended P & S ontology model to characterize the structure of abstract document into four sub-space of knowledge: know-what (problem), know-why (context), know-how (solution) and know-with (result), where a pre-trained language representation model K-BERT is used to classify the KnowScion candidates into the designed KnowRoot system with a fine-tuning classification task. In the knowledge grafting process, the connection unit is constructed based on the extracted domain knowledge triples of the K-BERT model, where the head element of a triple is from the KnowScion candidate set KnowWords satisfying the threshold value, the tail element is from the domain MKG to be extended, and a connection factor is used to evaluate the relationship of union combination. To the goal of knowledge reuse, the path based reasoning rules are designed for KnowTree reutilization. Finally, take the latest engineering case abstract (ECA) in whitegoods domain as resources, a case study is conducted to validate the proposed HGS strategy.

Highlights

  • Under industry 4.0, contemporary manufacturing enterprises both big and small require new generation information technologies to enable efficient production operation by reducing time and cost of building and extending knowledgebased-systems functionality and capability [1], and being able to make better decisions

  • PREPARING AND PREPROCESSING TEXT DATA: ENGINEERING CASE ABSTRACT (ECA) DOCUMENTS heuristic grafting strategy (HGS) takes a pair of case abstract documents as input, and both documents contain the same content information represented text sentences but functions, where the former ds is used for KnowScion extraction with dependency parsing based SAO and the latter dr is utilized for rootstock design with a pretrained K-BERT model, where domain knowledge has been injected from domain Knowledge graph (KG) respectively

  • WORK This paper proposes a novel HGS strategy for manufacturing knowledge graph (MKG) extending and completion based on knowledge grafting mechanisms of completeness, relatedness, connectivity and reutilization

Read more

Summary

INTRODUCTION

Under industry 4.0, contemporary manufacturing enterprises both big and small require new generation information technologies to enable efficient production operation by reducing time and cost of building and extending knowledgebased-systems functionality and capability [1], and being able to make better decisions. 2) As a common issue in open or close domain of KG that semiautomatic extending and completion focuses on knowledge extraction techniques and endures form a lack of systemic solution for knowledge extraction, integration, reasoning and reuse, which can be abstracted as heterogenous embedding vector space and knowledge noise (detailed in the Section Methodology, B). Take the last engineering case abstract (ECA) documents as source knowledge to reach completeness, SAO (subject, action, object) structure [22], [23] as a minimum dependency tree is used to parse syntactic part-of-speech (POS) information of ECA to automatically extract KnowScion, and an extended P & S (problem & solution) model [24] as a problem-solving model is used to design KnowRoot system.

THE BIONIC SIGNIFICANCES OF HGS
SIMILARITY ANALYSIS
PREPARING AND PREPROCESSING TEXT DATA
HOW TO ACHIEVE THE CONNECTIVITY OF KnowTree
CASE STUDY AND VALIDATION
VALIDATION OF THE EXTENDED KnowTree BASED ON HGS REASONING
Findings
CONCLUSION AND FUTURE WORK
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call