Abstract

Knowledge Graphs (KGs), as one of the key trends which are driving the next wave of technologies, have now become a new form of knowledge representation, and a cornerstone for several applications from generic to specific industrial use cases. However, in some specific domains such as law enforcement, a real and large domain-oriented KG is often unavailable due to data privacy concerns. In such domains it is necessary to generate a synthetic KG which mimics the properties of a real KG in the domain. Although during the last two decades, a variety of graph data generators has been proposed to achieve the generation of different kinds of networks, the state-of-the-art synthetic graph data generators are not feasible to generate a realistic and synthetic KGs because KGs always contain data characteristics with specified semantics. In this work, we propose a schema-driven synthetic KG generation approach with extended graph differential dependencies (GDD <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">x</sup> ), which is an extension of the recently developed graph entity/differential dependencies that represent formal constraints for graph data to enable the generation of desired graph patterns in synthetic KG. Next, we develop an effective KG generation algorithm that employs the schema and the pre-defined GDD <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">x</sup> s. Finally, we evaluate our synthetic KG generator and compare with several state-of-the-art synthetic graph generators. The results from the experiments show that our KG generation method can generate KGs that exhibit the desired graph patterns, node attributes and degree distributions associated with each entity type in the graph's schema.

Highlights

  • Knowledge graphs (KGs) have recently emerged as a rich and flexible representation of domain knowledge

  • We extend the recently proposed graph differential dependency (GDD) [11] to include the semantics of approximate satisfaction of Right-Hand Side (RHS) function of the dependency and flexibility of sub-graph matching metrics for use as declarative rules to infer new relationships to build a variety of designated graph patterns which need to be embedded in the KGs

  • DESIGN OF EXPERIMETNS 1) METHODS OF COMPARISON We compare out synthetic KG generating methods with 3 state-of-the-art stochastic/semantic-based synthetic graph generating methods, i.e., configuration model [13], gMark [18] and Linked Data Benchmark Council (LDBC) Social Network (LSN) generator [19]

Read more

Summary

Introduction

Knowledge graphs (KGs) have recently emerged as a rich and flexible representation of domain knowledge. Nodes in these graphs represent the entities, and edges show the relationships between the entities. A major advantage of KGs is that they can encode/provide knowledge explicitly. Some KGs in specific domain, such as art [3] and cybersecurity [4], are constructed based on domain-specific knowledge base. The KGs are applied in a lot of downstream tasks, for instances, in information search [3], question answering [5], predictive analytics [6], and object detection [7]. KGs have been applied in the domain

Objectives
Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call